CV engineer's notes and resources
Top 18.6% on sourcepulse
This repository serves as a personal knowledge base for Computer Vision (CV) algorithm engineers, documenting their growth path and sharing notes on CV and model compression/deployment technologies. It also promotes a paid course on building a custom large model inference framework.
How It Works
The project's core is a collection of notes and resources for CV engineers, covering topics from foundational programming and machine learning to advanced areas like model compression, high-performance computing, and deployment. A significant portion highlights a custom inference framework built with Triton and PyTorch, designed for ease of use and GPU acceleration via Triton kernels, aiming to simplify CUDA programming.
Quick Start & Requirements
The repository itself is primarily a collection of notes and documentation. The associated course project, however, requires Python and PyTorch. Specific hardware requirements for running the inference framework are not detailed in the README, but performance claims suggest GPU acceleration is essential.
Highlighted Details
transformers
library.Maintenance & Community
The project is actively maintained by the author, harleyszhang, with content being continuously updated. The author also promotes a paid course and a WeChat public account ("嵌入式视觉") for further engagement.
Licensing & Compatibility
The repository's licensing is not explicitly stated in the README. The content is presented as personal notes and educational material.
Limitations & Caveats
The README explicitly states that "This project is gradually being deprecated, and most of the content will no longer be updated." Users are directed to other repositories (dl_note
, lite_llama
) for updated information on deep learning and inference frameworks.
4 months ago
Inactive