gpuocelot by gtcasl

Dynamic compilation framework for PTX

Created 11 years ago

290 stars

Top 91.0% on SourcePulse

Project Summary

GPUOCelot is a modular dynamic compilation framework for heterogeneous systems, designed to execute CUDA programs on NVIDIA GPUs, AMD GPUs, and x86-CPUs without recompilation. It targets researchers and developers working with parallel computing and GPU architectures who need a flexible platform for analyzing and executing CUDA code across diverse hardware.

How It Works

GPUOCelot employs a dynamic compilation approach, analyzing and transforming PTX (Parallel Thread Execution) virtual instruction sets. This allows for runtime adaptation and execution on different hardware backends, including NVIDIA GPUs, AMD GPUs, and x86 CPUs, aiming for full execution speed.

Quick Start & Requirements

Installation instructions are available at https://github.com/gtcasl/gpuocelot/wiki/Installation.

Highlighted Details

Enables CUDA program execution on NVIDIA GPUs, AMD GPUs, and x86-CPUs without recompilation.
Provides analysis modules for the PTX virtual instruction set.
Aims for full execution speed across supported platforms.

Maintenance & Community

The project is no longer actively maintained. The last news update was in March 2013, seeking developers for AMD and Intel GPU backends. A mailing list is available at http://groups.google.com/group/gpuocelot.

Licensing & Compatibility

The README does not specify a license.

Limitations & Caveats

The project is explicitly stated as no longer actively maintained, indicating a lack of ongoing development, bug fixes, or support for newer hardware architectures or CUDA versions. Documentation for installation and common usage is also noted as lacking.

Health Check

Last Commit

2 years ago

Responsiveness

1 day

Pull Requests (30d)

0

Issues (30d)

0

Star History

0 stars in the last 30 days

Explore Similar Projects

llama3.cuda by likejazz

C/CUDA implementation for Llama 3 model

Created 1 year ago

Updated 10 months ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems").

gpu-optimization-workshop by mlops-discord

Workshop materials for GPU optimization

Created 1 year ago

Updated 1 year ago

Starred by

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera).

data-science-stack by NVIDIA

NVIDIA Data Science Stack: tool for GPU-accelerated data science setup

Created 6 years ago

Updated 2 years ago

Starred by

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera),

Wing Lian

Wing Lian(Founder of Axolotl AI), and

1 more.

resource-stream by gpu-mode

CUDA resource collection for GPU programming

Created 2 years ago

Updated 5 months ago

Starred by

Luis Capelo

Luis Capelo(Cofounder of Lightning AI),

Tristan Hume

Tristan Hume(MTS at Anthropic), and

2 more.

gdrcopy by NVIDIA

GPU memory copy library using GPUDirect RDMA

Created 11 years ago

Updated 2 months ago

Starred by

Jonathan Ragan-Kelley

Jonathan Ragan-Kelley(Professor at MIT) and

Luis Capelo

Luis Capelo(Cofounder of Lightning AI).

kompute by KomputeProject

GPU compute framework for cross-vendor graphics cards

Created 5 years ago

Updated 3 weeks ago

Starred by

Andrej Karpathy

Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n),

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and

4 more.

gpu.cpp by AnswerDotAI

C++ library for portable GPU computation using WebGPU

Created 1 year ago

Updated 4 months ago

Starred by

Ji Yichao

Ji Yichao(Cofounder of Manus) and

Ying Sheng

Ying Sheng(Coauthor of SGLang).

how-to-optim-algorithm-in-cuda by BBuf

CUDA optimization guide for common algorithms

Created 7 years ago

Updated 1 week ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and

Ying Sheng

Ying Sheng(Coauthor of SGLang).

fastllm by ztxz16

High-performance C++ LLM inference library

Created 2 years ago

Updated 17 hours ago

Starred by

David Cournapeau

David Cournapeau(Author of scikit-learn),

Stas Bekman

Stas Bekman(Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake), and

5 more.

lectures by gpu-mode

Lecture series for GPU-accelerated computing

Created 2 years ago

Updated 3 weeks ago

Starred by

Alex Chen

Alex Chen(Cofounder of Nexa AI).

cuda-course by Infatoshi

CUDA course materials

Created 1 year ago

Updated 2 weeks ago

Starred by

Pankaj Gupta

Pankaj Gupta(Cofounder of Baseten),

Tri Dao

Tri Dao(Chief Scientist at Together AI), and

24 more.

cutlass by NVIDIA

CUDA C++ and Python DSLs for high-performance linear algebra

Created 8 years ago

Updated 20 hours ago

Feedback? Help us improve.