Open Source AI_

A comprehensive list of open-source AI tools, frameworks, and models

260 repos7.1M total stars+5,656 this week13 categories
7-DAY TRENDING
#repostars7d
06langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

24,240+305
07whisper

Robust Speech Recognition via Large-Scale Weak Supervision

97,114+259
08docling

Get your documents ready for gen AI

56,909+258
09ultralytics

Ultralytics YOLO 🚀

55,338+219
10transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

158,690+199
ALL REPOS243 of 243 repos
13 of 13 categories
CATEGORIESall selected
sort:
repostars7d
open-webui
🖥️ 12. User Interfaces & Self-hosted Platforms

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

129.7k+597
anything-llm
🖥️ 12. User Interfaces & Self-hosted Platforms

The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.

57.4k+465
hindsight
🤖 4. Agentic AI & Multi-Agent Systems

Hindsight: Agent Memory That Learns

7.0k+446
ollama
⚡ 3. Inference Engines & Serving

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

166.8k+439
promptfoo
🧪 13. Developer Tools & Integrations🛡️ 10. AI Safety, Alignment & Interpretability+1

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.

19.1k+433
langfuse
📊 8. MLOps / LLMOps & Production

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

24.2k+305
whisper
🧠 2. Open Foundation Models

Robust Speech Recognition via Large-Scale Weak Supervision

97.1k+259
docling
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Get your documents ready for gen AI

56.9k+258
ultralytics
🧩 11. Specialized Domains

Ultralytics YOLO 🚀

55.3k+219
transformers
🧬 1. Core Frameworks & Libraries

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

158.7k+199
khoj
🖥️ 12. User Interfaces & Self-hosted Platforms

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

33.8k+159
pytorch
🧬 1. Core Frameworks & Libraries

Tensors and Dynamic neural networks in Python with strong GPU acceleration

98.8k+150
mlx
⚡ 3. Inference Engines & Serving

MLX: An array framework for Apple silicon

25.0k+139
tensorflow
🧬 1. Core Frameworks & Libraries

An Open Source Machine Learning Framework for Everyone

194.4k+76
opik
📊 8. MLOps / LLMOps & Production

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

18.6k+76
ray
🛠️ 7. Training & Fine-tuning Ecosystem📊 8. MLOps / LLMOps & Production

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

41.9k+62
opencv
🧩 11. Specialized Domains

Open Source Computer Vision Library

86.9k+56
haystack
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

24.7k+49
optuna
🧬 1. Core Frameworks & Libraries

A hyperparameter optimization framework

13.8k+47
Gymnasium
🧩 11. Specialized Domains

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

11.6k+33
candle
🧬 1. Core Frameworks & Libraries

Minimalist ML framework for Rust

19.9k+31
codecompanion.nvim
🧪 13. Developer Tools & Integrations

✨ AI Coding, Vim Style

6.4k+30
stable-baselines3
🧩 11. Specialized Domains

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

13.0k+29
dvc
📊 8. MLOps / LLMOps & Production

🦉 Data Versioning and ML Experiments

15.5k+26
burn
🧬 1. Core Frameworks & Libraries

Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.

14.8k+22
xgboost
🧬 1. Core Frameworks & Libraries

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

28.2k+19
keras
🧬 1. Core Frameworks & Libraries

Deep Learning for humans

63.9k+17
tokenizers
🧬 1. Core Frameworks & Libraries

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

10.6k+17
kubeflow
📊 8. MLOps / LLMOps & Production

Machine Learning Toolkit for Kubernetes

15.6k+12
darts
🧩 11. Specialized Domains

A python library for user-friendly forecasting and anomaly detection on time series.

9.3k+12
detectron2
🧩 11. Specialized Domains

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

34.3k+11
scipy
🧬 1. Core Frameworks & Libraries

SciPy library main repository

14.6k+8
kornia
🧩 11. Specialized Domains

🐍 Geometric Computer Vision Library for Spatial AI

11.1k+8
evidently
📊 8. MLOps / LLMOps & Production

Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

7.4k+8
catboost
🧬 1. Core Frameworks & Libraries

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

8.9k+6
clearml
📊 8. MLOps / LLMOps & Production

ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution

6.6k+5
dask
🧬 1. Core Frameworks & Libraries

Parallel computing with task scheduling

13.8k+3
jupyter-ai
🧪 13. Developer Tools & Integrations

An open source extension that connects AI agents to computational notebooks in JupyterLab.

4.2k+1
openclaw
🖥️ 12. User Interfaces & Self-hosted Platforms

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

347.4k
langflow
🖥️ 12. User Interfaces & Self-hosted Platforms🤖 4. Agentic AI & Multi-Agent Systems

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

146.6k
opencode
🤖 4. Agentic AI & Multi-Agent Systems

The open source coding agent.

136.7k
dify
🤖 4. Agentic AI & Multi-Agent Systems🖥️ 12. User Interfaces & Self-hosted Platforms

Production-ready platform for agentic workflow development.

135.7k
langchain
🤖 4. Agentic AI & Multi-Agent Systems

The agent engineering platform

132.3k
ComfyUI
🎨 6. Generative Media Tools

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

107.7k
DeepSeek-V3
🧠 2. Open Foundation Models
102.5k
llama.cpp
⚡ 3. Inference Engines & Serving

LLM inference in C/C++

101.2k
ragflow
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

77.1k
vllm
⚡ 3. Inference Engines & Serving

A high-throughput and memory-efficient inference and serving engine for LLMs

75.2k
lobe-chat
🖥️ 12. User Interfaces & Self-hosted Platforms

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.

74.7k
OpenHands
🤖 4. Agentic AI & Multi-Agent Systems

🙌 OpenHands: AI-Driven Development

70.5k
LLaMA-Factory
🛠️ 7. Training & Fine-tuning Ecosystem

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

69.5k
MetaGPT
🤖 4. Agentic AI & Multi-Agent Systems

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

66.6k
scikit-learn
🧬 1. Core Frameworks & Libraries

scikit-learn: machine learning in Python

65.6k
crawl4ai
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

63.3k
open-interpreter
🧪 13. Developer Tools & Integrations

A natural language interface for computers

63.0k
cline
🧪 13. Developer Tools & Integrations

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

59.9k
unsloth
🛠️ 7. Training & Fine-tuning Ecosystem

Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.

59.3k
deer-flow
🤖 4. Agentic AI & Multi-Agent Systems

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.

57.4k
autogen
🤖 4. Agentic AI & Multi-Agent Systems

A programming framework for agentic AI

56.7k
mem0
🤖 4. Agentic AI & Multi-Agent Systems

Universal memory layer for AI Agents

51.9k
Flowise
🖥️ 12. User Interfaces & Self-hosted Platforms

Build AI Agents, Visually

51.5k
pandas
🧬 1. Core Frameworks & Libraries

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

48.3k
llama_index
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

LlamaIndex is the leading document agent and OCR platform

48.3k
Fooocus
🎨 6. Generative Media Tools

Focus on prompting and generating

48.0k
crewAI
🤖 4. Agentic AI & Multi-Agent Systems

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

48.0k
text-generation-webui
🖥️ 12. User Interfaces & Self-hosted Platforms

The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.

46.4k
milvus
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

43.6k
aider
🧪 13. Developer Tools & Integrations🤖 4. Agentic AI & Multi-Agent Systems

aider is AI pair programming in your terminal

42.8k
DeepSpeed
🛠️ 7. Training & Fine-tuning Ecosystem

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

42.0k
DeepSpeed
🧬 1. Core Frameworks & Libraries

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

42.0k
jan
🖥️ 12. User Interfaces & Self-hosted Platforms

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

41.5k
faiss
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

A library for efficient similarity search and clustering of dense vectors.

39.6k
polars
🧬 1. Core Frameworks & Libraries

Extremely fast Query Engine for DataFrames, written in Rust

38.0k
VibeVoice
🧠 2. Open Foundation Models

Open-Source Frontier Voice AI

35.7k
jax
🧬 1. Core Frameworks & Libraries

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

35.3k
LibreChat
🖥️ 12. User Interfaces & Self-hosted Platforms

Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active.

35.2k
Retrieval-based-Voice-Conversion-WebUI
🎨 6. Generative Media Tools

Easily train a good VC model with voice data <= 10 mins!

35.1k
goose
🤖 4. Agentic AI & Multi-Agent Systems

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

35.0k
mediapipe
🧩 11. Specialized Domains

Cross-platform, customizable ML solutions for live and streaming media.

34.5k
dspy
🤖 4. Agentic AI & Multi-Agent Systems

DSPy: The framework for programming—not prompting—language models

33.4k
tabby
🧪 13. Developer Tools & Integrations

Self-hosted AI coding assistant

33.3k
diffusers
🎨 6. Generative Media Tools

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

33.3k
continue
🧪 13. Developer Tools & Integrations

⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI

32.3k
graphrag
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

A modular graph-based Retrieval-Augmented Generation (RAG) system

32.0k
numpy
🧬 1. Core Frameworks & Libraries

The fundamental package for scientific computing with Python.

31.7k
pi-mono
🤖 4. Agentic AI & Multi-Agent Systems

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

31.2k
lightning
🧬 1. Core Frameworks & Libraries

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

31.0k
qdrant
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

30.0k
fish-speech
🧠 2. Open Foundation Models🎨 6. Generative Media Tools

SOTA Open Source TTS

29.0k
Open-Sora
🧠 2. Open Foundation Models🎨 6. Generative Media Tools

Open-Sora: Democratizing Efficient Video Production for All

28.8k
langgraph
🤖 4. Agentic AI & Multi-Agent Systems

Build resilient language agents as graphs.

28.4k
semantic-kernel
🤖 4. Agentic AI & Multi-Agent Systems

Integrate cutting-edge LLM technology quickly and easily into your apps

27.6k
chroma
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Data infrastructure for AI

27.1k
generative-models
🎨 6. Generative Media Tools

Generative Models by Stability AI

27.1k
browser
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Lightpanda: the headless browser designed for AI and automation

27.0k
InvokeAI
🎨 6. Generative Media Tools

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.

26.9k
smolagents
🤖 4. Agentic AI & Multi-Agent Systems

🤗 smolagents: a barebones library for agents that think in code.

26.4k
sglang
⚡ 3. Inference Engines & Serving

SGLang is a high-performance serving framework for large language models and multimodal models.

25.4k
flux
🎨 6. Generative Media Tools

Official inference repo for FLUX.1 models

25.4k
SillyTavern
🖥️ 12. User Interfaces & Self-hosted Platforms

LLM Frontend for Power Users.

25.2k
mlflow
📊 8. MLOps / LLMOps & Production

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

25.1k
MiniCPM-V
🧠 2. Open Foundation Models

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

24.3k
hermes-agent
🤖 4. Agentic AI & Multi-Agent Systems

The agent that grows with you

24.0k
audiocraft
🎨 6. Generative Media Tools🧠 2. Open Foundation Models

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

23.1k
flash-attention
🧬 1. Core Frameworks & Libraries

Fast and memory-efficient exact attention

23.1k
DeepSeek-Coder
🧠 2. Open Foundation Models

DeepSeek Coder: Let the Code Write Itself

23.0k
Roo-Code
🧪 13. Developer Tools & Integrations

Roo Code gives you a whole dev team of AI agents in your code editor.

23.0k
mastra
🤖 4. Agentic AI & Multi-Agent Systems

From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.

22.7k
mlc-llm
⚡ 3. Inference Engines & Serving

Universal LLM Deployment Engine with ML Compilation

22.3k
letta
🤖 4. Agentic AI & Multi-Agent Systems

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

21.9k
datasets
📈 9. Evaluation, Benchmarks & Datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

21.4k
swarm
🤖 4. Agentic AI & Multi-Agent Systems

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

21.3k
Qwen
🧠 2. Open Foundation Models

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

20.9k
peft
🛠️ 7. Training & Fine-tuning Ecosystem

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

20.9k
pgvector
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Open-source vector similarity search for Postgres

20.6k
CosyVoice
🎨 6. Generative Media Tools

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

20.4k
onnxruntime
🧩 11. Specialized Domains🧬 1. Core Frameworks & Libraries

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

19.7k
owl
🤖 4. Agentic AI & Multi-Agent Systems

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

19.3k
Qwen3-VL
🧠 2. Open Foundation Models

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

18.9k
sam2
🧩 11. Specialized Domains

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

18.8k
sentence-transformers
🧬 1. Core Frameworks & Libraries

State-of-the-Art Text Embeddings

18.5k
LightGBM
🧬 1. Core Frameworks & Libraries

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

18.2k
trl
🛠️ 7. Training & Fine-tuning Ecosystem

Train transformer language models with reinforcement learning.

17.9k
SuperAGI
🤖 4. Agentic AI & Multi-Agent Systems

<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

17.4k
codellama
🧠 2. Open Foundation Models

Inference code for CodeLlama models

16.3k
Qwen3-Coder
🧠 2. Open Foundation Models

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

16.2k
weaviate
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

15.9k
Megatron-LM
🛠️ 7. Training & Fine-tuning Ecosystem

Ongoing research training transformer models at scale

15.9k
Wan2.1
🎨 6. Generative Media Tools

Wan: Open and Advanced Large-Scale Video Generative Models

15.7k
deepeval
📈 9. Evaluation, Benchmarks & Datasets🧪 13. Developer Tools & Integrations

The LLM Evaluation Framework

14.4k
unstructured
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

14.4k
Hunyuan3D-2
🎨 6. Generative Media Tools

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

13.4k
litgpt
🛠️ 7. Training & Fine-tuning Ecosystem

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

13.3k
TensorRT-LLM
⚡ 3. Inference Engines & Serving

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

13.3k
tvm
🧩 11. Specialized Domains

Open Machine Learning Compiler Framework

13.2k
ragas
📈 9. Evaluation, Benchmarks & Datasets

Supercharge Your LLM Application Evaluations 🚀

13.2k
CogVideo
🧠 2. Open Foundation Models

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

12.6k
TRELLIS
🎨 6. Generative Media Tools

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

12.1k
AnimateDiff
🧠 2. Open Foundation Models🎨 6. Generative Media Tools

Official implementation of AnimateDiff.

12.1k
lm-evaluation-harness
📈 9. Evaluation, Benchmarks & Datasets

A framework for few-shot evaluation of language models.

12.0k
Time-Series-Library
🧩 11. Specialized Domains

A Library for Advanced Deep Time Series Models for General Time Series Analysis.

11.9k
HunyuanVideo
🎨 6. Generative Media Tools

HunyuanVideo: A Systematic Framework For Large Video Generation Model

11.9k
axolotl
🛠️ 7. Training & Fine-tuning Ecosystem

Go ahead and axolotl questions

11.6k
FlagEmbedding
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Retrieval and Retrieval-augmented LLMs

11.5k
nerfstudio
🎨 6. Generative Media Tools

A collaboration friendly studio for NeRFs

11.4k
text-generation-inference
⚡ 3. Inference Engines & Serving

Large Language Model Text Generation Inference

10.8k
chat-ui
🖥️ 12. User Interfaces & Self-hosted Platforms

The open source codebase powering HuggingChat

10.6k
xformers
🧬 1. Core Frameworks & Libraries

Hackable and optimized Transformers building blocks, supporting a composable construction.

10.4k
autogluon
🧬 1. Core Frameworks & Libraries

Fast and Accurate ML in 3 Lines of Code

10.2k
llama-cpp-python
⚡ 3. Inference Engines & Serving

Python bindings for llama.cpp

10.1k
tpot
🧬 1. Core Frameworks & Libraries

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

10.0k
openvino
🧩 11. Specialized Domains

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

10.0k
InternVL
🧠 2. Open Foundation Models

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

9.9k
koboldcpp
⚡ 3. Inference Engines & Serving

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

9.9k
LTX-Video
🎨 6. Generative Media Tools

Official repository for LTX-Video

9.8k
lancedb
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

9.8k
accelerate
🧬 1. Core Frameworks & Libraries

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

9.6k
cleanrl
🧩 11. Specialized Domains

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

9.5k
einops
🧬 1. Core Frameworks & Libraries

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

9.4k
autokeras
🧬 1. Core Frameworks & Libraries

AutoML library for deep learning

9.3k
OpenRLHF
🛡️ 10. AI Safety, Alignment & Interpretability

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

9.3k
assistant-ui
🧪 13. Developer Tools & Integrations

Typescript/React Library for AI Chat💬🚀

9.2k
phoenix
🧪 13. Developer Tools & Integrations📊 8. MLOps / LLMOps & Production

AI Observability & Evaluation

9.2k
BentoML
📊 8. MLOps / LLMOps & Production

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

8.6k
ACE-Step-1.5
🎨 6. Generative Media Tools

The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

8.5k
bitsandbytes
⚡ 3. Inference Engines & Serving

Accessible large language models via k-bit quantization for PyTorch.

8.1k
Verba
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

7.6k
garak
🛡️ 10. AI Safety, Alignment & Interpretability🧪 13. Developer Tools & Integrations

the LLM vulnerability scanner

7.5k
garak
📊 8. MLOps / LLMOps & Production

the LLM vulnerability scanner

7.5k
mergekit
🛠️ 7. Training & Fine-tuning Ecosystem

Tools for merging pretrained large language models.

6.9k
opencompass
📈 9. Evaluation, Benchmarks & Datasets

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

6.8k
IsaacLab
🧩 11. Specialized Domains

Unified framework for robot learning built on NVIDIA Isaac Sim

6.8k
SkyReels-V2
🎨 6. Generative Media Tools

SkyReels-V2: Infinite-length Film Generative model

6.7k
guardrails
📊 8. MLOps / LLMOps & Production

Adding guardrails to large language models.

6.6k
OLMo
🧠 2. Open Foundation Models

Modeling, training, eval, and inference code for OLMo

6.5k
TripoSR
🎨 6. Generative Media Tools

TripoSR: Fast 3D Object Reconstruction from a Single Image

6.3k
Liger-Kernel
🛠️ 7. Training & Fine-tuning Ecosystem

Efficient Triton Kernels for LLM Training

6.3k
StyleTTS2
🎨 6. Generative Media Tools

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

6.2k
data-juicer
🛠️ 7. Training & Fine-tuning Ecosystem

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

6.2k
swarms
🤖 4. Agentic AI & Multi-Agent Systems

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

6.2k
NeMo-Guardrails
📊 8. MLOps / LLMOps & Production

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

5.9k
torchtune
🛠️ 7. Training & Fine-tuning Ecosystem

PyTorch native post-training library

5.7k
captum
🛡️ 10. AI Safety, Alignment & Interpretability

Model interpretability and understanding for PyTorch

5.6k
alignment-handbook
🛡️ 10. AI Safety, Alignment & Interpretability

Robust recipes to align language models with human and AI preferences

5.5k
Wonder3D
🎨 6. Generative Media Tools

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

5.3k
zenml
📊 8. MLOps / LLMOps & Production

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

5.3k
kserve
📊 8. MLOps / LLMOps & Production

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

5.3k
chronos-forecasting
🧩 11. Specialized Domains

Chronos: Pretrained Models for Time Series Forecasting

5.1k
AutoGPTQ
⚡ 3. Inference Engines & Serving

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

5.0k
RedPajama-Data
📈 9. Evaluation, Benchmarks & Datasets

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

4.9k
argilla
🛠️ 7. Training & Fine-tuning Ecosystem

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

4.9k
gsplat
🎨 6. Generative Media Tools

CUDA accelerated rasterization of gaussian splatting

4.8k
AI-Scientist-v2
🤖 4. Agentic AI & Multi-Agent Systems

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

4.7k
LLaVA-NeXT
🧠 2. Open Foundation Models
4.6k
exllamav2
⚡ 3. Inference Engines & Serving

A fast inference library for running LLMs locally on modern consumer-class GPUs

4.5k
executorch
🧩 11. Specialized Domains

On-device AI across mobile, embedded and edge for PyTorch

4.5k
FLAML
🧬 1. Core Frameworks & Libraries

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

4.3k
gemma
🧠 2. Open Foundation Models

Gemma open-weight LLM library, from Google DeepMind

4.3k
PurpleLlama
📊 8. MLOps / LLMOps & Production

Set of tools to assess and improve LLM security.

4.1k
RAGatouille
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

3.9k
mistral-vibe
🤖 4. Agentic AI & Multi-Agent Systems

Minimal CLI coding agent by Mistral

3.8k
PhiCookBook
🧠 2. Open Foundation Models

This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks

3.7k
safetensors
🧬 1. Core Frameworks & Libraries

Simple, safe way to store and distribute tensors

3.7k
mochi
🎨 6. Generative Media Tools🧠 2. Open Foundation Models

The best OSS video generation models, created by Genmo

3.6k
SDV
🛠️ 7. Training & Fine-tuning Ecosystem

Synthetic data generation for tabular data

3.5k
optimum
⚡ 3. Inference Engines & Serving

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

3.3k
TransformerLens
🛡️ 10. AI Safety, Alignment & Interpretability

A library for mechanistic interpretability of GPT-style language models

3.3k
distilabel
🛠️ 7. Training & Fine-tuning Ecosystem

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

3.2k
MiniMax-M1
🧠 2. Open Foundation Models

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

3.1k
llm-guard
🛡️ 10. AI Safety, Alignment & Interpretability📊 8. MLOps / LLMOps & Production

The Security Toolkit for LLM Interactions

2.8k
ao
🧬 1. Core Frameworks & Libraries

PyTorch native quantization and sparsity for training and inference

2.8k
helm
📈 9. Evaluation, Benchmarks & Datasets

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.

2.7k
OSWorld
📈 9. Evaluation, Benchmarks & Datasets

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

2.7k
colpali
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

2.6k
evaluate
📈 9. Evaluation, Benchmarks & Datasets

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

2.4k
torchmetrics
🧬 1. Core Frameworks & Libraries

Machine learning metrics for distributed, scalable PyTorch applications.

2.4k
lighteval
📈 9. Evaluation, Benchmarks & Datasets

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

2.4k
AutoAWQ
⚡ 3. Inference Engines & Serving

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

2.3k
GLM-V
🧠 2. Open Foundation Models

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

2.3k
llama-agents
🤖 4. Agentic AI & Multi-Agent Systems

Deploy your agentic worfklows to production

2.1k
starcoder2
🧠 2. Open Foundation Models

Home of StarCoder2!

2.1k
GLM-5
🧠 2. Open Foundation Models

GLM-5: From Vibe Coding to Agentic Engineering

2.0k
llama.vim
🧪 13. Developer Tools & Integrations

Vim plugin for LLM-assisted code/text completion

1.9k
aphrodite-engine
⚡ 3. Inference Engines & Serving

Large-scale LLM inference engine

1.7k
Kimi-K2.5
🧠 2. Open Foundation Models

Moonshot's most powerful model

1.7k
nanocoder
🤖 4. Agentic AI & Multi-Agent Systems

A beautiful local-first coding agent running in your terminal - built by the community for the community ⚒

1.6k
safe-rlhf
🛡️ 10. AI Safety, Alignment & Interpretability

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

1.6k
sd3.5
🎨 6. Generative Media Tools
1.5k
OuteTTS
🧠 2. Open Foundation Models🎨 6. Generative Media Tools

Interface for OuteTTS models.

1.4k
AutoTS
🧩 11. Specialized Domains

Automated Time Series Forecasting

1.4k
Newelle
🖥️ 12. User Interfaces & Self-hosted Platforms

Newelle - Your Ultimate Virtual Assistant

1.3k
SAELens
🛡️ 10. AI Safety, Alignment & Interpretability

Training Sparse Autoencoders on Language Models

1.3k
weave
📊 8. MLOps / LLMOps & Production

Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.

1.1k
hqq
⚡ 3. Inference Engines & Serving

Official implementation of Half-Quadratic Quantization (HQQ)

924
nnsight
🛡️ 10. AI Safety, Alignment & Interpretability

The nnsight package enables interpreting and manipulating the internals of deep learned models.

883
livecodebench
📈 9. Evaluation, Benchmarks & Datasets

Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"

834
JaxMARL
🧩 11. Specialized Domains

Multi-Agent Reinforcement Learning with JAX

780
ome
⚡ 3. Inference Engines & Serving

Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

412
NornicDB
🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge

Nornicdb is a low-latency, Graph + Vector, Temporal MVCC with all sub-ms HNSW search, graph traversal, and writes. Uses Neo4j Bolt/Cypher and qdrant's gRPC drivers so you can switch with no changes. Then, adding intelligent features like schemas, managed embeddings, LLM reranking+inferrence, GPU acceleration, Auto-TLP, Memory Decay, and MCP server.

375
MMLU-Pro
📈 9. Evaluation, Benchmarks & Datasets

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

359
forgetful
🤖 4. Agentic AI & Multi-Agent Systems

Opensource Memory for Agents

237
gpt4all
🖥️ 12. User Interfaces & Self-hosted Platforms

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

77.2k-1
ColossalAI
🛠️ 7. Training & Fine-tuning Ecosystem

Making large AI models cheaper, faster and more accessible

41.4k-3