██████╗ ██████╗ ███████╗███╗ ██╗ ███████╗ ██████╗ ██╗ ██╗██████╗ ██████╗███████╗ █████╗ ██╗
██╔═══██╗██╔══██╗██╔════╝████╗ ██║ ██╔════╝██╔═══██╗██║ ██║██╔══██╗██╔════╝██╔════╝ ██╔══██╗██║
██║ ██║██████╔╝█████╗ ██╔██╗ ██║ ███████╗██║ ██║██║ ██║██████╔╝██║ █████╗ ███████║██║
██║ ██║██╔═══╝ ██╔══╝ ██║╚██╗██║ ╚════██║██║ ██║██║ ██║██╔══██╗██║ ██╔══╝ ██╔══██║██║
╚██████╔╝██║ ███████╗██║ ╚████║ ███████║╚██████╔╝╚██████╔╝██║ ██║╚██████╗███████╗ ██║ ██║██║
╚═════╝ ╚═╝ ╚══════╝╚═╝ ╚═══╝ ╚══════╝ ╚═════╝ ╚═════╝ ╚═╝ ╚═╝ ╚═════╝╚══════╝ ╚═╝ ╚═╝╚═╝
A comprehensive list of open-source AI tools, frameworks, and models
open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
anything-llm
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
hindsight
Hindsight: Agent Memory That Learns
ollama
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
promptfoo
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.
| # | repo | stars | 7d |
|---|---|---|---|
| 06 | langfuse 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23 | 24,240 | +305 |
| 07 | whisper Robust Speech Recognition via Large-Scale Weak Supervision | 97,114 | +259 |
| 08 | docling Get your documents ready for gen AI | 56,909 | +258 |
| 09 | ultralytics Ultralytics YOLO 🚀 | 55,338 | +219 |
| 10 | transformers 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training. | 158,690 | +199 |
| repo | stars | 7d |
|---|---|---|
open-webui 🖥️ 12. User Interfaces & Self-hosted Platforms User-friendly AI Interface (Supports Ollama, OpenAI API, ...) | 129.7k | +597 |
anything-llm 🖥️ 12. User Interfaces & Self-hosted Platforms The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration. | 57.4k | +465 |
hindsight 🤖 4. Agentic AI & Multi-Agent Systems Hindsight: Agent Memory That Learns | 7.0k | +446 |
ollama ⚡ 3. Inference Engines & Serving Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. | 166.8k | +439 |
promptfoo 🧪 13. Developer Tools & Integrations🛡️ 10. AI Safety, Alignment & Interpretability+1 Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic. | 19.1k | +433 |
langfuse 📊 8. MLOps / LLMOps & Production 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23 | 24.2k | +305 |
whisper 🧠 2. Open Foundation Models Robust Speech Recognition via Large-Scale Weak Supervision | 97.1k | +259 |
docling 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Get your documents ready for gen AI | 56.9k | +258 |
ultralytics 🧩 11. Specialized Domains Ultralytics YOLO 🚀 | 55.3k | +219 |
transformers 🧬 1. Core Frameworks & Libraries 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training. | 158.7k | +199 |
khoj 🖥️ 12. User Interfaces & Self-hosted Platforms Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free. | 33.8k | +159 |
pytorch 🧬 1. Core Frameworks & Libraries Tensors and Dynamic neural networks in Python with strong GPU acceleration | 98.8k | +150 |
mlx ⚡ 3. Inference Engines & Serving MLX: An array framework for Apple silicon | 25.0k | +139 |
tensorflow 🧬 1. Core Frameworks & Libraries An Open Source Machine Learning Framework for Everyone | 194.4k | +76 |
opik 📊 8. MLOps / LLMOps & Production Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards. | 18.6k | +76 |
ray 🛠️ 7. Training & Fine-tuning Ecosystem📊 8. MLOps / LLMOps & Production Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. | 41.9k | +62 |
opencv 🧩 11. Specialized Domains Open Source Computer Vision Library | 86.9k | +56 |
haystack 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems. | 24.7k | +49 |
optuna 🧬 1. Core Frameworks & Libraries A hyperparameter optimization framework | 13.8k | +47 |
Gymnasium 🧩 11. Specialized Domains An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym) | 11.6k | +33 |
candle 🧬 1. Core Frameworks & Libraries Minimalist ML framework for Rust | 19.9k | +31 |
codecompanion.nvim 🧪 13. Developer Tools & Integrations ✨ AI Coding, Vim Style | 6.4k | +30 |
stable-baselines3 🧩 11. Specialized Domains PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms. | 13.0k | +29 |
dvc 📊 8. MLOps / LLMOps & Production 🦉 Data Versioning and ML Experiments | 15.5k | +26 |
burn 🧬 1. Core Frameworks & Libraries Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability. | 14.8k | +22 |
xgboost 🧬 1. Core Frameworks & Libraries Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow | 28.2k | +19 |
keras 🧬 1. Core Frameworks & Libraries Deep Learning for humans | 63.9k | +17 |
tokenizers 🧬 1. Core Frameworks & Libraries 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production | 10.6k | +17 |
kubeflow 📊 8. MLOps / LLMOps & Production Machine Learning Toolkit for Kubernetes | 15.6k | +12 |
darts 🧩 11. Specialized Domains A python library for user-friendly forecasting and anomaly detection on time series. | 9.3k | +12 |
detectron2 🧩 11. Specialized Domains Detectron2 is a platform for object detection, segmentation and other visual recognition tasks. | 34.3k | +11 |
scipy 🧬 1. Core Frameworks & Libraries SciPy library main repository | 14.6k | +8 |
kornia 🧩 11. Specialized Domains 🐍 Geometric Computer Vision Library for Spatial AI | 11.1k | +8 |
evidently 📊 8. MLOps / LLMOps & Production Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics. | 7.4k | +8 |
catboost 🧬 1. Core Frameworks & Libraries A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU. | 8.9k | +6 |
clearml 📊 8. MLOps / LLMOps & Production ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution | 6.6k | +5 |
dask 🧬 1. Core Frameworks & Libraries Parallel computing with task scheduling | 13.8k | +3 |
jupyter-ai 🧪 13. Developer Tools & Integrations An open source extension that connects AI agents to computational notebooks in JupyterLab. | 4.2k | +1 |
openclaw 🖥️ 12. User Interfaces & Self-hosted Platforms Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞 | 347.4k | |
langflow 🖥️ 12. User Interfaces & Self-hosted Platforms🤖 4. Agentic AI & Multi-Agent Systems Langflow is a powerful tool for building and deploying AI-powered agents and workflows. | 146.6k | |
opencode 🤖 4. Agentic AI & Multi-Agent Systems The open source coding agent. | 136.7k | |
dify 🤖 4. Agentic AI & Multi-Agent Systems🖥️ 12. User Interfaces & Self-hosted Platforms Production-ready platform for agentic workflow development. | 135.7k | |
langchain 🤖 4. Agentic AI & Multi-Agent Systems The agent engineering platform | 132.3k | |
ComfyUI 🎨 6. Generative Media Tools The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. | 107.7k | |
DeepSeek-V3 🧠 2. Open Foundation Models | 102.5k | |
llama.cpp ⚡ 3. Inference Engines & Serving LLM inference in C/C++ | 101.2k | |
ragflow 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs | 77.1k | |
vllm ⚡ 3. Inference Engines & Serving A high-throughput and memory-efficient inference and serving engine for LLMs | 75.2k | |
lobe-chat 🖥️ 12. User Interfaces & Self-hosted Platforms The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction. | 74.7k | |
OpenHands 🤖 4. Agentic AI & Multi-Agent Systems 🙌 OpenHands: AI-Driven Development | 70.5k | |
LLaMA-Factory 🛠️ 7. Training & Fine-tuning Ecosystem Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) | 69.5k | |
MetaGPT 🤖 4. Agentic AI & Multi-Agent Systems 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming | 66.6k | |
scikit-learn 🧬 1. Core Frameworks & Libraries scikit-learn: machine learning in Python | 65.6k | |
crawl4ai 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN | 63.3k | |
open-interpreter 🧪 13. Developer Tools & Integrations A natural language interface for computers | 63.0k | |
cline 🧪 13. Developer Tools & Integrations Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way. | 59.9k | |
unsloth 🛠️ 7. Training & Fine-tuning Ecosystem Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally. | 59.3k | |
deer-flow 🤖 4. Agentic AI & Multi-Agent Systems An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours. | 57.4k | |
autogen 🤖 4. Agentic AI & Multi-Agent Systems A programming framework for agentic AI | 56.7k | |
mem0 🤖 4. Agentic AI & Multi-Agent Systems Universal memory layer for AI Agents | 51.9k | |
Flowise 🖥️ 12. User Interfaces & Self-hosted Platforms Build AI Agents, Visually | 51.5k | |
pandas 🧬 1. Core Frameworks & Libraries Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more | 48.3k | |
llama_index 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge LlamaIndex is the leading document agent and OCR platform | 48.3k | |
Fooocus 🎨 6. Generative Media Tools Focus on prompting and generating | 48.0k | |
crewAI 🤖 4. Agentic AI & Multi-Agent Systems Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks. | 48.0k | |
text-generation-webui 🖥️ 12. User Interfaces & Self-hosted Platforms The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline. | 46.4k | |
milvus 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search | 43.6k | |
aider 🧪 13. Developer Tools & Integrations🤖 4. Agentic AI & Multi-Agent Systems aider is AI pair programming in your terminal | 42.8k | |
DeepSpeed 🛠️ 7. Training & Fine-tuning Ecosystem DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. | 42.0k | |
DeepSpeed 🧬 1. Core Frameworks & Libraries DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. | 42.0k | |
jan 🖥️ 12. User Interfaces & Self-hosted Platforms Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. | 41.5k | |
faiss 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge A library for efficient similarity search and clustering of dense vectors. | 39.6k | |
polars 🧬 1. Core Frameworks & Libraries Extremely fast Query Engine for DataFrames, written in Rust | 38.0k | |
VibeVoice 🧠 2. Open Foundation Models Open-Source Frontier Voice AI | 35.7k | |
jax 🧬 1. Core Frameworks & Libraries Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more | 35.3k | |
LibreChat 🖥️ 12. User Interfaces & Self-hosted Platforms Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active. | 35.2k | |
Retrieval-based-Voice-Conversion-WebUI 🎨 6. Generative Media Tools Easily train a good VC model with voice data <= 10 mins! | 35.1k | |
goose 🤖 4. Agentic AI & Multi-Agent Systems an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM | 35.0k | |
mediapipe 🧩 11. Specialized Domains Cross-platform, customizable ML solutions for live and streaming media. | 34.5k | |
dspy 🤖 4. Agentic AI & Multi-Agent Systems DSPy: The framework for programming—not prompting—language models | 33.4k | |
tabby 🧪 13. Developer Tools & Integrations Self-hosted AI coding assistant | 33.3k | |
diffusers 🎨 6. Generative Media Tools 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch. | 33.3k | |
continue 🧪 13. Developer Tools & Integrations ⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI | 32.3k | |
graphrag 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge A modular graph-based Retrieval-Augmented Generation (RAG) system | 32.0k | |
numpy 🧬 1. Core Frameworks & Libraries The fundamental package for scientific computing with Python. | 31.7k | |
pi-mono 🤖 4. Agentic AI & Multi-Agent Systems AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods | 31.2k | |
lightning 🧬 1. Core Frameworks & Libraries Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes. | 31.0k | |
qdrant 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/ | 30.0k | |
fish-speech 🧠 2. Open Foundation Models🎨 6. Generative Media Tools SOTA Open Source TTS | 29.0k | |
Open-Sora 🧠 2. Open Foundation Models🎨 6. Generative Media Tools Open-Sora: Democratizing Efficient Video Production for All | 28.8k | |
langgraph 🤖 4. Agentic AI & Multi-Agent Systems Build resilient language agents as graphs. | 28.4k | |
semantic-kernel 🤖 4. Agentic AI & Multi-Agent Systems Integrate cutting-edge LLM technology quickly and easily into your apps | 27.6k | |
chroma 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Data infrastructure for AI | 27.1k | |
generative-models 🎨 6. Generative Media Tools Generative Models by Stability AI | 27.1k | |
browser 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Lightpanda: the headless browser designed for AI and automation | 27.0k | |
InvokeAI 🎨 6. Generative Media Tools Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products. | 26.9k | |
smolagents 🤖 4. Agentic AI & Multi-Agent Systems 🤗 smolagents: a barebones library for agents that think in code. | 26.4k | |
sglang ⚡ 3. Inference Engines & Serving SGLang is a high-performance serving framework for large language models and multimodal models. | 25.4k | |
flux 🎨 6. Generative Media Tools Official inference repo for FLUX.1 models | 25.4k | |
SillyTavern 🖥️ 12. User Interfaces & Self-hosted Platforms LLM Frontend for Power Users. | 25.2k | |
mlflow 📊 8. MLOps / LLMOps & Production The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data. | 25.1k | |
MiniCPM-V 🧠 2. Open Foundation Models A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone | 24.3k | |
hermes-agent 🤖 4. Agentic AI & Multi-Agent Systems The agent that grows with you | 24.0k | |
audiocraft 🎨 6. Generative Media Tools🧠 2. Open Foundation Models Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning. | 23.1k | |
flash-attention 🧬 1. Core Frameworks & Libraries Fast and memory-efficient exact attention | 23.1k | |
DeepSeek-Coder 🧠 2. Open Foundation Models DeepSeek Coder: Let the Code Write Itself | 23.0k | |
Roo-Code 🧪 13. Developer Tools & Integrations Roo Code gives you a whole dev team of AI agents in your code editor. | 23.0k | |
mastra 🤖 4. Agentic AI & Multi-Agent Systems From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack. | 22.7k | |
mlc-llm ⚡ 3. Inference Engines & Serving Universal LLM Deployment Engine with ML Compilation | 22.3k | |
letta 🤖 4. Agentic AI & Multi-Agent Systems Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time. | 21.9k | |
datasets 📈 9. Evaluation, Benchmarks & Datasets 🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools | 21.4k | |
swarm 🤖 4. Agentic AI & Multi-Agent Systems Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. | 21.3k | |
Qwen 🧠 2. Open Foundation Models The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. | 20.9k | |
peft 🛠️ 7. Training & Fine-tuning Ecosystem 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. | 20.9k | |
pgvector 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Open-source vector similarity search for Postgres | 20.6k | |
CosyVoice 🎨 6. Generative Media Tools Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. | 20.4k | |
onnxruntime 🧩 11. Specialized Domains🧬 1. Core Frameworks & Libraries ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator | 19.7k | |
owl 🤖 4. Agentic AI & Multi-Agent Systems 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation | 19.3k | |
Qwen3-VL 🧠 2. Open Foundation Models Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. | 18.9k | |
sam2 🧩 11. Specialized Domains The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model. | 18.8k | |
sentence-transformers 🧬 1. Core Frameworks & Libraries State-of-the-Art Text Embeddings | 18.5k | |
LightGBM 🧬 1. Core Frameworks & Libraries A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. | 18.2k | |
trl 🛠️ 7. Training & Fine-tuning Ecosystem Train transformer language models with reinforcement learning. | 17.9k | |
SuperAGI 🤖 4. Agentic AI & Multi-Agent Systems <⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably. | 17.4k | |
codellama 🧠 2. Open Foundation Models Inference code for CodeLlama models | 16.3k | |
Qwen3-Coder 🧠 2. Open Foundation Models Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team. | 16.2k | |
weaviate 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database. | 15.9k | |
Megatron-LM 🛠️ 7. Training & Fine-tuning Ecosystem Ongoing research training transformer models at scale | 15.9k | |
Wan2.1 🎨 6. Generative Media Tools Wan: Open and Advanced Large-Scale Video Generative Models | 15.7k | |
deepeval 📈 9. Evaluation, Benchmarks & Datasets🧪 13. Developer Tools & Integrations The LLM Evaluation Framework | 14.4k | |
unstructured 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding. | 14.4k | |
Hunyuan3D-2 🎨 6. Generative Media Tools High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. | 13.4k | |
litgpt 🛠️ 7. Training & Fine-tuning Ecosystem 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. | 13.3k | |
TensorRT-LLM ⚡ 3. Inference Engines & Serving TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way. | 13.3k | |
tvm 🧩 11. Specialized Domains Open Machine Learning Compiler Framework | 13.2k | |
ragas 📈 9. Evaluation, Benchmarks & Datasets Supercharge Your LLM Application Evaluations 🚀 | 13.2k | |
CogVideo 🧠 2. Open Foundation Models text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) | 12.6k | |
TRELLIS 🎨 6. Generative Media Tools Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight). | 12.1k | |
AnimateDiff 🧠 2. Open Foundation Models🎨 6. Generative Media Tools Official implementation of AnimateDiff. | 12.1k | |
lm-evaluation-harness 📈 9. Evaluation, Benchmarks & Datasets A framework for few-shot evaluation of language models. | 12.0k | |
Time-Series-Library 🧩 11. Specialized Domains A Library for Advanced Deep Time Series Models for General Time Series Analysis. | 11.9k | |
HunyuanVideo 🎨 6. Generative Media Tools HunyuanVideo: A Systematic Framework For Large Video Generation Model | 11.9k | |
axolotl 🛠️ 7. Training & Fine-tuning Ecosystem Go ahead and axolotl questions | 11.6k | |
FlagEmbedding 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Retrieval and Retrieval-augmented LLMs | 11.5k | |
nerfstudio 🎨 6. Generative Media Tools A collaboration friendly studio for NeRFs | 11.4k | |
text-generation-inference ⚡ 3. Inference Engines & Serving Large Language Model Text Generation Inference | 10.8k | |
chat-ui 🖥️ 12. User Interfaces & Self-hosted Platforms The open source codebase powering HuggingChat | 10.6k | |
xformers 🧬 1. Core Frameworks & Libraries Hackable and optimized Transformers building blocks, supporting a composable construction. | 10.4k | |
autogluon 🧬 1. Core Frameworks & Libraries Fast and Accurate ML in 3 Lines of Code | 10.2k | |
llama-cpp-python ⚡ 3. Inference Engines & Serving Python bindings for llama.cpp | 10.1k | |
tpot 🧬 1. Core Frameworks & Libraries A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming. | 10.0k | |
openvino 🧩 11. Specialized Domains OpenVINO™ is an open source toolkit for optimizing and deploying AI inference | 10.0k | |
InternVL 🧠 2. Open Foundation Models [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型 | 9.9k | |
koboldcpp ⚡ 3. Inference Engines & Serving Run GGUF models easily with a KoboldAI UI. One File. Zero Install. | 9.9k | |
LTX-Video 🎨 6. Generative Media Tools Official repository for LTX-Video | 9.8k | |
lancedb 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less. | 9.8k | |
accelerate 🧬 1. Core Frameworks & Libraries 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support | 9.6k | |
cleanrl 🧩 11. Specialized Domains High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG) | 9.5k | |
einops 🧬 1. Core Frameworks & Libraries Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) | 9.4k | |
autokeras 🧬 1. Core Frameworks & Libraries AutoML library for deep learning | 9.3k | |
OpenRLHF 🛡️ 10. AI Safety, Alignment & Interpretability An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL) | 9.3k | |
assistant-ui 🧪 13. Developer Tools & Integrations Typescript/React Library for AI Chat💬🚀 | 9.2k | |
phoenix 🧪 13. Developer Tools & Integrations📊 8. MLOps / LLMOps & Production AI Observability & Evaluation | 9.2k | |
BentoML 📊 8. MLOps / LLMOps & Production The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more! | 8.6k | |
ACE-Step-1.5 🎨 6. Generative Media Tools The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices. | 8.5k | |
bitsandbytes ⚡ 3. Inference Engines & Serving Accessible large language models via k-bit quantization for PyTorch. | 8.1k | |
Verba 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Retrieval Augmented Generation (RAG) chatbot powered by Weaviate | 7.6k | |
garak 🛡️ 10. AI Safety, Alignment & Interpretability🧪 13. Developer Tools & Integrations the LLM vulnerability scanner | 7.5k | |
garak 📊 8. MLOps / LLMOps & Production the LLM vulnerability scanner | 7.5k | |
mergekit 🛠️ 7. Training & Fine-tuning Ecosystem Tools for merging pretrained large language models. | 6.9k | |
opencompass 📈 9. Evaluation, Benchmarks & Datasets OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets. | 6.8k | |
IsaacLab 🧩 11. Specialized Domains Unified framework for robot learning built on NVIDIA Isaac Sim | 6.8k | |
SkyReels-V2 🎨 6. Generative Media Tools SkyReels-V2: Infinite-length Film Generative model | 6.7k | |
guardrails 📊 8. MLOps / LLMOps & Production Adding guardrails to large language models. | 6.6k | |
OLMo 🧠 2. Open Foundation Models Modeling, training, eval, and inference code for OLMo | 6.5k | |
TripoSR 🎨 6. Generative Media Tools TripoSR: Fast 3D Object Reconstruction from a Single Image | 6.3k | |
Liger-Kernel 🛠️ 7. Training & Fine-tuning Ecosystem Efficient Triton Kernels for LLM Training | 6.3k | |
StyleTTS2 🎨 6. Generative Media Tools StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models | 6.2k | |
data-juicer 🛠️ 7. Training & Fine-tuning Ecosystem Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷 | 6.2k | |
swarms 🤖 4. Agentic AI & Multi-Agent Systems The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai | 6.2k | |
NeMo-Guardrails 📊 8. MLOps / LLMOps & Production NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. | 5.9k | |
torchtune 🛠️ 7. Training & Fine-tuning Ecosystem PyTorch native post-training library | 5.7k | |
captum 🛡️ 10. AI Safety, Alignment & Interpretability Model interpretability and understanding for PyTorch | 5.6k | |
alignment-handbook 🛡️ 10. AI Safety, Alignment & Interpretability Robust recipes to align language models with human and AI preferences | 5.5k | |
Wonder3D 🎨 6. Generative Media Tools Single Image to 3D using Cross-Domain Diffusion for 3D Generation | 5.3k | |
zenml 📊 8. MLOps / LLMOps & Production ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io. | 5.3k | |
kserve 📊 8. MLOps / LLMOps & Production Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes | 5.3k | |
chronos-forecasting 🧩 11. Specialized Domains Chronos: Pretrained Models for Time Series Forecasting | 5.1k | |
AutoGPTQ ⚡ 3. Inference Engines & Serving An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. | 5.0k | |
RedPajama-Data 📈 9. Evaluation, Benchmarks & Datasets The RedPajama-Data repository contains code for preparing large datasets for training large language models. | 4.9k | |
argilla 🛠️ 7. Training & Fine-tuning Ecosystem Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets | 4.9k | |
gsplat 🎨 6. Generative Media Tools CUDA accelerated rasterization of gaussian splatting | 4.8k | |
AI-Scientist-v2 🤖 4. Agentic AI & Multi-Agent Systems The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search | 4.7k | |
LLaVA-NeXT 🧠 2. Open Foundation Models | 4.6k | |
exllamav2 ⚡ 3. Inference Engines & Serving A fast inference library for running LLMs locally on modern consumer-class GPUs | 4.5k | |
executorch 🧩 11. Specialized Domains On-device AI across mobile, embedded and edge for PyTorch | 4.5k | |
FLAML 🧬 1. Core Frameworks & Libraries A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP. | 4.3k | |
gemma 🧠 2. Open Foundation Models Gemma open-weight LLM library, from Google DeepMind | 4.3k | |
PurpleLlama 📊 8. MLOps / LLMOps & Production Set of tools to assess and improve LLM security. | 4.1k | |
RAGatouille 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research. | 3.9k | |
mistral-vibe 🤖 4. Agentic AI & Multi-Agent Systems Minimal CLI coding agent by Mistral | 3.8k | |
PhiCookBook 🧠 2. Open Foundation Models This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks | 3.7k | |
safetensors 🧬 1. Core Frameworks & Libraries Simple, safe way to store and distribute tensors | 3.7k | |
mochi 🎨 6. Generative Media Tools🧠 2. Open Foundation Models The best OSS video generation models, created by Genmo | 3.6k | |
SDV 🛠️ 7. Training & Fine-tuning Ecosystem Synthetic data generation for tabular data | 3.5k | |
optimum ⚡ 3. Inference Engines & Serving 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools | 3.3k | |
TransformerLens 🛡️ 10. AI Safety, Alignment & Interpretability A library for mechanistic interpretability of GPT-style language models | 3.3k | |
distilabel 🛠️ 7. Training & Fine-tuning Ecosystem Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers. | 3.2k | |
MiniMax-M1 🧠 2. Open Foundation Models MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. | 3.1k | |
llm-guard 🛡️ 10. AI Safety, Alignment & Interpretability📊 8. MLOps / LLMOps & Production The Security Toolkit for LLM Interactions | 2.8k | |
ao 🧬 1. Core Frameworks & Libraries PyTorch native quantization and sparsity for training and inference | 2.8k | |
helm 📈 9. Evaluation, Benchmarks & Datasets Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models. | 2.7k | |
OSWorld 📈 9. Evaluation, Benchmarks & Datasets [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments | 2.7k | |
colpali 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol. | 2.6k | |
evaluate 📈 9. Evaluation, Benchmarks & Datasets 🤗 Evaluate: A library for easily evaluating machine learning models and datasets. | 2.4k | |
torchmetrics 🧬 1. Core Frameworks & Libraries Machine learning metrics for distributed, scalable PyTorch applications. | 2.4k | |
lighteval 📈 9. Evaluation, Benchmarks & Datasets Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends | 2.4k | |
AutoAWQ ⚡ 3. Inference Engines & Serving AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation: | 2.3k | |
GLM-V 🧠 2. Open Foundation Models GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning | 2.3k | |
llama-agents 🤖 4. Agentic AI & Multi-Agent Systems Deploy your agentic worfklows to production | 2.1k | |
starcoder2 🧠 2. Open Foundation Models Home of StarCoder2! | 2.1k | |
GLM-5 🧠 2. Open Foundation Models GLM-5: From Vibe Coding to Agentic Engineering | 2.0k | |
llama.vim 🧪 13. Developer Tools & Integrations Vim plugin for LLM-assisted code/text completion | 1.9k | |
aphrodite-engine ⚡ 3. Inference Engines & Serving Large-scale LLM inference engine | 1.7k | |
Kimi-K2.5 🧠 2. Open Foundation Models Moonshot's most powerful model | 1.7k | |
nanocoder 🤖 4. Agentic AI & Multi-Agent Systems A beautiful local-first coding agent running in your terminal - built by the community for the community ⚒ | 1.6k | |
safe-rlhf 🛡️ 10. AI Safety, Alignment & Interpretability Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback | 1.6k | |
sd3.5 🎨 6. Generative Media Tools | 1.5k | |
OuteTTS 🧠 2. Open Foundation Models🎨 6. Generative Media Tools Interface for OuteTTS models. | 1.4k | |
AutoTS 🧩 11. Specialized Domains Automated Time Series Forecasting | 1.4k | |
Newelle 🖥️ 12. User Interfaces & Self-hosted Platforms Newelle - Your Ultimate Virtual Assistant | 1.3k | |
SAELens 🛡️ 10. AI Safety, Alignment & Interpretability Training Sparse Autoencoders on Language Models | 1.3k | |
weave 📊 8. MLOps / LLMOps & Production Weave is a toolkit for developing AI-powered applications, built by Weights & Biases. | 1.1k | |
hqq ⚡ 3. Inference Engines & Serving Official implementation of Half-Quadratic Quantization (HQQ) | 924 | |
nnsight 🛡️ 10. AI Safety, Alignment & Interpretability The nnsight package enables interpreting and manipulating the internals of deep learned models. | 883 | |
livecodebench 📈 9. Evaluation, Benchmarks & Datasets Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code" | 834 | |
JaxMARL 🧩 11. Specialized Domains Multi-Agent Reinforcement Learning with JAX | 780 | |
ome ⚡ 3. Inference Engines & Serving Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton | 412 | |
NornicDB 🔍 5. Retrieval-Augmented Generation (RAG) & Knowledge Nornicdb is a low-latency, Graph + Vector, Temporal MVCC with all sub-ms HNSW search, graph traversal, and writes. Uses Neo4j Bolt/Cypher and qdrant's gRPC drivers so you can switch with no changes. Then, adding intelligent features like schemas, managed embeddings, LLM reranking+inferrence, GPU acceleration, Auto-TLP, Memory Decay, and MCP server. | 375 | |
MMLU-Pro 📈 9. Evaluation, Benchmarks & Datasets The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024] | 359 | |
forgetful 🤖 4. Agentic AI & Multi-Agent Systems Opensource Memory for Agents | 237 | |
gpt4all 🖥️ 12. User Interfaces & Self-hosted Platforms GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. | 77.2k | -1 |
ColossalAI 🛠️ 7. Training & Fine-tuning Ecosystem Making large AI models cheaper, faster and more accessible | 41.4k | -3 |