A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & Agno integration)
A high-throughput and memory-efficient inference and serving engine for LLMs
A unified inference and post-training framework for accelerated video generation.
Portable file server with accelerated resumable uploads, dedup, WebDAV, FTP, TFTP, zeroconf, media indexer, thumbnails++, all in one file, no deps
An Open Source implementation of Notebook LM with more flexibility and features
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Cybersecurity AI (CAI), the framework for AI Security
Financial data platform for analysts, quants and AI agents.
A curated list of awesome commands, files, and workflows for Claude Code
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Simple, scalable AI model deployment on GPU clusters
Fast and memory-efficient exact attention
LightRAG: Simple and Fast Retrieval-Augmented Generation
Verifiers for LLM Reinforcement Learning
OCR, layout analysis, reading order, table recognition in 90+ languages
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.