- 900 most popular open source AI tools:🏆What I learned from looking at 900 most popular open source AI tools list [Mar 2024]
- Open100: Top 100 Open Source achievements.
- Generative AI Design Patterns: 9 architecture patterns for working with LLMs. [Feb 2024]
- Awesome LLM Apps:💡A curated collection of awesome LLM apps built with RAG and AI agents. [Apr 2024]
- GenAI Agents:🏆Tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. [Sep 2024]
- The Last Mile Problem: Why Your AI Models Stumble Before the Finish Line: According to Gartner, by 2025, at least 30 percent of GenAI projects will be abandoned after the POC stage. [25 Oct 2024]
- Azure OpenAI Samples: 🐳 Azure OpenAI (OpenAI) Sample Collection - 🪂 100+ Code Cookbook 🧪 [Mar 2025]
- Micro-Orchestration: LangChain, LlamaIndex, Haystack, AdalFlow | Macro-Orchestration: LangGraph, Burr | Agentic Design: Autogen, CrewAI | Optimizer: DSPy, TextGrad
- x-ref: LangChain | x-ref: LlamaIndex | x-ref: DSPy | x-ref: Semantic Kernel
- x-ref: Agent Framework | x-ref: RAG Development | x-ref: Prompting Frameworks | x-ref: Microsoft Azure OpenAI LLM Framework
- fastText: A library for efficient learning of word representations and sentence classification [Aug 2016]
- Pytorch: PyTorch is the most favorite library among researchers. Papers with code Trends [Sep 2016]
- fairseq: a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling [Sep 2017]
- huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. (github.com) [Oct 2018]
- jax: JAX is Autograd (automatically differentiate native Python & Numpy) and XLA (compile and run NumPy) [Oct 2018]
- Sentence Transformers: Python framework for state-of-the-art sentence, text and image embeddings. Useful for semantic textual similar, semantic search, or paraphrase mining. git [27 Aug 2019]
- Weights & Biases: Visualizing and tracking your machine learning experiments wandb.ai doc:
deeplearning.ai/wandb
[Jan 2020] - Reranker: Training and deploying deep languge model reranker in information retrieval (IR), question answering (QA) [Jan 2021]
- mosaicml/llm-foundry: LLM training code for MosaicML foundation models [Jun 2022]
- vLLM: Easy-to-use library for LLM inference and serving. [Feb 2023]
- string2string: an open-source tool that offers a comprehensive suite of efficient algorithms for a broad range of string-to-string problems. [Mar 2023]
- GPT4All: Open-source large language models that run locally on your CPU [Mar 2023]
- Visual Blocks: Google visual programming framework that lets you create ML pipelines in a no-code graph editor. [Mar 2023]
- LLaMA-Factory: Unify Efficient Fine-Tuning of 100+ LLMs [May 2023]
- WebLLM: High-Performance In-Browser LLM Inference Engine. [Apr 2023]
- ollama:💡Running with Large language models locally [Jun 2023]
- unsloth: Finetune Mistral, Gemma, Llama 2-5x faster with less memory! [Nov 2023]
- LM Studio: UI for Discover, download, and run local LLMs [May 2024]
- YaFSDP: Yet another Fully Sharded Data Parallel (FSDP): enhanced for distributed training. YaFSDP vs DeepSpeed. [May 2024]
- exo: Run your own AI cluster at home with everyday devices [Jun 2024]
- KTransformers: A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations [Jul 2024]
- Kiln: Desktop Apps for for fine-tuning LLM models, synthetic data generation, and collaborating on datasets. [Aug 2024]
- BitNet: Official inference framework for 1-bit LLMs [Aug 2024]
- Meta Lingua: a minimal and fast LLM training and inference library designed for research. [Oct 2024]
- smolmodels: Create machine learning models with minimal code by describing what you want them to do in plain words. [Jan 2025]
- mindsdb: The open-source virtual database for building AI from enterprise data. It supports SQL syntax for development and deployment, with over 70 technology and data integrations. [Aug 2018]
- Jina-Serve: a framework for building and deploying AI services that communicate via gRPC, HTTP and WebSockets. [Feb 2020]
- superduper: Build end-to-end AI-data workflows and applications with your favourite tools. [Aug 2022]
- langflow: LangFlow is a UI for LangChain, designed with react-flow. [Feb 2023]
- MiniChain: A tiny library for coding with llm [Feb 2023]
- marvin: a lightweight AI toolkit for building natural language interfaces. [Mar 2023]
- rivet: an open-source visual AI programming environment and TypeScript library [Mar 2023]
- microsoft/Tokenizer: Tiktoken in C#: .NET and TypeScript implementation of BPE tokenizer for OpenAI LLMs. [Mar 2023]
- Azure OpenAI Proxy: OpenAI API requests converting into Azure OpenAI API requests [Mar 2023]
- ChainForge: An open-source visual programming environment for battle-testing prompts to LLMs. [Mar 2023]
- E2B: an open-source infrastructure that allows you run to AI-generated code in secure isolated sandboxes in the cloud. [Mar 2023]
- Flowise Drag & drop UI to build your customized LLM flow [Apr 2023]
- Dify: an open-source platform for building applications with LLMs, featuring an intuitive interface for AI workflows and model management. [Apr 2023]
- ThinkGPT: Chain of Thoughts library [Apr 2023]
- langfuse: Traces, evals, prompt management and metrics to debug and improve your LLM application. [May 2023]
- Superagent: AI Assistant Framework & API [May 2023]
- DemoGPT: Automatic generation of LangChain code [Jun 2023]
- Spring AI: Developing AI applications for Java. [Jul 2023]
- litellm: Python SDK to call 100+ LLM APIs in OpenAI format [Jul 2023]
- Opencopilot: Build and embed open-source AI Copilots. [Aug 2023]
- BISHENG: an open LLM application devops platform, focusing on enterprise scenarios. [Aug 2023]
- langfun: leverages PyGlove to integrate LLMs and programming. [Aug 2023]
- mirascope: a library that simplifies working with LLMs via a unified interface for multiple providers. [Dec 2023]
- Pipecat: Open Source framework for voice and multimodal conversational AI [Dec 2023]
- Refly: WYSIWYG AI editor to create llm application. [Feb 2024]
- RouteLLM: A framework for serving and evaluating LLM routers [Jun 2024]
- Llama Stack:💡building blocks for Large Language Model (LLM) development [Jun 2024]
- Harbor: Effortlessly run LLM backends, APIs, frontends, and services with one command. a helper for the local LLM development environment. [Jul 2024]
- aisuite: Andrew Ng launches a tool offering a simple, unified interface for multiple generative AI providers. [26 Nov 2024]
vs litellm vs OpenRouter
- PocketFlow: Minimalist LLM Framework in 100 Lines. Enable LLMs to Program Themselves. [Dec 2024]
- zep: Long term memory layer. Zep intelligently integrates new information into the user's Knowledge Graph.
[May 2023]
- Mem0:💡A self-improving memory layer for personalized AI experiences. [Jun 2023]
- cognee: LLM Memory using Dynamic knowledge graphs [Aug 2023]
- Letta (previously MemGPT): Virtual context management to extend the limited context of LLM. A tiered memory system and a set of functions that allow it to manage its own memory. ref / git:old [12 Oct 2023]
- Memary: memary mimics how human memory evolves and learns over time. The memory module comprises the Memory Stream and Entity Knowledge Store. [May 2024]
- Graphiti: Graphiti leverages zep’s memory layer. Build Real-Time Knowledge Graphs for AI Agents [Aug 2024]
- Agentic Memory: A dynamic memory system for LLM agents, inspired by the Zettelkasten method, enabling flexible memory organization. [17 Feb 2025]
- OpenBB: The first financial Platform that is free and fully open source. AI-powered workspace [Dec 2020]
- knowledge: Tool for saving, searching, accessing, and exploring websites and files. Electron based app, built-in Chromium browser, knowledge graph [Jul 2021]
- Nomic python client: Generate, store and retrieve embeddings for your unstructured data. supports from hundreds to tens of millions of points. [Jul 2022]
- guardrails: Adding guardrails to large language models. [Jan 2023]
- aider: AI pair programming in your terminal [Jan 2023]
- BookGPT: Generate books based on your specification [Jan 2023]
- KnowledgeGPT: Upload your documents and get answers to your questions, with citations [Jan 2023]
- DocsGPT: Chatbot for document with your data [Feb 2023]
- LibreChat: a free, open source AI chat platform. [8 Mar 2023]
- BIG-AGI FKA nextjs-chatgpt-app [Mar 2023]
- Next.js AI Chatbot:💡An Open-Source AI Chatbot Template Built With Next.js and the AI SDK by Vercel. [May 2023]
- dataline: Chat with your data - AI data analysis and visualization [Apr 2023]
- pyspark-ai: English instructions and compile them into PySpark objects like DataFrames. [Apr 2023]
- vanna: Chat with your SQL database [May 2023]
- localGPT: Chat with your documents on your local device [May 2023]
- anything-llm: All-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more. [Jun 2023]
- Dialoqbase: Create custom chatbots with your own knowledge base using PostgreSQL [Jun 2023]
- GPT Researcher: Autonomous agent designed for comprehensive online research [Jul 2023] / GPT Newspaper: Autonomous agent designed to create personalized newspapers [Jan 2024]
- Postiz: AI social media scheduling tool. An alternative to: Buffer.com, Hypefury, Twitter Hunter. [Jul 2023]
- SolidGPT: AI searching assistant for developers (VSCode Extension) [Aug 2023]
- notesGPT: Record voice notes & transcribe, summarize, and get tasks [Nov 2023]
- screenshot-to-code: Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue) [Nov 2023]
- Geppeto: Advanced Slack bot using multiple AI models [Jan 2024]
- code2prompt: a command-line tool (CLI) that converts your codebase into a single LLM prompt with a source tree [Mar 2024]
- OpenHands: OpenHands (formerly OpenDevin), a platform for software development agents [Mar 2024]
- LlamaFS: Automatically renames and organizes your files based on their contents [May 2024]
- Micro Agent: An AI agent that writes and fixes code for you. Test case driven code generation [May 2024]
- Cellm: Use LLMs in Excel formulas [Jul 2024]
- Nyro: AI-Powered Desktop Productivity Tool [Aug 2024]
- Auto_Jobs_Applier_AIHawk: automates the jobs application [Aug 2024]
- PDF2Audio: an open-source alternative to NotebookLM for podcast creation [Sep 2024]
- o1-engineer: a command-line tool designed to assist developers [Sep 2024]
- Cofounder: full stack generative web apps ; backend + db + stateful web apps [Sep 2024]
- Podcastfy.ai: An Open Source API alternative to NotebookLM's podcast feature. [Oct 2024]
- DeepSeek Engineer: Simple and just a few lines of code. a powerful coding assistant application that integrates with the DeepSeek API [Dec 2024]
- Meetily: Open source Ai Assistant for taking meeting notes [Dec 2024]
- GitHub Copilot:🏆 AI pair programmer by GitHub and OpenAI. Supports VS Code, Visual Studio, Neovim, and JetBrains IDEs. [29 Jun 2021]
- Zed: a high-performance, multiplayer code editor. the creators of Atom and Tree-sitter. [Feb 2021]
- AI Code Editor: Replit Agent [09 Sep 2024] / Cursor [Mar 2023]
- Continue: open-source AI code assistant inside of VS Code and JetBrains. [May 2023]
- Vercel AI Vercel AI Toolkit for TypeScript [May 2023]
- Firebase Studio: Google. a cloud-based, agentic development environment. [14 May 2024]
- Cline: CLI aNd Editor. Autonomous coding agent. VSCode Extension. [Jul 2024]
- Llama Coder: Open source Claude Artifacts ref: demo [Jul 2024]
- void OSS Cursor alternative. a fork of vscode [Oct 2024]
- Github Spark: an AI-powered tool for creating and sharing micro apps (“sparks”) [29 Oct 2024]
- bolt.new: Dev Sanbox with AI from stackblitz [Sep 2024]
- Windsurf editor: Flows = Agents + Copilots. Cascades (a specific implementation of AI Flows. Advanced chat interface). [13 Nov 2024]
- devin.cursorrules: Transform your Cursor or Windsurf IDE into a Devin-like AI Assistant [Dec 2024]
- CodeLLM by Abacus.AI: AI-powered code editor with automatic selection of state-of-the-art LLMs based on coding tasks. [Dec 2024]
- Trae: Bytedance. Free, but not open-source. [20 Jan 2025]
- LLM Debugger: a VSCode extension that demonstrates the use of LLMs for active debugging of programs. [Feb 2025]
- Code Shaping: Editing code with free-form sketch annotations on the code and console output. [6 Feb 2025] ref
- Deepsite: Mockup UI generator Powered by Deepseek [Mar 2025]
- streamlit:💡Streamlit — A faster way to build and share data apps. [Jan 2018]
- Gradio: Build Machine Learning Web Apps - in Python [Mar 2023]
- GPT 学术优化 (GPT Academic): UI Platform for Academic & Coding Tasks. Optimized for paper reading, writing, and editing. [Mar 2023]
- Text generation web UI: Text generation web UI [Mar 2023]
- Open AI Chat Mockup: An open source ChatGPT UI. mckaywrigley/chatbot-ui [Mar 2023]
- chainlit:💡Build production-ready Conversational AI applications in minutes. [Mar 2023]
- CopilotKit: Built-in React UI components [Jun 2023]
- Open-source GPT Wrappers 1. ChatGPT-Next-Web [Mar 2023] 2. FastGPT [Feb 2023] 3. Lobe Chat [Jan 2024]
- anse: UI for multiple models such as ChatGPT, DALL-E and Stable Diffusion. [Apr 2023]
- Open WebUI: User-friendly AI Interface (Supports Ollama, OpenAI API, ...) [Oct 2023]
- mesop: Rapidly build AI apps in Python [Oct 2023]
- Camelot a Python library that can help you extract tables from PDFs! ref: Comparison with other PDF Table Extraction libraries [Jul 2016]
- Trafilatura: Gather text from the web and convert raw HTML into structured, meaningful data. [Apr 2019]
- Math formula OCR: MathPix, OSS LaTeX-OCR [Jan 2021]
- activeloopai/deeplake: AI Vector Database for LLMs/LangChain. Doubles as a Data Lake for Deep Learning. Store, query, version, & visualize any data. Stream data in real-time to PyTorch/TensorFlow. ref [Jun 2021]
- PostgresML: The GPU-powered AI application database. [Apr 2022]
- agent-twitter-client: twitter-scraper. A Twitter client for agents. no API key necessary [May 2022]
- unstructured: Open-Source Pre-Processing Tools for Unstructured Data [Sep 2022]
- outlines: Structured Text Generation [Mar 2023]
- pandas-ai: Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). [Apr 2023]
- Instructor: Structured outputs for LLMs, easily map LLM outputs to structured data. [Jun 2023]
- Nougat: Neural Optical Understanding for Academic Documents: The academic document PDF parser that understands LaTeX math and tables. git [25 Aug 2023]
- Marker: converts PDF to markdown [Oct 2023]
- Maxun: Open-Source No-Code Web Data Extraction Platform [Oct 2023]
- GPT Crawler: Crawl a site to generate knowledge files to create your own custom GPT from one or multiple URLs [Nov 2023]
- surya: OCR, layout analysis, reading order, table recognition in 90+ languages [Jan 2024]
- firecrawl: Scrap entire websites into LLM-ready markdown or structured data. [Apr 2024]
- LLM Scraper: a TypeScript library that allows you to extract structured data from any webpage using LLMs. [Apr 2024]
- ExtractThinker: A Document Intelligence library for LLMs with ORM-style interaction for flexible workflows. [Apr 2024]
- Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper [May 2024]
- MegaParse: a powerful and versatile parser that can handle various types of documents. Focus on having no information loss during parsing. [30 May 2024]
- Zerox OCR: Zero shot pdf OCR with gpt-4o-mini [Jul 2024]
- pipet: Swiss-army tool for scraping and extracting data from online [Sep 2024]
- docling: IBM. Docling parses documents and exports them to the desired format. [13 Nov 2024]
- markitdown:💡Python tool for converting files and office documents to Markdown. [14 Nov 2024]
- Ollama OCR: A powerful OCR (Optical Character Recognition) package that uses state-of-the-art vision language models. [Nov 2024]
- Gitingest: Turn any Git repository into a prompt-friendly text ingest for LLMs. [Nov 2024]
- Paperless-AI: An automated document analyzer for Paperless-ngx using OpenAI API, Ollama and all OpenAI API compatible Services to automatically analyze and tag your documents. [Dec 2024]
- Azure AI Document Intelligence (FKA. Azure Form Recognizer): ref: Table and Meta data Extraction in the Document
- Table to Markdown: LLM can recognize Markdown-formatted tables more effectively than raw table formats.
- Streaming with Azure OpenAI SSE [May 2023]
- TaxyAI/browser-extension: Browser Automation by Chrome debugger API and Prompt >
src/helpers/determineNextAction.ts
[Mar 2023] - Embedding does not use Open AI. Can be executed locally: pdfGPT [Mar 2023]
- Langchain Ask PDF (Tutorial): git [Apr 2023]
- RAG: x-ref
- Agent Applications and Libraries: x-ref
- OSS Alternatives for OpenAI Code Interpreter: x-ref
- LLMOps: Large Language Model Operations: x-ref
- Caching: A technique to store data that has been previously retrieved or computed, so that future requests for the same data can be served faster.
- To reduce latency, cost, and LLM requests by serving pre-computed or previously served responses.
- Strategies for caching: Caching can be based on item IDs, pairs of item IDs, constrained input, or pre-computation. Caching can also leverage embedding-based retrieval, approximate nearest neighbor search, and LLM-based evaluation. ref
- GPTCache: Semantic cache for LLMs. Fully integrated with LangChain and llama_index. git [Mar 2023]
- Prompt Cache: Modular Attention Reuse for Low-Latency Inference: LLM inference by reusing precomputed attention states from overlapping prompts. [7 Nov 2023]
- Prompt caching with Claude: Reducing costs by up to 90% and latency by up to 85% for long prompts. [15 Aug 2024]
- Defensive UX: A design strategy that aims to prevent and handle errors in user interactions with machine learning or LLM-based products.
- Why defensive UX?: Machine learning and LLMs can produce inaccurate or inconsistent output, which can affect user trust and satisfaction. Defensive UX can help by increasing accessibility, trust, and UX quality.
- Guidelines for Human-AI Interaction: Microsoft: Based on a survey of 168 potential guidelines from various sources, they narrowed it down to 18 action rules organized by user interaction stages.
- People + AI Guidebook: Google: Google’s product teams and academic research, they provide 23 patterns grouped by common questions during the product development process3.
- Human Interface Guidelines for Machine Learning: Apple: Based on practitioner knowledge and experience, emphasizing aspects of UI rather than model functionality4.
- /llms.txt: Proposal for an
/llms.txt
file to guide LLMs in using websites during inference. git [3 Sep 2024] - Model Context Protocol (MCP): Anthropic proposes an open protocol for seamless LLM integration with external data and tools. git [26 Nov 2024]
x-ref
- Vibe Coding: a method where programmers explain their goals in simple language, and AI creates the code. Suggested by Andrej Karpathy in February 2025, it changes the programmer's role to guiding, testing, and improving the AI's work. [Feb 2025]
- A2A: Google. Agent2Agent (A2A) protocol. Agent Card (metadata: self-description). Task (a unit of work). Artifact (output). Streaming (Long-running tasks). Leverages HTTP, SSE, and JSON-RPC. Multi-modality incl. interacting with UI components [Mar 2025]
- PromptCraft-Robotics: Robotics and a robot simulator with ChatGPT integration git [Feb 2023]
- ChatGPT-Robot-Manipulation-Prompts: A set of prompts for Communication between humans and robots for executing tasks. git [Apr 2023]
- Siemens Industrial Copilot ref [31 Oct 2023]
- LeRobot: Hugging Face. LeRobot aims to provide models, datasets, and tools for real-world robotics in PyTorch. git [Jan 2024]
- Mobile ALOHA: Stanford’s mobile ALOHA robot learns from humans to cook, clean, do laundry. Mobile ALOHA extends the original ALOHA system by mounting it on a wheeled base ref [4 Jan 2024] / ALOHA: A Low-cost Open-source Hardware System for Bimanual Teleoperation.
- Figure 01 + OpenAI: Humanoid Robots Powered by OpenAI ChatGPT 📺 [Mar 2024]
- Gemini Robotics: Robotics built on the foundation of Gemini 2.0 [12 Mar 2025]
- FRVR Official Teaser: Prompt to Game: AI-powered end-to-end game creation [16 Jun 2023]
- rewind.ai: Rewind captures everything you’ve seen on your Mac and iPhone [Nov 2023]
- Vercel announced V0.dev: Make a snake game with chat [Oct 2023]
- Mobile ALOHA: A day of Mobile ALOHA [4 Jan 2024]
- groq: An LPU Inference Engine, the LPU is reported to be 10 times faster than NVIDIA’s GPU performance ref [Jan 2024]
- Sora: Introducing Sora — OpenAI’s text-to-video model [Feb 2024]
- Oasis: Minecraft clone. Generated by AI in Real-Time. The first playable AI model that generates open-world games. ref git [31 Oct 2024]