Skip to content

Latest commit

 

History

History
291 lines (196 loc) · 19.3 KB

search-problem-space.md

File metadata and controls

291 lines (196 loc) · 19.3 KB

Screenshot 2024-03-25 at 1 44 32 PM. Twitter thread has a rich discussion of the AI vs Cloud Providers https://x.com/rakyll/status/1771641289840242754?s=20. The emerging AI Cloud is simpler to use.

This repo is an exploration & building modals Programming Artificial intelligence. As we evolve Compound AI Systems I hope we preserve the simplicity.

Attention Timeline

-- Introduction to Transformers w/ Andrej Karpathy
-- A Comprehensive Overview of Large Language Models

"Open" & Closed

Open - w/ Weights, Training & Inference Code, Data & Evaluation

Andrej Karpathy Talking about the importance of building a more open and vibrant AI ecosystem

Tour of Modern LLMs (and surrounding topics)

--

The successful art of model merging is often based purely on experience and intuition of a passionate model hacker, .... In fact, the current Open LLM Leaderboard is dominated by merged models. Surprisingly, merged models work without any additional training, making it very cost-effective (no GPUs required at all!), and so many people, researchers, hackers, and hobbyists alike, are trying this to create the best models for their purposes.

image What I learned from looking at 900 most popular open source AI tools

Compound AI Systems

state-of-the-art AI results are increasingly obtained by compound systems with multiple components, not just monolithic models.
The Shift from Models to Compound AI Systems

Generative language models (LMs) are often chained together and combined with other components into compound AI systems. Compound AI system applications include retrieval-augmented generation (RAG). structured prompting, chatbot verification, multi-hop question answering, agents, and SQL query generation. ALTO: An Efficient Network Orchestrator for Compound AI Systems

Hardware & Accelerators

Leaderboards, Benchmarks & Evaluations

Intelligence is the computational part of the ability to achieve goals. A goal achieving system is one that is more usefully understood in terms of outcomes than in terms of mechanisms.
The Definition of Intelligence

We don't know how to measure LLM abilities well. Most tests are groups of multiple choice questions, tasks, or trivia - they don't represent real world uses well, they are subject to gaming & results are impacted by prompt design in unknown ways. Or they use human preference. Non-trivial Taxonomy in real-world, starting with clear domains / Common LLM workloads:

  1. Languages - Rankings by domain -> https://huggingface.co/models -> Tasks & Languages
  2. Model Card - claude 3 model card Coding, Creative Writing, Instruction-following, Long Document Q&A
  3. Chat, RAG, few-shot benchmark, etc.
  4. Coding - "Code a login component in React"
  5. Freshness - "What was the Warriors game score last night?"
  6. Agent - Web Agents -> https://turkingbench.github.io/
  7. Multimodal (images and video)
  8. Reasoning?

--

--

Inferencing

image

Application Programming Interfaces (APIs) are at the heart of all internet software, compounding with Foundational Model API-First.

Gateway API Concepts: https://gateway-api.sigs.k8s.io/ Use Cases: https://gateway-api.sigs.k8s.io/#use-cases

Cloud Providers: In the ever-evolving cloud computing landscape, understanding the Gateway API is crucial for those using Kubernetes. This API enhances application traffic management, offering better routing and security. For seamless integration of AI into cloud-native applications, a robust framework ? streamlining the deployment and management of AI-driven solutions. Dive into the Gateway API for insights and explore Use Cases for cutting-edge application management.

Training

image

abstract away cloud infra burdens, Launch jobs & clusters on any cloud, Maximize GPU usage

Prompt

Programming—not prompting—Language Models

Frameworks & Scripting

UI/UX

Hugging Face

Vercel

Standards

Security

Learning

Artificial intelligence has recently experienced remarkable advances, fueled by large models, vast datasets, accelerated hardware, and, last but not least, the transformative power of differentiable programming. This new programming paradigm enables end-to-end differentiation of complex computer programs (including those with control flows and data structures), making gradient-based optimization of program parameters possible.
As an emerging paradigm, differentiable programming builds upon several areas of computer science and applied mathematics, including automatic differentiation, graphical models, optimization and statistics. This book presents a comprehensive review of the fundamental concepts useful for differentiable programming. We adopt two main perspectives, that of optimization and that of probability, with clear analogies between the two.
Differentiable programming is not merely the differentiation of programs, but also the thoughtful design of programs intended for differentiation. By making programs differentiable, we inherently introduce probability distributions over their execution, providing a means to quantify the uncertainty associated with program outputs. The Elements of Differentiable Programming (Draft, ~380 Pages!)

A best place to learn all in one place https://huggingface.co/docs & Open-Source AI Cookbook

Articles & Talks

AI Twitter & Discord

Papers

--

Transformers Slides

image

(12) United States Patent

GD7LKuvCY5w8OtmsOMASxS5mpCBUvkGdXxeYMSSYx3ZrOogCkN0GOdoBvCD5DK-1t64R31RpIJhUMYAhmRQJb7KuK1rFLSRiVPGXfPGrwQxorwybqc7rE-F7nVTH