Large Language Model

admin · April 10, 2023, 9:41pm

LLM Models

There are now more and more high quality open Models available to Compute Owners e.g.

Google's Gemma
Facebook's Llama
Alibaba's Qwen
01.ai's Yi

Above models are all available in Ollama (see below).

Model Providers

There are many Model Providers for the different Models e.g.

Ollama is the default as of 2024-08-24.

Model Interfaces

Must support Retrieval Augmented Generation (RAG) since traditional LLMs are difficult to for non-technical Compute Owners to customise (e.g. fine-tuning GPT-3.5)

Anything LLM
GitHub - Mintplex-Labs/anything-llm: The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Open-WebUI
GitHub - open-webui/open-webui: User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
LLMStack
GitHub - trypromptly/LLMStack: No-code multi-agent framework to build LLM Agents, workflows and applications with your data
Danswer
GitHub - danswer-ai/danswer: Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
GPT4All
GitHub - nomic-ai/gpt4all: GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

admin · February 15, 2024, 3:05am

Mix and Match AI

Below are some preferred AI Model standards:

1. Model File Format

GGUF
This is now the standard used by a lot of AI applications.
References:
ggml/docs/gguf.md at master · ggerganov/ggml · GitHub

2. Parameters

5B or above
If possible turn to use LLM with at least 5 billion parameters - the higher the better the quality but uses more resources.

3. Quantization

Q4_K_M or above
The 4 after the Q indicates the number of bits - the higher the better the quality but uses more resources.
References:
Quantization of LLMs with llama.cpp | by Ingrid Stevens | Medium

admin · July 20, 2024, 7:08am

Translation

Collections

Models

BigTranslate

BigTranslate is from Institute of Automation of the Chinese Academy of Sciences (CASIA).

Code:

GitHub - ZNLP/BigTranslate: BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages

References:

Paper page - BigTrans: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages
TheBloke/BigTranslate-13B-GPTQ · Hugging Face

Llama 3

Llama 3 supports multiple languages:

English
Spanish
French
German
Italian
Portuguese
Dutch
Russian
Chinese
Japanese
Korean

As of 2024-07-20 it has 3 different sizes: 8 Billion (available), 70 Billion (available), 400 Billion (almost there!) parameters.

References:

admin · July 20, 2024, 7:51am

Open Source Models

Despite its Open AI name, the ChatGPT it developed is not open sourced, but open sourced Large Language Models are being developed quickly by others:

1. Llama

As of 2024-08-12 the default LLM model is Llama 3.1

Data Cut-off Month: 2023-12

1. Stanford Alpaca

Promising for non-commercial applications.

Interesting how they took it down after short time online:

Can be used on less powerful hardware:

2. FLAN UL2

Promising for commercial applications.

20 Billion parameters can be a bit heavy but the gains may be worth it over the older and leaner FLAN-T5 it is based on.

Cerebras GPT - cerebras/Cerebras-GPT-13B · Hugging Face

admin · August 25, 2024, 7:35am

Retrieval Augmented Generation