Journey towards self hosted AI code completion

Published at: 5/1/2024
Categories: llama3, vscode, webdev, ai
Author: sp90
Journey towards self hosted AI code completion

So after I saw a video of Llama 3 8B doing 800-1000 tokens/s, I instantly thought it would be phenomenal to have that on my laptop, instantly giving me responses that I could select or discard, with the added side benefit of learning some new tech.

I once coded a small AI trained on around 100,000 items to predict housing prices; let me say it was not great.

Then again, I don't have billions upon billions of records to train on, nor the compute to train a decent model, but I do like having control over my coding environment.

The first goal

Having Llama 3 8B running locally, autocompleting code snippets.

Yeah, I know the bar is low, but it is most definitely a useful starting point. I personally don't believe in setting unobtainable goals, because failing at something hard is not as great as succeeding in small increments.
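To give an idea of what that first goal could look like in practice, here is a minimal sketch. It assumes Llama 3 8B is served locally through Ollama at its default endpoint (`http://localhost:11434`) and the `llama3:8b` model is pulled; the function names and prompt wording are my own illustration, not a finished tool.

```python
# Hypothetical sketch: ask a locally running Ollama server to continue
# a code snippet. Assumes `ollama pull llama3:8b` has been run and the
# server is listening on the default port 11434.
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's generate endpoint


def build_completion_request(prefix: str, model: str = "llama3:8b") -> bytes:
    """Build the JSON payload asking the model to continue `prefix`."""
    payload = {
        "model": model,
        "prompt": f"Complete the following code, reply with code only:\n{prefix}",
        "stream": False,  # single response instead of a token stream
    }
    return json.dumps(payload).encode("utf-8")


def complete(prefix: str) -> str:
    """Send the completion request and return the model's suggestion."""
    req = urllib.request.Request(
        OLLAMA_URL,
        data=build_completion_request(prefix),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

From there, wiring `complete()` into an editor so suggestions can be accepted or discarded is the actual work; this just shows how small the server side of the loop is.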

Lastly, I intend to make a guide on my complete journey; subscribe and like to get notified when new content is published.

The banner video is from groq.com, a great service where you can test open source models like Llama. I'm not sponsored by them; I just wanted to give credit.
