Logo

dev-resources.site

for different kinds of informations.

Train LLM From Scratch

Published at
1/14/2025
Categories
ai
python
opensource
datascience
Author
fareedkhan557
Categories
4 categories in total
ai
open
python
open
opensource
open
datascience
open
Author
13 person written this
fareedkhan557
open
Train LLM From Scratch

I created an end-to-end LLM training project, from downloading the training dataset to generating text with the trained model. It currently supports the PILE dataset, a diverse data for LLM training. You can limit the dataset size, customize the default transformer architecture and training configuration, and more.

This is what my 13 million parameter-trained LLM output looks like, trained on a Colab T4 GPU:

In \*\*\*1978, The park was returned to the factory-plate that the public share to the lower of the electronic fence that follow from the Station's cities. The Canal of ancient Western nations were confined to the city spot. The villages were directly linked to cities in China that revolt that the US budget and in Odambinais is uncertain and fortune established in rural areas.

It's more about learning than making the absolute best AI right away.

Code, documentation, and example can all be found on GitHub:

Github Link

opensource Article's
30 articles in total
Open-source projects foster collaboration and innovation, allowing anyone to view, modify, and share the code freely.
Favicon
Memory Management in Operating Systems
Favicon
2025: The Year of Decentralization – How Nostr Will Make You a Standout Developer
Favicon
10 Must-Bookmark Open Source Projects for Developers
Favicon
[Boost]
Favicon
join my project semester simulator
Favicon
340+ Websites every developer should know
Favicon
KDE vs GNOME vs Others: Choosing the Best Linux Desktop Environment in 2025
Favicon
assert in Nodejs and its usage in Grida source code
Favicon
Contribute to `real-to-sim-to-real` in SmilingRobo Open-Source Sprint!
Favicon
Exploring the CNCF Landscape: A Comprehensive Overview of Cloud Native Technologies
Favicon
🎁 20 Open Source Projects You Shouldn't Miss in 2025
Favicon
Any recommendations of open source asset inventory ?
Favicon
Getting Started with the Open Source AI Hackathon
Favicon
Supercharge Your JavaScript Agents with Firecrawl in KaibanJS
Favicon
Top 10 Trending GitHub Repositories, Nov 24 2024
Favicon
Open-Source TailwindCSS React Color Picker - Zero Dependencies! Perfect for Next.js Projects!
Favicon
Procrastinator’s Guide to Glory: Turning Wasted Time Into Career Gold with Open Source
Favicon
Kubernetes Security Best Practices
Favicon
SPL: a database language featuring easy writing and fast running
Favicon
7 Open-Source Tools for Better Website Analytics
Favicon
Train LLM From Scratch
Favicon
Have you ever used `git submodules`?
Favicon
✨ Introducing Tooltip: A Revolutionary Suite of Developer Tools** ✨
Favicon
Becoming An Open Source Maintainer
Favicon
Open-Source React Icon Picker: Lightweight, Customizable, and Built with ShadCN, TailwindCSS. Perfect for Next.js Projects!
Favicon
Enhance Your App's Security with OTP-Agent
Favicon
3 essential elements for Web publishing
Favicon
Pulumi WASM/Rust devlog #3
Favicon
Sign up to our bug bounty platform!
Favicon
lodash._merge vs Defu

Featured ones: