Top-Trending LLMs Over the Last Week

Published at
3/19/2024
Categories
llm
ai
largelanguagemodels
Author
llm_explorer
In this post, we'd like to share the list of top trending models that caught people's attention in the AI world over the last week. We ranked them by how many times they were downloaded and liked, based on information from Hugging Face and LLM Explorer.
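As a rough illustration of what ranking by downloads and likes can look like, here is a minimal Python sketch. The scoring rule (downloads and likes each scaled by their maximum, then summed), the `ModelStats` type, the `like_weight` parameter, and the sample numbers are all assumptions made for this example; the post does not say how the Hugging Face and LLM Explorer data were actually combined.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelStats:
    """Hypothetical record holding one model's weekly popularity signals."""
    name: str
    downloads: int
    likes: int


def rank_models(models, like_weight=1.0):
    """Return models sorted from most to least popular.

    Assumed scoring rule (not from the post): each signal is divided by
    its maximum so that raw download counts, which are usually much larger
    than like counts, do not dominate purely because of scale.
    """
    max_downloads = max((m.downloads for m in models), default=0) or 1
    max_likes = max((m.likes for m in models), default=0) or 1

    def score(m):
        return m.downloads / max_downloads + like_weight * m.likes / max_likes

    return sorted(models, key=score, reverse=True)


# Made-up placeholder numbers, not real statistics.
sample = [
    ModelStats("model-a", downloads=1000, likes=10),
    ModelStats("model-b", downloads=500, likes=50),
    ModelStats("model-c", downloads=100, likes=5),
]
print([m.name for m in rank_models(sample)])  # ['model-b', 'model-a', 'model-c']
```

Setting `like_weight=0` in this sketch ranks by downloads alone, which is a quick way to see how much the likes signal changes the order.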

Top-Trending LLMs Over the Last Week

  1. C4AI Command-R v01, developed by Cohere For AI, leads the pack with its 35 billion parameters, excelling in reasoning, summarization, and question answering across multiple languages. It's designed for devices with limited processing power, making advanced AI tasks more accessible.

Community Insights:
AI practitioners praise the C4AI Command-R model for its performance on tasks such as reasoning and question answering across many languages. They're especially impressed with its multilingual ability and with how it generates answers grounded in document information. Although it's not available for commercial use without permission, some believe smaller groups might still be able to use it. People also discuss how this model could lead to new advances in AI.

  2. Hermes 2 Pro on Mistral 7B (by NousResearch) is an upgraded version of Nous Hermes 2, trained on an updated OpenHermes 2.5 Dataset and a new Function Calling and JSON Mode dataset. It excels in general tasks, conversations, function calls, and JSON outputs, all while being efficient on basic devices. It also features a special prompt format and structure for easy function calls.

User-Driven Feedback:
Users appreciate that the developer has quantized the model and made both versions available. They find its function calling and JSON output skills useful, but some say it's not as good as the older versions. Even with mixed opinions on how well it works overall, there's excitement about what it can do differently.

  3. StarChat2 15B V0.1 from HuggingFace H4 is a GPT-like coding assistant that supports English and over 600 programming languages. While it's great for chat and coding, users should be cautious of potential biases and the accuracy of its code outputs.

  4. GemMoE Beta 1, developed by Crystalcareai (also known as Lucas Atkins, an active contributor to the world of AI), is a text generation model that stands out for its mixture of experts approach. This method routes text generation through several specialized components, and the model remains under active development and refinement.

  5. The 4-bit quantized version of C4AI Command-R, developed by Cohere and Cohere For AI, targets devices with limited processing capability without giving up the original's wide-ranging functionality, including reasoning, summarization, and multilingual question answering. It retains the original model's 35 billion parameters and extensive language support, while the quantization makes it more accessible and efficient to run for a broader audience.

  6. Cerebrum 1.0 7B from Aether AI excels in reasoning tasks with a native chain of thought approach, making it ideal for logical and scientific thinking, as well as general use as a language model.

  7. Yi 9B 200K comes from 01-ai. The Yi series models by 01.AI are next-gen bilingual LLMs trained on a 3T multilingual corpus. Within the series, the Yi-34B-Chat model ranks second on the AlpacaEval Leaderboard, outperforming giants like GPT-4 and Claude, while Yi-34B leads in English and Chinese on various benchmarks. These models are designed for comprehensive language understanding and reasoning tasks.

  8. Mistral Orpo Beta comes from KAIST AI. Mistral-ORPO-β (7B) improves the original Mistral-7B model with odds ratio preference optimization (ORPO), eliminating the need for a separate supervised fine-tuning phase. It excels in AlpacaEval and MT-Bench, outperforming models like Llama-2-Chat, and is fine-tuned on a cleaned UltraFeedback dataset. This model showcases enhanced alignment and response generation, making it efficient for various AI tasks.

  9. Trendyol LLM 7B Chat V1.0, developed by Trendyol Group, is based on Mistral 7B and optimized with LoRA. It is designed for conversational text in English and Turkish, and its developer emphasizes the need for ethical use and human oversight.

  10. Phi 2 Layla V1 Chatml (by l3utterfly) offers enhanced performance in multi-turn conversations and character impersonation, showing the versatility of models in embodying characters for various applications.
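For readers curious about the odds ratio preference optimization (ORPO) mentioned for Mistral-ORPO-β in the list above, the objective can be sketched numerically in plain Python. This follows the commonly described form of the ORPO loss: the usual negative log-likelihood on the preferred answer plus a weighted penalty based on the odds ratio between the preferred and rejected answers. The function names, the scalar inputs, and the default weight `lam` are illustrative assumptions; this is not KAIST AI's training code.

```python
import math


def log_odds(avg_logprob):
    """log(p / (1 - p)), where p = exp(avg_logprob).

    avg_logprob is assumed to be the length-normalized log-likelihood of a
    response and must be strictly negative (so that p < 1).
    """
    return avg_logprob - math.log1p(-math.exp(avg_logprob))


def _softplus(z):
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def orpo_odds_ratio_loss(chosen_avg_logprob, rejected_avg_logprob):
    """Penalty term: -log(sigmoid(log odds ratio of chosen vs. rejected))."""
    log_odds_ratio = log_odds(chosen_avg_logprob) - log_odds(rejected_avg_logprob)
    # -log(sigmoid(x)) equals softplus(-x)
    return _softplus(-log_odds_ratio)


def orpo_loss(chosen_avg_logprob, rejected_avg_logprob, lam=0.1):
    """Negative log-likelihood on the chosen answer plus lam times the penalty.

    lam=0.1 is an arbitrary illustrative default, not a value from the post.
    """
    nll = -chosen_avg_logprob
    return nll + lam * orpo_odds_ratio_loss(chosen_avg_logprob, rejected_avg_logprob)


# The penalty shrinks as the chosen answer becomes more likely than the rejected one.
print(orpo_odds_ratio_loss(-0.2, -1.0) < orpo_odds_ratio_loss(-1.0, -0.2))  # True
```

Because the preference penalty is added directly to the likelihood term, a single training stage can both teach the preferred responses and push down the rejected ones, which is why no separate fine-tuning phase is needed beforehand.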

That's all for now. Want to know more? Check the LLM Explorer website anytime for the newest popular models. The list is always there on the homepage.

Weekly Top-Ranked LLMs: Most Downloaded & Liked Language Models
Stay tuned!
