Skip to main content

Home/ Artificial Intelligence/ Group items tagged LLM

Rss Feed Group items tagged

Janos Haits

open-llm-leaderboard (Open LLM Leaderboard) - 0 views

  •  
    "Open LLM Leaderboard This is the hub organisation maintaining the Open LLM Leaderboard. In this space you will find the dataset with detailed results and queries for the models on the leaderboard."
Janos Haits

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery - 0 views

  •  
    "At Sakana AI, we have pioneered the use of nature-inspired methods to advance cutting-edge foundation models. Earlier this year, we developed methods to automatically merge the knowledge of multiple LLMs. In more recent work, we harnessed LLMs to discover new objective functions for tuning other LLMs. Throughout thes"
Janos Haits

Build a Custom LLM with ChatRTX | NVIDIA - 0 views

  •  
    "ChatRTX is a demo app that lets you personalize a GPT large language model (LLM) connected to your own content-docs, notes, or other data. Leveraging retrieval-augmented generation (RAG), TensorRT-LLM, and RTX acceleration, you can query a custom chatbot to quickly get contextually relevant answers. And because it all runs locally on your Windows RTX PC or workstation, you'll get fast and secure results."
Janos Haits

Build a Custom LLM with ChatRTX | NVIDIA - 0 views

  •  
    "ChatRTX is a demo app that lets you personalize a GPT large language model (LLM) connected to your own content-docs, notes, images, or other data. Leveraging retrieval-augmented generation (RAG), TensorRT-LLM, and RTX acceleration, you can query a custom chatbot to quickly get contextually relevant answers. "
Janos Haits

Chatbot Arena: Elo Rating Calculation (July 17, 2023) - Colab - 0 views

  •  
    "n this notebook, we will employ the Elo rating system to evaluate the performance of large language models (LLMs). The analysis is based on the pairwise battle results we collected from https://arena.lmsys.org between April 24 and July 17, 2023. This crowdsourcing way of data collection represents some use cases of LLMs in the wild. Below, we present the calculation procedure along with some basic analyses. To view the latest leaderboard, see https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard."
Janos Haits

Chatbot Arena (formerly LMSYS): Free AI Chat to Compare & Test Best AI Chatbots - 0 views

  •  
    "Chatbot Arena LLM Leaderboard: Community-driven Evaluation for Best LLM and AI chatbots"
1 - 20 of 92 Next › Last »
Showing 20 items per page