Surreal VINCI - 0 views
Semantic Scholar - 1 views
Chatbot Arena: Elo Rating Calculation (July 17, 2023) - Colab - 0 views
-
"In this notebook, we will employ the Elo rating system to evaluate the performance of large language models (LLMs). The analysis is based on the pairwise battle results we collected from https://arena.lmsys.org between April 24 and July 17, 2023. This crowdsourcing way of data collection represents some use cases of LLMs in the wild. Below, we present the calculation procedure along with some basic analyses. To view the latest leaderboard, see https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard."
Home - POWER AI - 0 views
BlueGPT - 0 views
TheFastest.ai - 0 views
-
"Human conversations are fast, typically around 200ms between turns, and we think LLMs should be just as quick. This site provides reliable measurements for the performance of popular models. You can filter using the hamburger menu in a column header, e.g., Llama 3 70B providers, GPT-4 vs Claude 3 vs Gemini. Definitions, methodology, and links to source below. Stats updated daily."
Welcome to Claude - 0 views
-
"Claude is a family of large language models developed by Anthropic and designed to revolutionize the way you interact with AI. Claude excels at a wide variety of tasks involving language, reasoning, analysis, coding, and more. Our models are highly capable, easy to use, and can be customized to suit your needs."
OpenAI Platform - 0 views
Prompt library - 1 views
Anthropic Console - 1 views
AI Chat for scientific PDFs | SciSpace - 0 views
Pi, your personal AI - 0 views
Chat with Open Large Language Models - 0 views
The A-Z of AI - 2 views
AI & Robotics News - Taimine - 0 views
Perplexity AI: Ask Anything - 0 views