"n this notebook, we will employ the Elo rating system to evaluate the performance of large language models (LLMs). The analysisis based on the pairwise battle results we collected from https://arena.lmsys.org between April 24 and July 17, 2023. This crowdsourcing way of data collection represents some use cases of LLMs in the wild. Below, we present the calculation procedure along with some basic analyses.
To view the latest leaderboard, see https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard."
"OpenAI's ChatGPT was launched in November 2022 and it crossed a user mark of over 1 million within a week of its launch. But do you know why its popularity grew so quickly in a short period? It is because of its versatility, intelligence, and ability to do conversations like humans."
"Your guide to today's artificial intelligence
ChatGPT was only the beginning. Generative AI is popping up everywhere: at work, at home, on the go. How do you sort it all out? You start right here."
"Get Work Done 10x Faster with AI
Liner is powered by GPT-4 and tuned for your productivity.
Get instant answers to questions, deep dive into any topic, and summarize websites and documents in seconds."
"Further With AI, Faster on RTX
Get next-level AI performance on GeForce RTX™ and NVIDIA RTX™ GPUs. From enhanced creativity and productivity to blisteringly fast gaming, the ultimate in AI power on Windows PCs is on RTX."
"ChatRTX is a demo app that lets you personalize a GPT large language model (LLM) connected to your own content-docs, notes, or other data. Leveraging retrieval-augmented generation (RAG), TensorRT-LLM, and RTX acceleration, you can query a custom chatbot to quickly get contextually relevant answers. And because it all runs locally on your Windows RTX PC or workstation, you'll get fast and secure results."
"A worldwide machine learning lab
Machine learning research should be easily accessible and reusable. OpenML is an open platform for sharing datasets, algorithms, and experiments - to learn how to learn better, together."
"We're beta launching SynthID, a tool for watermarking and identifying AI-generated content. With this tool, users can embed a digital watermark directly into AI-generated images or audio they create. This watermark is imperceptible to humans, but detectable for identification."
"Claude is a family of foundational AI models that can be used in a variety of applications. You can talk directly with Claude at claude.ai to brainstorm ideas, analyze images, and process long documents. For developers and businesses, you can now get API access and build directly on top of our AI infrastructure."
"Makers of Devin, the first AI software engineer. Learn more here.
We are an applied AI lab focused on reasoning, and code is just the beginning.
To hire Devin for engineering work, please join the waitlist.
We're a small team based in New York and the San Francisco Bay Area. Come work with us."
"Making sense of artificial intelligence
This A-Z guide offers a series of simple, bite-sized explainers to help anyone understand what AI is, how it works and how it's changing the world around us."
"Monica is an all-in-one AI assistant equipped with the most advanced AI models (GPT-4, Claude, Gemini, etc.) to help you Chat, Search, Write, Translate and more. It also offers tools for image, video, and PDF processing."
"These self-service demo videos provide a summary of the key features and functions of H2O.ai's AI Cloud platform. H2O.ai is an end-to-end AI platform to manage your entire AI lifecycle journey."