Skip to main content

Home/ Artificial Intelligence/ Group items tagged benchmark

Rss Feed Group items tagged

Janos Haits

Geekbench AI - Cross-Platform AI Benchmark - 0 views

  •  
    "eekbench AI is a cross-platform AI benchmark that uses real-world machine learning tasks to evaluate AI workload performance. Geekbench AI measures your CPU, GPU, and NPU to determine whether your device is ready for today's and tomorrow's cutting-edge machine learning applications."
Janos Haits

LiveBench - 0 views

  •  
    "Introducing LiveBench: a benchmark for LLMs designed with test set contamination and objective evaluation in mind. It has the following properties:"
Janos Haits

MetaMind - Deep Learning for Enterprise - 0 views

  •  
    "MetaMind delivers Artificial Intelligence enterprise solutions via its AI platform and Smart Module offerings. The general-purpose platform can predict outcomes for language, vision and database tasks, and delivers best in class accuracies on standard benchmarks."
Janos Haits

MIT Places Database for Scene Recognition - 1 views

  •  
    "Scene recognition is one of the hallmark tasks of computer vision, allowing defining a context for object recognition. Here we introduce a new scene-centric database called Places, with 205 scene categories and 2.5 millions of images with a category label. Using convolutional neural network (CNN), we learn deep scene features for scene recognition tasks, and establish new state-of-the-art performances on scene-centric benchmarks. Here we provide the Places Database and the trained CNNs for academic research and education purposes."
Janos Haits

www.cognition-labs.com/blog - 0 views

  •  
    "Introducing Devin, the first AI software engineer And setting a new state of the art on the SWE-bench coding benchmark"
Janos Haits

Chat with Open Large Language Models - 0 views

  •  
    "Chatbot Arena: Benchmarking LLMs in the Wild"
mikhail-miguel

LMSYS Chatbot Arena Vision (Multimodal): Benchmarking LLMs and VLMs in the Wild - 1 views

  •  
    The Chatbot Arena has launched a new beta feature supporting images, allowing users to interact with chatbots through images. Each conversation can include the submission of one image, as long as it is under 15MB. The Chatbot Arena logs user requests, including the images submitted, for research purposes. Although this data is not currently publicly disclosed, there may be a possibility of doing so in the future. Therefore, it is recommended that users avoid sending confidential or personal information through this feature. This feature is in its early development stage, so there may be issues or bugs. Users are encouraged to report any issues through the Chatbot Arena communication channels.
1 - 7 of 7
Showing 20 items per page