Scale AI launches Seal Showdown, an alternative to LMArena leaderboard

In the years since OpenAI launched ChatGPT, kicking off the generative AI boom, developers have relied on LMArena (previously Chatbot Arena) as the default AI leaderboard. Now, Scale AI is bringing some much-needed competition to the AI benchmarking space with its new Seal Showdown benchmarking tool.

Like LMArena, Seal Showdown lets users test AI models head-to-head and vote on which one performs better. However, Scale AI says that unlike LMArena, Seal Showdown will more closely reflect how everyday users feel about the models. In an X post, Scale CEO Jason Droege said that Seal Showdown "actually captures real preferences, powered by a platform used by real people."

"Most benchmarks rely on synthetic tests (coding puzzles, math problems) or feedback from
