nick7566

nick7566

2025-04-02 11:59:59

GPT-4.5 Passes the Turing Test | "When prompted to adopt a humanlike persona, GPT-4.5 was judged to be the human 73% of the time: significantly more often than interrogators selected the real human participant."

nick7566

2025-04-02 05:40:07

"Large Language Models Pass the Turing Test", Jones and Bergen 2025 ("When prompted to adopt a humanlike persona, GPT-4.5 was judged to be the human 73% of the time: significantly more often than interrogators selected the real human participant.")

nick7566

2025-03-25 17:45:19

Gemini 2.5: Our newest Gemini model with thinking

nick7566

2025-03-25 15:36:30

DeepSeek-V3-0324

nick7566

2025-03-24 17:58:32

OpenAI’s First Stargate Site to Hold Up to 400,000 Nvidia Chips

nick7566

2025-03-18 07:22:59

Waymo’s self-driving cars headed to San Jose and SFO

nick7566

2025-03-06 08:39:30

QwQ-32B: Embracing the Power of Reinforcement Learning

nick7566

2025-03-04 12:24:38

Waymo is now available exclusively on Uber in Austin

nick7566

2025-02-28 03:54:13

GPT-4.5 compared to Grok 3 base

Unhappy_Spinach_7290

2025-02-28 00:17:27

GPT-4.5 compared to Grok 3 base

nick7566

2025-02-25 13:59:13

DeepSeek rushes to launch new AI model as China goes all in

nick7566

2025-02-18 10:52:45

Grok 3 Benchmarks

pigeon57434

2025-02-18 04:25:59

First Grok 3 Benchmarks

nick7566

2025-02-12 04:18:56

"Competitive Programming with Large Reasoning Models", El-Kishky et al 2025

nick7566

2025-01-22 21:26:25

Trading Inference-Time Compute for Adversarial Robustness

nick7566

2025-01-21 22:40:05

Announcing The Stargate Project

nick7566

2024-12-25 10:18:51

It’s been a rough year for robotaxis — but not for Waymo

nick7566

2024-12-20 18:30:16

OpenAI o3 Breakthrough High Score on ARC-AGI-Pub

nick7566

2024-12-16 03:02:30

How Google turned Jaguars into self-driving taxis, but General Motors gave up

nick7566

2024-12-06 23:19:30

ARC Prize 2024

nick7566

2024-12-05 19:13:23

"Mastering Board Games by External and Internal Planning with Language Models", Schultz et al 2024 (Google DeepMind)

nick7566

2024-12-05 04:56:24

Elon Musk's xAI Memphis Supercomputer Eyes Expansion to 1 Million GPUs

nick7566

2024-12-04 09:02:58

Predicting Emergent Capabilities by Finetuning

nick7566

2024-11-30 00:58:45

Uber and Lyft drivers say Waymo's robotaxis are hurting their earnings in Phoenix and LA

nick7566

2024-11-22 17:03:03

OK, I can partly explain the LLM chess weirdness now

nick7566

2024-11-16 03:24:28

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning

nick7566

2024-08-19 00:02:29

Waymo’s robotaxi depot is still honking its San Francisco neighbors awake