That SWE-bench chart with the mismatched bars (52.8% somehow appearing larger than 69.1%) was emblematic of the entire presentation: rushed and underwhelming. It's the kind of error that would get flagged in any internal review, yet here it is in a billion-dollar product launch. Combined with the Bernoulli effect demo confidently but incorrectly explaining how airplane wings work (the equal transit time fallacy that NASA explicitly debunks), it doesn't inspire confidence in either the model's ca...| news.ycombinator.com
peterdsharpe 70 days ago| news.ycombinator.com
haffi112 70 days ago| news.ycombinator.com
Plus how American professors are fighting back against the AI onslaught, a backlash over AI models in Vogue, and more.| www.bloodinthemachine.com
A call to reject the deployment and use of AI systems in Canada's public sector.| thedabbler.patatas.ca