Llama 2 Chat can generate and explain Python code quite well, right out of the box. Code Llama’s fine-tuned models offer even better capabilities for code generation.| InfoWorld
TL;DR: We introduce lookahead decoding, a new, exact, and parallel decoding algorithm to accelerate LLM inference. Look...| lmsys.org
When a dangerous model is deployed, it will pose misalignment and misuse risks. Even before dangerous models exist, deploying models on dangerous paths can accelerate and diffuse progress toward dangerous models.| ailabwatch.org
Fermyon Serverless AI gives you the building blocks for integrating Artificial Intelligence into your serverless applications, with AI inferencing for Large Language Models (LLMs) for Llama2 and CodeLlama, support for generating sentence embeddings and storing them in a vector-ready database, built-in key-value storage, and a seamless developer experience.| Fermyon • Experience the next wave of cloud computing.
Today, we are excited to introduce DBRX, an open, general-purpose LLM created by Databricks. Across a range of standard benchmarks, DBRX sets a new state-of-the-art for established open LLMs. Moreover, it provides the open community and enterprises building their own LLMs with capabilities that were previously limited to closed model APIs; according to our measurements, it surpasses GPT-3.5, and it is competitive with Gemini 1.0 Pro. It is an especially capable code model, surpassing speciali...| Databricks
Marking a major investment in Meta’s AI future, we are announcing two 24k GPU clusters. We are sharing details on the hardware, network, storage, design, performance, and software that help us extr…| Engineering at Meta
Recently Meta unveiled CodeLLama, a 70-billion-parameter open-source large language model (LLM).| Llama LLM
Listen now | The PyTorch creator riffs on geohot's Tinygrad, Chris Lattner's Mojo, Apple's MLX, the PyTorch Mafia, the upcoming Llama 3 and MTIA ASIC, AI robotics, and what it takes for open source AI to win!| www.latent.space
We're releasing Code Llama, an AI model built on top of Llama 2 that's fine-tuned for generating and discussing code.| Meta
Fermyon Serverless AI gives you the building blocks for integrating Artificial Intelligence into your serverless applications, with AI inferencing for Large Language Models (LLMs) for Llama2 and CodeLlama, support for generating sentence embeddings and storing them in a vector-ready database, built-in key-value storage, and a seamless developer experience.| radu-matei.com