OpenAI launches GPT‑5.3‑Codex‑Spark, a Cerebras-powered, ultra-low-latency coding model that the company claims delivers 15x faster generation ...
Nvidia noted that cost per token went from 20 cents on the older Hopper platform to 10 cents on Blackwell. Moving to Blackwell’s native low-precision NVFP4 format further reduced the cost to just 5 ...
Sunnyvale, CA — Meta has teamed with Cerebras to power AI inference in its new Llama API, pairing Meta’s open-source Llama models with Cerebras’s inference technology. Developers building on the ...
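The snippet does not include integration details, but here is a minimal sketch of what calling such a service could look like, assuming the Llama API exposes an OpenAI-compatible chat-completions interface served by Cerebras hardware. The base URL, model name, and environment variable are placeholders for illustration, not values from the announcement.

```python
# Minimal sketch, assuming an OpenAI-compatible chat-completions interface.
# Base URL, model id, and credential variable are hypothetical placeholders.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.llama.example/v1",   # hypothetical endpoint, not the real Llama API URL
    api_key=os.environ["LLAMA_API_KEY"],       # hypothetical credential
)

response = client.chat.completions.create(
    model="llama-4-example",                   # illustrative model id
    messages=[
        {"role": "user", "content": "Explain why low-latency inference matters for coding agents."}
    ],
)
print(response.choices[0].message.content)
```

Pointing an existing OpenAI-style client at a different base URL is a common pattern for hosted Llama endpoints, which is why the sketch uses it; the actual SDK or endpoint Meta ships may differ.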
Nous Research, the New York-based AI ...
Achieving that 10x cost reduction is challenging, though, and it requires a huge up-front expenditure on Blackwell hardware.
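To make that trade-off concrete, the back-of-the-envelope sketch below uses the Nvidia per-token figures quoted above and asks how long the savings take to cover an up-front hardware outlay. The monthly token volume and hardware price are hypothetical inputs chosen for illustration, not figures from the article.

```python
# Plug-in arithmetic for the cost trade-off described above.
# Per-token costs are the quoted Nvidia figures; the workload volume and
# hardware outlay are hypothetical placeholders to adjust for your own case.

def monthly_spend(cost_per_token: float, tokens_per_month: int) -> float:
    """Inference bill for one month at the given per-token cost."""
    return cost_per_token * tokens_per_month

def breakeven_months(old_cost: float, new_cost: float,
                     tokens_per_month: int, hardware_outlay: float) -> float:
    """Months of per-token savings needed to recover the up-front spend."""
    savings = monthly_spend(old_cost, tokens_per_month) - monthly_spend(new_cost, tokens_per_month)
    return hardware_outlay / savings

HOPPER, BLACKWELL, NVFP4 = 0.20, 0.10, 0.05    # quoted costs per token, $

print(f"Hopper -> Blackwell:        {HOPPER / BLACKWELL:.0f}x cheaper per token")
print(f"Hopper -> Blackwell+NVFP4:  {HOPPER / NVFP4:.0f}x cheaper per token")

# Hypothetical workload (2M tokens/month) and hardware cost ($3M), purely illustrative.
print(f"Break-even: {breakeven_months(HOPPER, NVFP4, 2_000_000, 3_000_000):.1f} months")
```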
SAN FRANCISCO--(BUSINESS WIRE)--Elastic (NYSE: ESTC) announced support for Amazon Bedrock-hosted models in the Elasticsearch Open Inference API and Playground. Developers now have the flexibility to ...
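As a rough illustration of how a Bedrock-hosted model might be wired into the Open Inference API over plain HTTP, the sketch below registers a completion endpoint and then queries it. The service identifier, service_settings fields, cluster address, credentials, and model id are assumptions to check against Elastic's documentation, not values taken from the announcement.

```python
# Illustrative sketch only: registering an Amazon Bedrock-hosted model as an
# Elasticsearch inference endpoint, then sending a test request through it.
# Cluster address, credentials, service name, and settings fields are assumed.
import requests

ES_URL = "https://localhost:9200"            # hypothetical cluster
AUTH = ("elastic", "changeme")               # hypothetical credentials

endpoint_config = {
    "service": "amazonbedrock",              # assumed service identifier
    "service_settings": {
        "access_key": "<aws-access-key>",
        "secret_key": "<aws-secret-key>",
        "region": "us-east-1",
        "provider": "anthropic",             # illustrative Bedrock provider
        "model": "anthropic.claude-3-haiku-20240307-v1:0",
    },
}

# Create a completion endpoint backed by Bedrock.
requests.put(
    f"{ES_URL}/_inference/completion/bedrock-demo",
    json=endpoint_config, auth=AUTH, verify=False,
)

# Send a test completion request through the new endpoint.
resp = requests.post(
    f"{ES_URL}/_inference/completion/bedrock-demo",
    json={"input": "Summarize this quarter's incident reports."},
    auth=AUTH, verify=False,
)
print(resp.json())
```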
Applications using Hugging Face embeddings on Elasticsearch now benefit from native chunking. “Developers are at the heart of our business, and extending more of our GenAI and search primitives to ...
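One common way this kind of native chunking surfaces is by binding a Hugging Face-backed embedding endpoint to a semantic_text field, so long documents are split and embedded at ingest time. The sketch below assumes that pattern; the service settings, Hugging Face endpoint URL, and index and field names are illustrative placeholders rather than details from the press release.

```python
# Hedged sketch: pair a Hugging Face embedding endpoint with a semantic_text
# field so Elasticsearch chunks and embeds long text at ingest time.
# All identifiers and settings here are illustrative assumptions.
import requests

ES_URL = "https://localhost:9200"            # hypothetical cluster
AUTH = ("elastic", "changeme")               # hypothetical credentials

# 1) Register a Hugging Face embedding model as an inference endpoint.
requests.put(
    f"{ES_URL}/_inference/text_embedding/hf-embeddings",
    json={
        "service": "hugging_face",           # assumed service identifier
        "service_settings": {
            "api_key": "<hf-token>",
            "url": "<hugging-face-inference-endpoint-url>",
        },
    },
    auth=AUTH, verify=False,
)

# 2) Map a semantic_text field that uses the endpoint; chunking is handled natively.
requests.put(
    f"{ES_URL}/articles",
    json={
        "mappings": {
            "properties": {
                "body": {"type": "semantic_text", "inference_id": "hf-embeddings"}
            }
        }
    },
    auth=AUTH, verify=False,
)
```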
OpenRouter Inc., a startup working to ease the development of artificial intelligence applications, today announced that it has secured $40 million in funding. The company raised the capital over two ...
OpenAI’s revenue is rising fast, but so are its costs. Here’s what the company’s economics reveal about the future of AI profitability.