Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s (cerebras.ai)
147 points by campers on Oct 25, 2024 | past | 84 comments
Cerebras Inference now runs Llama 3.1-70B at 2100 tokens/s (cerebras.ai)
6 points by cs-fan-101 on Oct 24, 2024 | past
Simulating Human Behavior with Cerebras (cerebras.ai)
2 points by akvadrako on Oct 17, 2024 | past
Cerebras' third-generation wafer-scale engine (WSE-3) (cerebras.ai)
2 points by doener on Aug 29, 2024 | past
Llama 8B at 1800 tokens per second on Cerebras (cerebras.ai)
2 points by huevosabio on Aug 28, 2024 | past
Cerebras Inference: AI at Instant Speed (cerebras.ai)
174 points by meetpateltech on Aug 27, 2024 | past | 72 comments
Cerebras Launches the Fastest AI Inference (cerebras.ai)
13 points by cs-fan-101 on Aug 27, 2024 | past | 1 comment

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: