Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
from
login
Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s
(
cerebras.ai
)
147 points
by
campers
on Oct 25, 2024
|
past
|
84 comments
Cerebras Inference now runs Llama 3.1-70B at 2100 tokens/s
(
cerebras.ai
)
6 points
by
cs-fan-101
on Oct 24, 2024
|
past
Simulating Human Behavior with Cerebras
(
cerebras.ai
)
2 points
by
akvadrako
on Oct 17, 2024
|
past
Cerebras' third-generation wafer-scale engine (WSE-3)
(
cerebras.ai
)
2 points
by
doener
on Aug 29, 2024
|
past
Llama 8B at 1800 tokens per second on Cerebras
(
cerebras.ai
)
2 points
by
huevosabio
on Aug 28, 2024
|
past
Cerebras Inference: AI at Instant Speed
(
cerebras.ai
)
174 points
by
meetpateltech
on Aug 27, 2024
|
past
|
72 comments
Cerebras Launches the Fastest AI Inference
(
cerebras.ai
)
13 points
by
cs-fan-101
on Aug 27, 2024
|
past
|
1 comment
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: