Tokasaurus: An LLM Inference Engine for High-Throughput Workloads (scalingintelligence.stanford.edu)
106 points by rsehrlich 5 hours ago | 9 comments
1106 points by rsehrlich 5 hours ago | 9 comments
177 points by mrcgnc 3 hours ago | 48 comments
253 points by mikebannister 3 hours ago | 16 comments
3399 points by bdr 13 hours ago | 142 comments
457 points by ofalkaed 5 hours ago | 26 comments
524 points by us-merul 3 hours ago | 4 comments
628 points by noleary 4 hours ago | 8 comments
728 points by echollama 3 hours ago | 16 comments
858 points by rmason 7 hours ago | 3 comments
955 points by jawns 2 hours ago | 34 comments
10341 points by 256dpi 19 hours ago | 150 comments
11142 points by zdw 10 hours ago | 69 comments
127 points by wey-gu an hour ago | 2 comments
138 points by jorkingit 3 hours ago | 4 comments
145 points by BUFU 2 hours ago | 0 comments
1577 points by anteloper 8 hours ago | 43 comments
165 hours ago
1739 points by 90s_dev 9 hours ago | 13 comments
18604 points by doener a day ago | 338 comments
19176 points by mikeshi42 8 hours ago | 37 comments
20165 points by robertvc 8 hours ago | 94 comments
2121 points by rbanffy 5 hours ago | 11 comments
2242 points by aluzzardi 9 hours ago | 7 comments
23273 points by robenkleene 13 hours ago | 153 comments
2467 points by todsacerdoti 10 hours ago | 51 comments
256 points by eaglepeak 4 hours ago | 2 comments
26310 points by picture a day ago | 250 comments
27194 points by tabletcorry 13 hours ago | 178 comments
2896 points by kadrek 17 hours ago | 13 comments
2975 points by rmason 7 hours ago | 16 comments
30