Haihao Shen

Haihao

AI & ML interests

LLM quantization, sparsity, and acceleration

Articles

Organizations

Haihao's activity

upvoted an article about 1 month ago
view article
Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

• 11
upvoted an article 4 months ago
view article
Article

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

• 4