melisa (Melisa Russak)

upvoted a paper 7 days ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published 10 days ago • 50

upvoted 5 papers about 1 month ago

upvoted an article about 2 months ago

Article

Using Writer Framework with Hugging Face Spaces

By

•

Aug 20

• 30

upvoted 4 papers 4 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20 • 85

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Paper • 2406.02657 • Published Jun 4 • 36

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26 • 20

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27 • 85

upvoted a paper 5 months ago

Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19 • 49

upvoted a collection 5 months ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 21 days ago • 474

upvoted a paper 5 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15 • 87

upvoted 2 papers 6 months ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11 • 43

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Paper • 2404.07839 • Published Apr 11 • 41

upvoted a paper 7 months ago

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

Paper • 2402.17553 • Published Feb 27 • 21

upvoted a paper 8 months ago

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

Paper • 2401.04081 • Published Jan 8 • 70

upvoted a paper about 1 year ago

Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning

Paper • 2307.03692 • Published Jul 5, 2023 • 24

Melisa Russak

AI & ML interests

Organizations

melisa's activity