Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
rlhf
Inference Endpoints
AutoTrain Compatible
text-generation-inference
Merge
Eval Results
4-bit precision
8-bit precision
custom_code
Misc with no match
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
289
Full-text search
Edit filters
Sort: Trending
Active filters:
rlhf
Clear all
PKU-Alignment/beaver-7b-v1.0-reward
Reinforcement Learning
•
Updated
Apr 20
•
223
•
16
PKU-Alignment/beaver-7b-v1.0-cost
Reinforcement Learning
•
Updated
Apr 20
•
40
•
8
Ablustrund/moss-rlhf-reward-model-7B-zh
Updated
Jul 13, 2023
•
7
•
23
fnlp/moss-rlhf-reward-model-7B-en
Updated
Jul 13, 2023
•
8
fnlp/moss-rlhf-sft-model-7B-en
Updated
Jul 14, 2023
•
2
fnlp/moss-rlhf-policy-model-7B-en
Updated
Jul 17, 2023
•
1
lightonai/alfred-40b-0723
Text Generation
•
Updated
Aug 11, 2023
•
25
•
45
kashif/stack-llama-2
Text Generation
•
Updated
Aug 8, 2023
•
1.16k
•
15
barnybug/stack-llama-2-ggml
Updated
Aug 10, 2023
•
4
vwxyzjn/starcoderbase-triviaqa
Text Generation
•
Updated
Aug 29, 2023
•
6
lvwerra/starcoderbase-gsm8k
Text Generation
•
Updated
Aug 30, 2023
•
22
ContextualAI/archangel_sft_pythia1-4b
Text Generation
•
Updated
Jan 11
•
38
ContextualAI/archangel_sft_pythia2-8b
Text Generation
•
Updated
Jan 11
•
86
•
1
ContextualAI/archangel_sft_pythia6-9b
Text Generation
•
Updated
Jan 11
•
95
ContextualAI/archangel_sft_pythia12-0b
Text Generation
•
Updated
Jan 11
•
21
ContextualAI/archangel_sft_llama7b
Text Generation
•
Updated
Jan 11
•
1.75k
•
1
ContextualAI/archangel_sft_llama13b
Text Generation
•
Updated
Jan 11
•
156
ContextualAI/archangel_sft_llama30b
Text Generation
•
Updated
Jan 11
•
12
ContextualAI/archangel_slic_llama30b
Text Generation
•
Updated
Jan 11
•
14
ContextualAI/archangel_slic_pythia1-4b
Text Generation
•
Updated
Jan 11
•
3
ContextualAI/archangel_slic_pythia2-8b
Text Generation
•
Updated
Jan 11
•
22
ContextualAI/archangel_slic_pythia6-9b
Text Generation
•
Updated
Jan 11
•
30
ContextualAI/archangel_slic_pythia12-0b
Text Generation
•
Updated
Jan 11
•
22
ContextualAI/archangel_slic_llama7b
Text Generation
•
Updated
Jan 11
•
20
•
1
ContextualAI/archangel_slic_llama13b
Text Generation
•
Updated
Jan 11
•
23
ContextualAI/archangel_dpo_pythia1-4b
Text Generation
•
Updated
Jan 11
•
10
ContextualAI/archangel_dpo_pythia2-8b
Text Generation
•
Updated
Jan 11
•
34
ContextualAI/archangel_dpo_pythia6-9b
Text Generation
•
Updated
Jan 11
•
22
ContextualAI/archangel_dpo_pythia12-0b
Text Generation
•
Updated
Jan 11
•
13
ContextualAI/archangel_dpo_llama7b
Text Generation
•
Updated
Jan 11
•
36
Previous
1
2
3
4
...
10
Next