GEM benchmark
https://gem-benchmark.com
AI & ML interests
We develop infrastructure for the evaluation of generated text.
Recent Activity
fladhak authored a paper 3 days ago
SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL
yjernite authored a paper 4 months ago
The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources
yjernite authored a paper 4 months ago
In-House Evaluation Is Not Enough: Towards Robust Third-Party Flaw Disclosure for General-Purpose AI
Team members
96
GEM's datasets (44)
GEM/xlsum • Updated Oct 3, 2024 • 1.81k • 5
GEM/wiki_auto_asset_turk • Viewer • Updated May 29, 2024 • 510k • 1.01k • 8
GEM/gem • Updated Jan 18, 2024 • 28.8k • 33
GEM/opusparcus • Updated Jan 9, 2024 • 1.26k • 2
GEM/Augmented_CACAPO_for_E2E • Viewer • Updated Feb 26, 2023 • 47.3k • 93
GEM/CACAPO_E2E • Viewer • Updated Feb 26, 2023 • 20.1k • 75
GEM/Elongated_CACAPO_for_E2E • Updated Feb 26, 2023 • 139
GEM/xwikis • Updated Feb 22, 2023 • 1.68k • 4
GEM/wiki_lingua • Updated Feb 16, 2023 • 7.41k • 50
GEM/xmediasum • Viewer • Updated Feb 15, 2023 • 40k • 33 • 4
GEM/TaTA • Viewer • Updated Nov 3, 2022 • 8.69k • 62 • 1
GEM/FairytaleQA • Viewer • Updated Oct 25, 2022 • 10.6k • 762 • 9
GEM/squality • Updated Oct 25, 2022 • 89 • 2
GEM/xsum • Updated Oct 24, 2022 • 318 • 2
GEM/wiki_cat_sum • Viewer • Updated Oct 24, 2022 • 187k • 2.96k • 4
GEM/web_nlg • Viewer • Updated Oct 24, 2022 • 58.9k • 1.09k • 2
GEM/viggo • Viewer • Updated Oct 24, 2022 • 8.84k • 948 • 35
GEM/turku_hockey_data2text • Updated Oct 24, 2022 • 182
GEM/totto • Viewer • Updated Oct 24, 2022 • 138k • 257 • 2
GEM/surface_realisation_st_2020 • Viewer • Updated Oct 24, 2022 • 351k • 1.63k • 1
GEM/squad_v2 • Viewer • Updated Oct 24, 2022 • 142k • 446 • 3
GEM/sportsett_basketball • Viewer • Updated Oct 24, 2022 • 6.15k • 636 • 13
GEM/schema_guided_dialog • Viewer • Updated Oct 24, 2022 • 188k • 512 • 8
GEM/mlsum • Viewer • Updated Oct 24, 2022 • 535k • 146 • 2
GEM/mlb_data_to_text • Viewer • Updated Oct 24, 2022 • 26.2k • 459 • 3
GEM/e2e_nlg • Viewer • Updated Oct 24, 2022 • 38.4k • 1.05k • 1
GEM/dstc10_track2_task2 • Updated Oct 24, 2022 • 87 • 4
GEM/dart • Viewer • Updated Oct 24, 2022 • 70.5k • 492
GEM/cs_restaurants • Viewer • Updated Oct 24, 2022 • 6.69k • 274 • 1
GEM/conversational_weather • Updated Oct 24, 2022 • 519 • 5