Updated load_dataset to be resistant to bad columns #1511

Open: wants to merge 1 commit into base: master

SilasMarvin
Contributor

This PR changes the load_dataset function so that it no longer crashes when values in some columns of the dataset fail to deserialize.

While loading the hotpotqa/hotpot_qa dataset, I found that some values in the answer column were null, which crashed the function. Now we emit a warning and skip the affected row.
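
For readers unfamiliar with the pattern, the warn-and-skip behavior described above can be sketched roughly as follows. This is an illustrative Python sketch, not the code in this PR: the function name `load_rows_skipping_bad`, the `converters` parameter, and the sample rows are all hypothetical, and the sketch assumes a null value counts as a deserialization failure.

```python
import warnings
from typing import Any, Callable, Dict, Iterable, List

Row = Dict[str, Any]


def load_rows_skipping_bad(
    rows: Iterable[Row],
    converters: Dict[str, Callable[[Any], Any]],
) -> List[Row]:
    """Convert every row; warn about and skip rows that fail to convert.

    Hypothetical helper for illustration only. `converters` maps a column
    name to the function used to deserialize that column's value.
    """
    loaded: List[Row] = []
    for index, row in enumerate(rows):
        try:
            converted: Row = {}
            for column, convert in converters.items():
                value = row[column]
                # Assumption: a null value is treated as a bad value.
                if value is None:
                    raise ValueError(f"column {column!r} is null")
                converted[column] = convert(value)
        except (KeyError, TypeError, ValueError) as error:
            # Emit a warning and move on instead of aborting the whole load.
            warnings.warn(f"skipping row {index}: {error}")
            continue
        loaded.append(converted)
    return loaded


if __name__ == "__main__":
    sample = [
        {"question": "q1", "answer": "a1"},
        {"question": "q2", "answer": None},  # bad row: null answer
        {"question": "q3", "answer": "a3"},
    ]
    good = load_rows_skipping_bad(sample, {"question": str, "answer": str})
    print(len(good))
```

The conversion for a whole row happens inside a single `try` block, so one bad column discards that row without leaving a partially converted record in the result.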

SilasMarvin requested a review from montanalow on June 6, 2024, 22:33