Skip to content

Improve error message in case of a misformatted file (#158) #190

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jan 22, 2023

Conversation

joe-at-openai
Copy link
Collaborator

  • add more descriptive error handling regarding poorly formatted files

  • update version

  • add dot prefix to json file extentions and ensure list of allowable file types is complete

  • cleanup error messages and add comments to explain jsonl/json loading logic

  • cleanup csv/tsv reading allowing use of elif for other file extensions, add comments, and remove unnecessary re-attempt to parse as json

  • run fillna immediately upon DataFrame creation so that an additional switch is not needed

  • use only 1 try-except block to catch parsing errors + cleanup error message

  • separate the json and jsonl cases while still maintaining the same functionality, also include a message to user if jsonl appears to be json or vice versa

  • fix bug in csv path

  • use index -1 to get extension from split

  • black formatting apply

  • fix black

Co-authored-by: joe-at-openai [email protected]

* add more descriptive error handling regarding poorly formatted files

* update version

* add dot prefix to json file extentions and ensure list of allowable file types is complete

* cleanup error messages and add comments to explain jsonl/json loading logic

* cleanup csv/tsv reading allowing use of elif for other file extensions, add comments, and remove unnecessary re-attempt to parse as json

* run fillna immediately upon DataFrame creation so that an additional switch is not needed

* use only 1 try-except block to catch parsing errors + cleanup error message

* separate the json and jsonl cases while still maintaining the same functionality, also include a message to user if jsonl appears to be json or vice versa

* fix bug in csv path

* use index -1 to get extension from split

* black formatting apply

* fix black

Co-authored-by: joe-at-openai <[email protected]>
@joe-at-openai joe-at-openai requested a review from hallacy January 18, 2023 01:08
@joe-at-openai joe-at-openai merged commit 6349015 into main Jan 22, 2023
@joe-at-openai joe-at-openai deleted the joe/improve-error-message-in-ft-file-formatter branch January 22, 2023 01:53
cgayapr pushed a commit to cgayapr/openai-python that referenced this pull request Dec 14, 2024
…enai#190)

* add more descriptive error handling regarding poorly formatted files

* update version

* add dot prefix to json file extentions and ensure list of allowable file types is complete

* cleanup error messages and add comments to explain jsonl/json loading logic

* cleanup csv/tsv reading allowing use of elif for other file extensions, add comments, and remove unnecessary re-attempt to parse as json

* run fillna immediately upon DataFrame creation so that an additional switch is not needed

* use only 1 try-except block to catch parsing errors + cleanup error message

* separate the json and jsonl cases while still maintaining the same functionality, also include a message to user if jsonl appears to be json or vice versa

* fix bug in csv path

* use index -1 to get extension from split

* black formatting apply

* fix black

Co-authored-by: joe-at-openai <[email protected]>

Co-authored-by: Boris Power <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants