Closed
Labels
api: bigquery (Issues related to the googleapis/python-bigquery API.)
type: question (Request for information or clarification. Not an issue.)
Description
I'm trying to upload a pandas DataFrame in which several columns contain either a single value or a list of JSON objects; the latter end up as nested RECORD fields on the BigQuery side.
For example, a column named version might contain:
[{"type": "specific_api_name_1", "value": ["0.0.1"]}, {"type": "specific_api_name_2", "value": ["0.0.1"]}]
Uploading this produces the desired data layout: a nested RECORD in BigQuery for each row.
However, it also yields the following schema, with extra list and item wrapper levels:
version | RECORD | NULLABLE |
version.list | RECORD | REPEATED |
version.list.item | RECORD | NULLABLE |
version.list.item.type | STRING | NULLABLE |
version.list.item.value | RECORD | NULLABLE |
version.list.item.value.list | RECORD | REPEATED |
version.list.item.value.list.item | STRING | NULLABLE |
I would expect (or want) this to generate the schema:
version | RECORD | NULLABLE |
version.type | STRING | NULLABLE |
version.value | STRING | REPEATED |
How could this be achieved?
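One possible approach, sketched here under assumptions rather than as a confirmed fix, is to skip schema inference and load the rows as JSON with an explicit schema. The names `VERSION_SCHEMA`, `normalize_rows`, and `load_rows` are hypothetical helpers introduced for illustration. Note that the sketch declares version as REPEATED rather than NULLABLE, because each cell holds a list of objects and BigQuery represents a list of records as a REPEATED RECORD.

```python
import json

# Explicit schema in BigQuery's JSON schema representation (an assumption
# about the desired layout; `version` is REPEATED because each cell is a list).
VERSION_SCHEMA = [
    {
        "name": "version",
        "type": "RECORD",
        "mode": "REPEATED",
        "fields": [
            {"name": "type", "type": "STRING", "mode": "NULLABLE"},
            {"name": "value", "type": "STRING", "mode": "REPEATED"},
        ],
    }
]


def normalize_rows(rows):
    """Return rows whose `version` cell is a plain list of dicts.

    Cells stored as JSON strings are parsed; missing cells become empty lists.
    """
    normalized = []
    for row in rows:
        cell = row.get("version") or []
        if isinstance(cell, str):
            cell = json.loads(cell)
        items = [
            {"type": item["type"], "value": list(item["value"])}
            for item in cell
        ]
        normalized.append({**row, "version": items})
    return normalized


def load_rows(client, table_id, rows):
    """Load rows into `table_id` as JSON using the explicit schema above."""
    # Imported lazily so the helpers above work without the client library.
    from google.cloud import bigquery

    schema = [bigquery.SchemaField.from_api_repr(f) for f in VERSION_SCHEMA]
    job_config = bigquery.LoadJobConfig(schema=schema)
    job = client.load_table_from_json(
        normalize_rows(rows), table_id, job_config=job_config
    )
    return job.result()  # Blocks until the load job finishes.
```

With a DataFrame df, this could be called as load_rows(bigquery.Client(), "my_project.my_dataset.my_table", df.to_dict(orient="records")), where the table ID is a placeholder. If the Parquet-based DataFrame load path is preferred, the load job's Parquet option for list inference (enableListInference in the BigQuery API) may also remove the list and item wrapper levels, though that is not verified against this exact DataFrame.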