Question

I am trying to store some information as a sequence of JSON objects, one per line. Writing the data works fine:

import pandas as pd

# Create a string with two JSON documents, one per line:
example = pd.DataFrame(0, index=['first', 'second'], columns=['third', 'fourth'])
file_string = ''
for i in range(2):
    file_string += example.to_json(orient='table') + '\n'
print(file_string)

Output: {"schema": {"fields":[{"name":"index","type":"string"},{"name":"third","type":"integer"},{"name":"fourth","type":"integer"}],"primaryKey":["index"],"pandas_version":"0.20.0"}, "data": [{"index":"first","third":0,"fourth":0},{"index":"second","third":0,"fourth":0}]}
{"schema": {"fields":[{"name":"index","type":"string"},{"name":"third","type":"integer"},{"name":"fourth","type":"integer"}],"primaryKey":["index"],"pandas_version":"0.20.0"}, "data": [{"index":"first","third":0,"fourth":0},{"index":"second","third":0,"fourth":0}]}

Unfortunately, things break when I try to read the data back from such a string using pandas read_json with lines=True. Reading it back the usual way runs without error:

# Reading it the usual way works, but the result has the wrong format:
print(pd.read_json(file_string, lines=True))
Output:                                                     data                                             schema
0  [{'index': 'first', 'third': 0, 'fourth': 0}, ...  {'fields': [{'name': 'index', 'type': 'string'...
1  [{'index': 'first', 'third': 0, 'fourth': 0}, ...  {'fields': [{'name': 'index', 'type': 'string'...

However, I cannot read it back as the original DataFrame using orient='table':

# Reading it with orient='table' fails:
reading = pd.read_json(file_string, lines=True, orient='table')
Traceback (most recent call last):

  File "<ipython-input-104-f05542cc1431>", line 1, in <module>
    reading = pd.read_json(file_string, lines=True, orient='table')

  File "C:\ProgramData\Anaconda3\lib\site-packages\pandas\io\json\json.py", line 422, in read_json
    result = json_reader.read()

  File "C:\ProgramData\Anaconda3\lib\site-packages\pandas\io\json\json.py", line 526, in read
    self._combine_lines(data.split('\n'))

  File "C:\ProgramData\Anaconda3\lib\site-packages\pandas\io\json\json.py", line 546, in _get_object_parser
    obj = FrameParser(json, **kwargs).parse()

  File "C:\ProgramData\Anaconda3\lib\site-packages\pandas\io\json\json.py", line 638, in parse
    self._parse_no_numpy()

  File "C:\ProgramData\Anaconda3\lib\site-packages\pandas\io\json\json.py", line 864, in _parse_no_numpy
    precise_float=self.precise_float)

  File "C:\ProgramData\Anaconda3\lib\site-packages\pandas\io\json\table_schema.py", line 298, in parse_table_schema
    col_order = [field['name'] for field in table['schema']['fields']]

TypeError: list indices must be integers or slices, not str

Am I doing something wrong? I have a version that works: it reads each line and passes that single line to the JSON reader, one at a time, but it is very slow. I was hoping the lines=True version would be much faster.
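For reference, the per-line workaround described above might look roughly like the sketch below. The traceback suggests that with lines=True the lines are combined into a single JSON array before parsing, so the table-schema parser ends up indexing a list rather than a dict; parsing one line at a time avoids that. The names `frames` and `combined` are illustrative, and wrapping each line in `io.StringIO` and stacking the results with `pd.concat(..., keys=...)` are assumptions made for this example, not part of the original code.

```python
import io

import pandas as pd

# Rebuild the two-line string from the question.
example = pd.DataFrame(0, index=['first', 'second'], columns=['third', 'fourth'])
file_string = ''.join(example.to_json(orient='table') + '\n' for _ in range(2))

# Parse each non-empty line as its own orient='table' document.
# io.StringIO hands read_json a file-like object instead of a raw string.
frames = [
    pd.read_json(io.StringIO(line), orient='table')
    for line in file_string.splitlines()
    if line.strip()
]

# Optionally stack the per-line frames; the outer index level records
# which line each row came from (an illustrative choice).
combined = pd.concat(frames, keys=list(range(len(frames))))
print(combined)
```

This keeps the one-document-per-line file layout but makes one read_json call per line, so it does not address the speed concern.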

pandas read_json fails to recognize the orient parameter when lines=True


Answers [ 0 ]
