Можете ли вы сказать, где именно была эта лишняя запятая в вашем процессоре записи преобразования?Как я столкнулся с той же проблемой.Насколько я понимаю, проблема возникает из-за поля size_dimension Ниже приведены мои данные CSV:
id,project,name,depth,parentid,description,createdtime,lastupdatedtime,metadata,path,source,sourceid
75125,abcd,P200184,4,74861,"WIRELINE RUNNING / RETRIEVING TOOL, SUPP",2002-06-04 00:00:00.0,2019-04-26 00:00:00.0,"{""material_group"":""group"",""weight_unit"":""LB"",""laboratory"":""PMC"",""object_type"":""material"",""pyspark_generated_time"":""2019-06-07, 13:32:20.287657"",""size_dimension"":""3'5\""L X 3'5\""W X 1'H"",""gross_weight"":""100.000"",""net_weight"":""100.000"",""valid_from_date"":""20031219""}","[59941,64249,74859,74861,75125]",RPA_SAA.MRA,P200184
И схема avro, которую я использовал:
{
"name":"abc",
"namespace":"nifi",
"type":"record",
"fields": [
{"name":"id", "type": ["long", "null"], "default": null},
{"name":"project", "type": ["string", "null"], "default": null},
{"name":"name", "type": ["string", "null"], "default": null},
{"name":"depth", "type": ["int", "null"], "default": null},
{"name":"parentid", "type": ["long", "null"], "default": null},
{"name":"description", "type": ["string", "null"], "default": null},
{"name":"createdtime","type": ["null",{ "type":"long", "logicalType":"timestamp-millis"}], "default":null},
{"name":"lastupdatedtime","type": ["null",{ "type":"long", "logicalType":"timestamp-millis"}], "default":null},
{"name":"metadata","type": ["string", "null"], "default": null},
{"name":"path","type": ["string", "null"], "default": null},
{"name":"source", "type": ["string", "null"], "default": null},
{"name":"sourceid", "type": ["string", "null"], "default": null}
]
}