I want to validate the schema of a .parquet file against a schema defined in an Avro (.avsc) file. I can read the Parquet file's schema with pyarrow.parquet.ParquetFile, but I am not sure of the best way to compare it against the schema defined in the .avsc file.

from pyarrow.parquet import ParquetFile
ParquetFile(source).schema_arrow

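To make the question concrete, here is a rough sketch of the kind of comparison I have in mind. It is not a definitive approach. It assumes the .avsc describes a flat record with primitive field types (optionally wrapped in a ["null", T] union), and the AVRO_TO_ARROW mapping and the helper names avro_fields, compare, and validate are my own hypothetical choices, not part of pyarrow or any Avro library. Logical types, nested records, and nullability are not handled.

```python
import json

# Assumed mapping from Avro primitive names to str(arrow_type); adjust as needed.
AVRO_TO_ARROW = {
    "boolean": "bool",
    "int": "int32",
    "long": "int64",
    "float": "float",
    "double": "double",
    "string": "string",
    "bytes": "binary",
}


def avro_fields(avsc):
    """Return {field_name: avro_primitive_or_None} for a flat Avro record."""
    fields = {}
    for field in avsc["fields"]:
        avro_type = field["type"]
        if isinstance(avro_type, list):
            # Unwrap a nullable union such as ["null", "string"].
            non_null = [t for t in avro_type if t != "null"]
            avro_type = non_null[0] if len(non_null) == 1 else None
        # Nested or logical types (dicts) are not handled in this sketch.
        fields[field["name"]] = avro_type if isinstance(avro_type, str) else None
    return fields


def compare(avsc, parquet_types):
    """Compare an Avro record dict with {column_name: arrow_type_string}.

    Returns a list of human-readable mismatches; an empty list means a match.
    """
    problems = []
    expected = avro_fields(avsc)
    for name, avro_type in expected.items():
        if name not in parquet_types:
            problems.append(f"missing column: {name}")
            continue
        want = AVRO_TO_ARROW.get(avro_type)
        got = parquet_types[name]
        if want is None:
            problems.append(f"{name}: unsupported Avro type, check manually")
        elif got != want:
            problems.append(f"{name}: expected {want}, found {got}")
    for name in parquet_types:
        if name not in expected:
            problems.append(f"unexpected column: {name}")
    return problems


def validate(parquet_path, avsc_path):
    """Load both schemas from disk and return the list of mismatches."""
    from pyarrow.parquet import ParquetFile  # imported here so compare() needs no pyarrow

    with open(avsc_path) as fh:
        avsc = json.load(fh)
    schema = ParquetFile(parquet_path).schema_arrow
    parquet_types = {field.name: str(field.type) for field in schema}
    return compare(avsc, parquet_types)
```

Usage would be something like validate("data.parquet", "schema.avsc") (both paths are placeholders), treating an empty list as "schema matches". Comparing type names as strings is a shortcut; it will not recognise equivalent but differently named Arrow types such as large_string.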

Tags: How to validate the schema of a parquet against AVRO (avsc) schema using Python, Stack Overflow