the issue mentioned in this bug report: https://github.com/apache/arrow/issues/2244 causes not just bigger file sizes, but schema conflicts. use ```table = pa.Table.from_pandas(df, preserve_index=False)``` or drop the column explicitly.