comparing_parquet_files
Created last year
Original: 155 lines, 18 removals
Changed: 156 lines, 19 additions
nic@xps-15:~/arrow$ parquet-tools inspect ../Downloads/papers.parquet
nic@xps-15:~/arrow$ parquet-tools inspect "/tmp/RtmpfoyxmB/file18fa6b312836/part-0.parquet"
############ file meta data ############
############ file meta data ############
created_by: parquet-go version 18.0.0-SNAPSHOT
created_by: parquet-cpp-arrow version 20.0.0-SNAPSHOT
num_columns: 13
num_columns: 13
num_rows: 64141
num_rows: 64141
num_row_groups: 1
num_row_groups: 1
format_version: 2.6
format_version: 2.6
serialized_size: 1819
serialized_size: 3124
############ Columns ############
############ Columns ############
paper_id
paper_id
softcite_id
softcite_id
title
title
published_year
published_year
published_date
published_date
publication_venue
publication_venue
publisher_name
publisher_name
doi
doi
pmcid
pmcid
pmid
pmid
genre
genre
license_type
license_type
has_mentions
has_mentions
############ Column(paper_id) ############
############ Column(paper_id) ############
name: paper_id
name: paper_id
path: paper_id
path: paper_id
max_definition_level: 0
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: INT32
physical_type: INT32
logical_type: Int(bitWidth=32, isSigned=false)
logical_type: Int(bitWidth=32, isSigned=false)
converted_type (legacy): UINT_32
converted_type (legacy): UINT_32
compression: GZIP (space_saved: 13%)
compression: GZIP (space_saved: 22%)
############ Column(softcite_id) ############
############ Column(softcite_id) ############
name: softcite_id
name: softcite_id
path: softcite_id
path: softcite_id
max_definition_level: 0
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: BYTE_ARRAY
physical_type: BYTE_ARRAY
logical_type: String
logical_type: String
converted_type (legacy): UTF8
converted_type (legacy): UTF8
compression: GZIP (space_saved: 50%)
compression: GZIP (space_saved: 47%)
############ Column(title) ############
############ Column(title) ############
name: title
name: title
path: title
path: title
max_definition_level: 1
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: BYTE_ARRAY
physical_type: BYTE_ARRAY
logical_type: String
logical_type: String
converted_type (legacy): UTF8
converted_type (legacy): UTF8
compression: GZIP (space_saved: 56%)
compression: GZIP (space_saved: 55%)
############ Column(published_year) ############
############ Column(published_year) ############
name: published_year
name: published_year
path: published_year
path: published_year
max_definition_level: 1
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: INT32
physical_type: INT32
logical_type: Int(bitWidth=16, isSigned=false)
logical_type: Int(bitWidth=16, isSigned=false)
converted_type (legacy): UINT_16
converted_type (legacy): UINT_16
compression: GZIP (space_saved: 18%)
compression: GZIP (space_saved: 18%)
############ Column(published_date) ############
############ Column(published_date) ############
name: published_date
name: published_date
path: published_date
path: published_date
max_definition_level: 1
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: INT32
physical_type: INT32
logical_type: Date
logical_type: Date
converted_type (legacy): DATE
converted_type (legacy): DATE
compression: GZIP (space_saved: 10%)
compression: GZIP (space_saved: 10%)
############ Column(publication_venue) ############
############ Column(publication_venue) ############
name: publication_venue
name: publication_venue
path: publication_venue
path: publication_venue
max_definition_level: 0
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: BYTE_ARRAY
physical_type: BYTE_ARRAY
logical_type: String
logical_type: String
converted_type (legacy): UTF8
converted_type (legacy): UTF8
compression: GZIP (space_saved: 59%)
compression: GZIP (space_saved: 59%)
############ Column(publisher_name) ############
############ Column(publisher_name) ############
name: publisher_name
name: publisher_name
path: publisher_name
path: publisher_name
max_definition_level: 1
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: BYTE_ARRAY
physical_type: BYTE_ARRAY
logical_type: String
logical_type: String
converted_type (legacy): UTF8
converted_type (legacy): UTF8
compression: GZIP (space_saved: 49%)
compression: GZIP (space_saved: 48%)
############ Column(doi) ############
############ Column(doi) ############
name: doi
name: doi
path: doi
path: doi
max_definition_level: 0
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: BYTE_ARRAY
physical_type: BYTE_ARRAY
logical_type: String
logical_type: String
converted_type (legacy): UTF8
converted_type (legacy): UTF8
compression: GZIP (space_saved: 61%)
compression: GZIP (space_saved: 59%)
############ Column(pmcid) ############
############ Column(pmcid) ############
name: pmcid
name: pmcid
path: pmcid
path: pmcid
max_definition_level: 1
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: BYTE_ARRAY
physical_type: BYTE_ARRAY
logical_type: String
logical_type: String
converted_type (legacy): UTF8
converted_type (legacy): UTF8
compression: GZIP (space_saved: 63%)
compression: GZIP (space_saved: 63%)
############ Column(pmid) ############
############ Column(pmid) ############
name: pmid
name: pmid
path: pmid
path: pmid
max_definition_level: 1
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: BYTE_ARRAY
physical_type: BYTE_ARRAY
logical_type: String
logical_type: String
converted_type (legacy): UTF8
converted_type (legacy): UTF8
compression: GZIP (space_saved: 57%)
compression: GZIP (space_saved: 56%)
############ Column(genre) ############
############ Column(genre) ############
name: genre
name: genre
path: genre
path: genre
max_definition_level: 1
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: BYTE_ARRAY
physical_type: BYTE_ARRAY
logical_type: String
logical_type: String
converted_type (legacy): UTF8
converted_type (legacy): UTF8
compression: GZIP (space_saved: 60%)
compression: GZIP (space_saved: 56%)
############ Column(license_type) ############
############ Column(license_type) ############
name: license_type
name: license_type
path: license_type
path: license_type
max_definition_level: 1
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: BYTE_ARRAY
physical_type: BYTE_ARRAY
logical_type: String
logical_type: String
converted_type (legacy): UTF8
converted_type (legacy): UTF8
compression: GZIP (space_saved: 49%)
compression: GZIP (space_saved: 45%)
############ Column(has_mentions) ############
############ Column(has_mentions) ############
name: has_mentions
name: has_mentions
path: has_mentions
path: has_mentions
max_definition_level: 0
max_definition_level: 1
max_repetition_level: 0
max_repetition_level: 0
physical_type: BOOLEAN
physical_type: BOOLEAN
logical_type: None
logical_type: None
converted_type (legacy): NONE
converted_type (legacy): NONE
compression: GZIP (space_saved: 99%)
compression: GZIP (space_saved: 99%)
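
The comparison above was done by eye in a diff viewer. As a rough sketch, the same field-by-field comparison could be scripted by parsing the two `parquet-tools inspect` outputs as plain text. The helper names `parse_inspect` and `diff_metadata` are hypothetical, and the parser assumes only the layout visible in this diff (`#### section ####` headers followed by `key: value` lines). The sample inputs are a short excerpt of the outputs shown here, not the full files.

```python
import re

# Matches header lines such as "############ Column(paper_id) ############".
SECTION = re.compile(r"^#+ (.+?) #+$")


def parse_inspect(text):
    """Parse inspect-style output into {section: {key: value}} (assumed layout)."""
    sections = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        header = SECTION.match(line)
        if header:
            current = sections.setdefault(header.group(1), {})
        elif current is not None and ": " in line:
            # Split once so values like "GZIP (space_saved: 13%)" stay intact.
            key, value = line.split(": ", 1)
            current[key] = value
    return sections


def diff_metadata(left, right):
    """List (section, key, left_value, right_value) for every differing field."""
    a, b = parse_inspect(left), parse_inspect(right)
    changes = []
    for section in {**a, **b}:  # union of sections, first-seen order
        fields_a, fields_b = a.get(section, {}), b.get(section, {})
        for key in {**fields_a, **fields_b}:
            if fields_a.get(key) != fields_b.get(key):
                changes.append((section, key, fields_a.get(key), fields_b.get(key)))
    return changes


# Excerpts copied from the two outputs in this diff.
LEFT = """\
############ file meta data ############
created_by: parquet-go version 18.0.0-SNAPSHOT
num_columns: 13
serialized_size: 1819
############ Column(paper_id) ############
name: paper_id
max_definition_level: 0
compression: GZIP (space_saved: 13%)
"""

RIGHT = """\
############ file meta data ############
created_by: parquet-cpp-arrow version 20.0.0-SNAPSHOT
num_columns: 13
serialized_size: 3124
############ Column(paper_id) ############
name: paper_id
max_definition_level: 1
compression: GZIP (space_saved: 22%)
"""

for section, key, old, new in diff_metadata(LEFT, RIGHT):
    print(f"{section}.{key}: {old!r} -> {new!r}")
```

On the interpretation of one field: for a flat, non-nested column, a `max_definition_level` of 0 normally means the column is declared required and 1 means it is optional. The 0 to 1 changes on `paper_id`, `softcite_id`, `publication_venue`, `doi`, and `has_mentions` therefore suggest that those columns became nullable in the rewritten file.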