Python Pandas - 删除重复值的返回索引保留最后一次出现
要返回删除重复值并保留最后一次出现的索引,请使用该方法。使用值为last的keep参数。index.drop_duplicates()
首先,导入所需的库 -
import pandas as pd
创建具有一些重复项的索引 -
index = pd.Index(['Car','Bike','Airplane','Ship','Airplane'])
显示索引 -
print("Pandas Index with duplicates...\n",index)
删除重复值的返回索引。值为“last”的“keep”参数保留每组重复条目的最后一次出现 -
print("\nIndex with duplicate values removed (keeping the last occurrence)...\n",index.drop_duplicates(keep='last'))
示例
以下是代码 -
import pandas as pd输出结果# 创建具有一些重复项的索引
index = pd.Index(['Car','Bike','Airplane','Ship','Airplane'])
# 显示索引
print("Pandas Index with duplicates...\n",index)
# 返回数据的 dtype
print("\nThe dtype object...\n",index.dtype)
# 获取数据中的字节
print("\nGet the bytes...\n",index.nbytes)
# 获取数据的维度
print("\nGet the dimensions...\n",index.ndim)
# 删除重复值的返回索引
# The "keep" 带值的参数 "last" keeps the last occurrence for each set of duplicated entries
print("\nIndex with duplicate values removed (keeping the last occurrence)...\n",index.drop_duplicates(keep='last'))
这将产生以下代码 -
Pandas Index with duplicates...Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'], dtype='object')
The dtype object...
object
Get the bytes...
40
Get the dimensions...
1
Index with duplicate values removed (keeping the last occurrence)...
Index(['Car', 'Bike', 'Ship', 'Airplane'], dtype='object')
以上是 Python Pandas - 删除重复值的返回索引保留最后一次出现 的全部内容, 来源链接: utcz.com/z/350387.html