如何用$ R数据帧中的某些特定值替换$符号?
有时我们会得到非常脏的数据,这就是为什么数据分析是一项艰巨的任务。大多数数据科学家都在寻找干净的数据,但是由于数据仓库通常只关注数据可用性而不是数据质量,因此这几乎是不可能的。头部抓挠的情况之一是获得不必要的值以随机方式放置在不同位置,$符号也是该类型的值。我们可以使用lapply函数将其从R数据帧中删除。
示例
请看以下数据帧:
> x<-sample(c("A","$B","C"),20,replace=TRUE)> y<-sample(c("I","II","$II"),20,replace=TRUE)
> df1<-data.frame(x,y)
> df1
输出结果
x y1 C $II
2 C II
3 A I
4 $B $II
5 $B $II
6 A I
7 A $II
8 C I
9 $B II
10 $B II
11 C $II
12 A II
13 $B II
14 C I
15 C $II
16 C I
17 C II
18 $B I
19 $B II
20 C $II
从df1中的每个位置删除$符号:
示例
> df1<-lapply(df1,gsub,pattern='\\$',replacement='')> df1
$x
输出结果
[1] "C" "C" "A" "B" "B" "A" "A" "C" "B" "B" "C" "A" "B" "C" "C" "C" "C" "B" "B"[20] "C"
示例
$y
输出结果
[1] "II" "II" "I" "II" "II" "I" "II" "I" "II" "II" "II" "II" "II" "I" "II"[16] "I" "II" "I" "II" "II"
让我们看另一个例子:
示例
> Price<-sample(c("1$","2$","3$","4$"),20,replace=TRUE)> Group<-sample(c("$First","$Second","Third"),20,replace=TRUE)
> df2<-data.frame(Price,Group)
> df2
输出结果
Price Group1 3$ $Second
2 2$ Third
3 1$ Third
4 2$ $Second
5 2$ $First
6 4$ $First
7 2$ $First
8 3$ $First
9 2$ Third
10 4$ Third
11 3$ $First
12 3$ Third
13 3$ $Second
14 2$ $First
15 4$ Third
16 3$ $First
17 4$ Third
18 2$ $First
19 2$ $Second
20 3$ Third
从df2中的每个位置删除$符号:
示例
> df2<-lapply(df2,gsub,pattern='\\$',replacement='')> df2
输出结果
$Price[1] "3" "2" "1" "2" "2" "4" "2" "3" "2" "4" "3" "3" "3" "2" "4" "3" "4" "2" "2"
[20] "3"
$Group
[1] "Second" "Third" "Third" "Second" "First" "First" "First" "First"
[9] "Third" "Third" "First" "Third" "Second" "First" "Third" "First"
[17] "Third" "First" "Second" "Third"
以上是 如何用$ R数据帧中的某些特定值替换$符号? 的全部内容, 来源链接: utcz.com/z/326446.html