在将字符变量转换为整数时，有一条消息说：强制引入的 NA。如何避免此错误？-解网

问：

我尝试使用函数将字符变量转换为整数变量。但是，在执行代码时，输出将返回值为。代码如下：as.integerNA

library(tidyverse)
coal_data <- read.csv("http://594442.youcanlearnit.net/coal.csv", skip = 2)
coal_data %>% glimpse()
colnames(coal_data)[1] <- "region"
coal_long <- gather(coal_data, 'year', 'coal_consumption', -region)
coal_long %>% glimpse()
coal_long %>% separate(year, into = c("x", "year"), sep = "X")%>%
    select(-x)%>% glimpse()
class(coal_long$year)
coal_long$year <- as.integer(coal_long$year)

输出如下

coal_long %>% glimpse()



 Rows: 6,960
    Columns: 3
    $ region           <fct> "North America", "Bermuda", "Canada", "Greenland", "Mexico",...
    $ year             <int> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, ...
    $ coal_consumption <chr> "16.45179", "0", "0.96156", "0.00005", "0.10239", "0", "15.3...

预期的实际产出以整数形式获得这一年。非常感谢您提前调查此事。

r dplyr 类型转换整数 na

coal_long <- coal_long %>% 
  separate(year, into = c("x", "year"), sep = "X") %>% 
  select(-x) %>% 
  glimpse()

coal_long$year <- as.integer(coal_long$year)

coal_long %>% glimpse()

Rows: 6,960
Columns: 3
$ region           <fct> "North America", "Bermuda", "Canada", "Greenland", "Mexico", "Saint Pierre and Miquelon", "United States", "Cent…
$ year             <int> 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980…
$ coal_consumption <chr> "16.45179", "0", "0.96156", "0.00005", "0.10239", "0", "15.38779", "0.42011", "0", "0", "0.03476", "--", "0", "0…

3赞 Chuck P 5/15/2020 #3

不妨在你做的时候coal_consumption加倍......

library(tidyverse)

coal_data <- read.csv("http://594442.youcanlearnit.net/coal.csv", skip = 2, na.strings = "--")

colnames(coal_data)[1] <- "region"
coal_long <- gather(coal_data, 'year', 'coal_consumption', -region)
coal_long %>% glimpse()
#> Rows: 6,960
#> Columns: 3
#> $ region           <chr> "North America", "Bermuda", "Canada", "Greenland", "…
#> $ year             <chr> "X1980", "X1980", "X1980", "X1980", "X1980", "X1980"…
#> $ coal_consumption <dbl> 16.45179, 0.00000, 0.96156, 0.00005, 0.10239, 0.0000…
coal_long <- coal_long %>% separate(year, into = c("x", "year"), sep = "X") %>%
  select(-x) %>% glimpse()
#> Rows: 6,960
#> Columns: 3
#> $ region           <chr> "North America", "Bermuda", "Canada", "Greenland", "…
#> $ year             <chr> "1980", "1980", "1980", "1980", "1980", "1980", "198…
#> $ coal_consumption <dbl> 16.45179, 0.00000, 0.96156, 0.00005, 0.10239, 0.0000…
class(coal_long$year)
#> [1] "character"
coal_long$year <- as.integer(str_remove(coal_long$year, "X"))
glimpse(coal_long)
#> Rows: 6,960
#> Columns: 3
#> $ region           <chr> "North America", "Bermuda", "Canada", "Greenland", "…
#> $ year             <int> 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980, 1980…
#> $ coal_consumption <dbl> 16.45179, 0.00000, 0.96156, 0.00005, 0.10239, 0.0000…

在将字符变量转换为整数时，有一条消息说：强制引入的 NA。如何避免此错误？

While converting character variable into integer, there is a message saying : NAs introduced by coercion. How to avoid this error?

评论

评论

评论