join - How to replace multiple column values based on another table's column where each table has a common column value

I have a large table (original_df) which includes 10+ columns and 1000+ rows. One of those columns (street_column) lists street intersections. Another column (coordinates_column) lists a combined latitude and longitude coordinate. The street_column row values can have many duplicates.

My issue is that coordinates_column currently has many different values for the same street_column value.

My goal is to remove variation in the coordinates_column values to ensure each unique street_column value always correlates with the same coordinates_column value. I have created a second table (key_df) that only includes 2 columns: one for unique street_column values and the other for its unique coordinates_column values.

I would like to replace every coordinates_column value in the original_df table with the coordinates_column value from the key_df table that has the matching street_column value.

I think this is likely super basic, but I am brand new to R and don't know where to start, and I have had no luck finding an answer to this scenario. I am currently using RStudio (Posit).

original_df table

ride_id | street_column       | coordinates_column |
001     | Ash St & 1st Ave    | -100.123,98.123    |
002     | Ash St & 1st Ave    | -100.100,98.123    |
003     | Brooke St & Rose Rd | 90.456,91.456      |
004     | Brooke St & Rose Rd | 90.400,91.987      |
005     | 9th Ave & Center St | 20.567,-100.654    |
006     | 9th Ave & Center St | 21.123,-100.654    |
007     | 9th Ave & Center St | 20.567,-101.100    |

key_df table

street_column       | coordinates_column |
Ash St & 1st Ave    | -100.123,98.123    |
Brooke St & Rose Rd | 90.456,91.456      |
9th Ave & Center St | 20.567,-100.654    |

desired change to original_df table

ride_id | street_column       | coordinates_column |
001     | Ash St & 1st Ave    | -100.123,98.123    |
002     | Ash St & 1st Ave    | -100.123,98.123    |
003     | Brooke St & Rose Rd | 90.456,91.456      |
004     | Brooke St & Rose Rd | 90.456,91.456      |
005     | 9th Ave & Center St | 20.567,-100.654    |
006     | 9th Ave & Center St | 20.567,-100.654    |
007     | 9th Ave & Center St | 20.567,-100.654    |

original_df <- 
  structure(list(
    ride_id = c("1", "2", "3", "4", "5", "6", "7"), 
    street_column = c("Ash St & 1st Ave", "Ash St & 1st Ave", 
                      "Brooke St & Rose Rd", "Brooke St & Rose Rd", 
                      "9th Ave & Center St", "9th Ave & Center St", "9th Ave & Center St"), 
    coordinates_column = c("-100.123,98.123", "-100.100,98.123", "90.456,91.456", 
                           "90.400,91.987", "20.567,-100.654", "21.123,-100.654", "20.567,-101.100")), 
    row.names = c(NA, -7L), class = "data.frame")

key_df <-
  structure(list(
    street_column = c("Ash St & 1st Ave", "Brooke St & Rose Rd", "9th Ave & Center St"), 
    coordinates_column = c("-100.123,98.123", "90.456,91.456", "20.567,-100.654")), 
    row.names = c(NA, -3L), class = "data.frame")

desired_df <-
  structure(list(
    ride_id = c("1", "2", "3", "4", "5", "6", "7"), 
    street_column = c("Ash St & 1st Ave", "Ash St & 1st Ave", 
                      "Brooke St & Rose Rd", "Brooke St & Rose Rd", 
                      "9th Ave & Center St", "9th Ave & Center St", "9th Ave & Center St"), 
    coordinates_column = c("-100.123,98.123", "-100.123,98.123", "90.456,91.456", 
                           "90.456,91.456", "20.567,-100.654", "20.567,-100.654", "20.567,-100.654")), 
    row.names = c(NA, -7L), class = "data.frame")

Asked by Tyler Wahlquist; edited by Rui Barradas.

2 Answers

match() can be used to find the row number in key_df which matches by street_column for each row in original_df:

original_df$coordinates_column <- key_df$coordinates_column[
  match(original_df$street_column, key_df$street_column)
]

identical(original_df, desired_df)
#> [1] TRUE
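
To see what match() is doing here, a minimal sketch with made-up vectors may help. The names `keys`, `values`, `lookup` and `old` are hypothetical stand-ins, not part of the question's data. match() returns, for each element of its first argument, the position of its first match in the second argument, or NA when there is none. Indexing with NA gives NA, so a street that is missing from key_df would end up with an NA coordinate. The last step shows one possible way to keep the old value in that case.

```r
keys   <- c("A", "B", "C")          # stands in for key_df$street_column
values <- c("1,1", "2,2", "3,3")    # stands in for key_df$coordinates_column
lookup <- c("B", "A", "B", "Z")     # stands in for original_df$street_column; "Z" has no key
old    <- c("2,9", "1,9", "2,8", "9,9")  # stands in for the inconsistent coordinates

# Position of each lookup value in keys (NA where there is no match)
idx <- match(lookup, keys)
idx
#> [1]  2  1  2 NA

# Indexing by those positions pulls the matching value for every row
values[idx]

# Keep the old value wherever no key was found
new <- ifelse(is.na(idx), old, values[idx])
new
#> [1] "2,2" "1,1" "2,2" "9,9"
```

Whether unmatched streets should keep their old coordinates or become NA depends on the data; with the question's tables every street has a key, so the fallback is never used.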

Or, use dplyr to join original_df with key_df, then drop the original coordinates_column:

library(dplyr)

original_df <- original_df |> 
  left_join(key_df, by = join_by(street_column), suffix = c(".x", "")) |>
  select(-coordinates_column.x)

identical(original_df, desired_df)
#> [1] TRUE

Here is a base R way.
First find the positions of the columns of interest, "street_column" and "coordinates_column", in original_df. Then merge original_df, without its coordinates column, with key_df.
Finally, put the columns back in their original order.

If you want the rows to be in the original order too, create an index to their original order and use it to sort the result.

i <- grep("coordinates_column", names(original_df))
j <- grep("street_column", names(original_df))
cols_order <- c(j, (1:ncol(original_df))[-j])
result <- merge(original_df[-i], key_df)[cols_order]

# merge() sorts its result by street_column, so invert that same ordering
o <- order(original_df$street_column)
result[order(o),]
#>   ride_id       street_column coordinates_column
#> 4       1    Ash St & 1st Ave    -100.123,98.123
#> 5       2    Ash St & 1st Ave    -100.123,98.123
#> 6       3 Brooke St & Rose Rd      90.456,91.456
#> 7       4 Brooke St & Rose Rd      90.456,91.456
#> 1       5 9th Ave & Center St    20.567,-100.654
#> 2       6 9th Ave & Center St    20.567,-100.654
#> 3       7 9th Ave & Center St    20.567,-100.654
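
As an alternative to inverting the sort order, one could carry an explicit row index through the merge and sort by it afterwards. The following is only a sketch under stated assumptions: `rides`, `key` and the helper column `row_id` are hypothetical names, and the small data frames are made up so the block runs on its own.

```r
rides <- data.frame(
  ride_id = c("1", "2", "3"),
  street_column = c("B St", "A St", "B St"),
  coordinates_column = c("2,2", "1,9", "2,7"),
  stringsAsFactors = FALSE
)
key <- data.frame(
  street_column = c("A St", "B St"),
  coordinates_column = c("1,1", "2,2"),
  stringsAsFactors = FALSE
)

# Drop the inconsistent coordinates and remember each row's original position
tmp <- rides[setdiff(names(rides), "coordinates_column")]
tmp$row_id <- seq_len(nrow(tmp))

# merge() sorts by street_column by default; all.x = TRUE keeps rows without a key
merged <- merge(tmp, key, by = "street_column", all.x = TRUE)

# Restore the original row and column order; selecting by name drops row_id
result <- merged[order(merged$row_id), names(rides)]
rownames(result) <- NULL
result
```

Because the index travels with each row, this does not rely on how merge() orders rows that share the same street.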

Created on 2025-01-10 with reprex v2.1.1
