Extracting everything after first two words in R

jzurks

I am trying to extract all the info, using a regular expression in R, after the first number and first word of an entry in a data frame.

For example:

Header <- c("2006 Volvo XC70",
            "2012 Ford Econoline Cargo Van E-250 Commercial",
            "2012 Nissan Frontier",
            "2012 Kia Soul 5dr Wagon Automatic")

I want to write a pattern that will grab XC70, or Econoline Cargo Van E-250 Commercial (everything after the year and make), from each entry in my "header" column, so that I can run the function on my data frame and create a new "model" column. I can't figure out a pattern that skips the first string of digits, then a space, then the first word, then another space, and grabs everything that follows.

Any help would be appreciated. Thanks!

Avinash Raj

Just use sub.

sub("^\\d+\\s+\\w+\\s+", "", df$x)

Example:

x <- "2012 Ford Econoline Cargo Van E-250 Commercial"
sub("^\\d+\\s+\\w+\\s+", "", x)
# [1] "Econoline Cargo Van E-250 Commercial"
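
To create the "model" column the question asks for, the same call can be applied to the whole column at once, since sub is vectorized. The following is a minimal sketch: the data frame name `df` and the column names `header` and `model` are assumptions taken from the wording of the question, not from real data.

```r
# Hypothetical data frame built from the sample values in the question.
df <- data.frame(
  header = c("2006 Volvo XC70",
             "2012 Ford Econoline Cargo Van E-250 Commercial",
             "2012 Nissan Frontier",
             "2012 Kia Soul 5dr Wagon Automatic"),
  stringsAsFactors = FALSE
)

# Remove the leading digits (year), the first word (make), and the
# whitespace after each; whatever remains is stored as the model.
df$model <- sub("^\\d+\\s+\\w+\\s+", "", df$header)

print(df$model)
# Expected values: "XC70", "Econoline Cargo Van E-250 Commercial",
# "Frontier", "Soul 5dr Wagon Automatic"
```

Note that \\w+ only matches letters, digits and underscores, so a make containing a hyphen (for example "Mercedes-Benz") would not match and the string would be returned unchanged. Replacing \\w+ with \\S+ is one possible way to handle that case.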

Edited at 2021-02-23