Filter out all rows with only one period in R

I have this column, Identifier, with character values.

structure(list(Identifier = c("RL.K", "RL.K.1", "RL.K.2", "RL.K.3", 
"RL.K.4", "RL.K.5", "RL.K.6", "RL.K.7", "RL.K.9", "RL.K.10", 
"RI.K", "RI.K.1", "RI.K.2", "RI.K.3", "RI.K.4", "RI.K.5", "RI.K.6", 
"RI.K.7", "RI.K.9", "RI.K.10", "RF.K", "RF.K.1")), row.names = c(NA, 
-22L), class = c("tbl_df", "tbl", "data.frame"))

How do I filter out the values with only one period, so that I can remove rows 1, 11, and 21?

edited Nov 20 '18 at 17:46

Wiktor Stribiżew

310k16131206

asked Nov 20 '18 at 17:43

JasonBaik

17510

r dplyr

4 Answers
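We can count the number of . characters in 'Identifier' and use the count as a logical condition for filtering the rows. The condition below keeps the rows with exactly one dot; change == 1 to != 1 to drop them instead.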

library(tidyverse)

df1 %>%
   filter(str_count(Identifier, "[.]") == 1)
# A tibble: 3 x 1
#  Identifier
#  <chr>
#1 RL.K
#2 RI.K
#3 RF.K

Or, as @WiktorStribizew mentioned, the pattern can be wrapped in fixed() to make it faster

df1 %>%
   filter(str_count(Identifier, fixed(".")) == 1)

Or without using any external libraries,

df1[nchar(gsub("[^.]*", "", df1$Identifier)) == 1,]

Or using gregexpr from base R

df1[lengths(gregexpr(".", df1$Identifier, fixed = TRUE)) == 1,]
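One caveat with the gregexpr variant: gregexpr returns -1 for a string with no match, so lengths() reports 1 for a value that has no dot at all. That does not matter for the data in the question, where every value contains a dot. A minimal sketch of a count that returns 0 in that case, using regmatches; count_dots and the ids sample are hypothetical names and values, not part of the original answer:

```r
# Hypothetical sample: one dot, two dots, no dot, three dots
ids <- c("RL.K", "RL.K.1", "RF", "A.B.C.D")

# regmatches() yields character(0) when there is no match,
# so lengths() gives 0 rather than 1 for a value with no dot
count_dots <- function(x) {
  lengths(regmatches(x, gregexpr(".", x, fixed = TRUE)))
}

count_dots(ids)

# Drop the values with exactly one dot, keep everything else
ids[count_dots(ids) != 1]
```

With dplyr and stringr, the same negation would read filter(str_count(Identifier, fixed(".")) != 1).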

edited Nov 20 '18 at 18:12

answered Nov 20 '18 at 17:44

akrun

400k13190265

Why regex? There is just a dot to find, use str_count(Identifier, fixed("."))

– Wiktor Stribiżew
Nov 20 '18 at 17:45

Wow, that was a quickie!

– JasonBaik
Nov 20 '18 at 17:47

If we're going to use base R and grepl, there's a simpler regex:

df[grepl("\\..*\\.", df$Identifier),]

(Explanation of the regex: \\. is how a literal dot, regex \., is written in an R string, and .* matches anything, so this keeps the rows that contain two literal dots separated by anything.)
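A literal dot can be written in an R pattern either with a doubled backslash or inside a character class. The sketch below rebuilds the identifiers from the question as a plain vector (the name ids is an assumption for illustration) and checks that both spellings select the same values:

```r
# Same identifiers as in the question, built compactly
ids <- c("RL.K", paste0("RL.K.", c(1:7, 9, 10)),
         "RI.K", paste0("RI.K.", c(1:7, 9, 10)),
         "RF.K", "RF.K.1")

# Two spellings of "a dot, anything, another dot"
two_dots_escaped <- grepl("\\..*\\.", ids)   # backslash-escaped dot
two_dots_class   <- grepl("[.].*[.]", ids)   # dot inside a character class

# Both give the same answer, and the positions that fail the test
# are the ones the question wants removed
identical(two_dots_escaped, two_dots_class)
which(!two_dots_escaped)
```

The character-class form avoids the backslash doubling, which is easy to lose when code is copied between tools.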

answered Nov 20 '18 at 17:58

iod

3,5792722

A solution using base R (this finds all strings with exactly one dot):

grepl("^[^.]*[.][^.]*$", df1$Identifier)

To remove the rows with one dot use:

df1[!grepl("^[^.]*[.][^.]*$", df1$Identifier), ]
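"Remove the rows with exactly one dot" and "keep the rows with at least two dots" give the same result on the question's data, but they differ for a value with no dot at all. A small sketch with a hypothetical vector x to show the difference:

```r
x <- c("RF", "RF.K", "RF.K.1")   # hypothetical: zero, one, two dots

# Remove values with exactly one dot: the no-dot value survives
keep_not_one <- !grepl("^[^.]*[.][^.]*$", x)

# Keep values with at least two dots: the no-dot value is dropped too
keep_two_plus <- grepl("[.].*[.]", x)

x[keep_not_one]
x[keep_two_plus]
```

Which behaviour is wanted depends on whether dot-free identifiers can occur and should be kept.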

edited Nov 22 '18 at 11:11

answered Nov 20 '18 at 17:48

Andre Elrico

5,63311027

It needs a ! in front of the grepl expression, since you want to filter out those with only one ., which is what the regex is searching for.

– Gwang-Jin Kim
Nov 20 '18 at 17:59

thanks @Gwang-JinKim. I just realized "filter out" meant "remove".

– Andre Elrico
Nov 22 '18 at 11:09

With as little Regex as possible ;):

has.only.one.dot <- function(str_vec) sapply(strsplit(str_vec, "\\."), function(vec) length(vec) == 2)

df[!has.only.one.dot(df$Identifier), ]

However, the list-based approach with sapply and strsplit is slower than a regex solution:

has.only.one.dot <- function(str_vec) grepl("\\.", str_vec) & !grepl("\\..*\\.", str_vec)

df[!has.only.one.dot(df$Identifier), ]
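For a version with no regex at all, one option is to count how many characters disappear when the dots are deleted with fixed = TRUE. This is a sketch under that assumption, not the answer's own method; has_one_dot and x are hypothetical names. It also sidesteps an edge case of the strsplit approach, which drops a trailing empty piece:

```r
x <- c("RL.K", "RL.K.1", "RF", "RL.")   # hypothetical sample

# Count dots as the number of characters lost when dots are deleted;
# fixed = TRUE treats "." as a plain character, not a regex
has_one_dot <- function(str_vec) {
  nchar(str_vec) - nchar(gsub(".", "", str_vec, fixed = TRUE)) == 1
}

has_one_dot(x)

# strsplit() drops a trailing empty piece, so "RL." yields one piece
lengths(strsplit(x, ".", fixed = TRUE))
```

The subtraction is vectorised, so no sapply is needed.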

edited Nov 20 '18 at 18:11

answered Nov 20 '18 at 18:06

Gwang-Jin Kim

2,421116
