Implement distinct() #17

msberends · 2020-05-04T10:06:17Z

Suggestion:

distinct <- function(.data, ..., .keep_all = FALSE) {
  check_is_dataframe(.data)
  UseMethod("distinct")
}

distinct.default <- function(.data, ..., .keep_all = FALSE) {
  names <- rownames(.data)
  rownames(.data) <- NULL
  if (length(deparse_dots(...)) == 0) {
    selected <- .data
  } else {
    selected <- select(.data, ...)
  }
  rows <- as.integer(rownames(unique(selected)))
  if (isTRUE(.keep_all)) {
    res <- .data[rows, , drop = FALSE]
  } else {
    res <- selected[rows, , drop = FALSE]
  }
  rownames(res) <- names[rows]
  res
}

distinct.grouped_data <- function(.data, ..., .keep_all = FALSE) {
  apply_grouped_function(.data, "distinct", ..., .keep_all = .keep_all)
}

I cannot get the grouped version to work to also include the group variables. It now only returns the distinct variable if set...

Another idea - You should mention on your README that this package is a great, great idea for package developers that do not want to be dependent on dplyr (as it changes too often for sustainable pkg development), but do want to code using dplyr methods. For those users you could also create an extra raw syntax file with all your functions without roxygen parts (remove all lines starting with #') and your name on it, so they can copy it to their package.

The text was updated successfully, but these errors were encountered:

nathaneastwood · 2020-05-04T12:24:02Z

Thanks for the code 🙂 I’ll take a look when I get a chance.

Another idea - You should mention on your README that this package is a great, great idea for package developers that do not want to be dependent on dplyr (as it changes too often for sustainable pkg development), but do want to code using dplyr methods.

Yes and no. It’s good because there are much fewer dependencies. But the aim of poorman is to replicate dplyr as closely as possible. So any changes seen in dplyr are likely to make their way into poorman at some point.

For those users you could also create an extra raw syntax file with all your functions without roxygen parts (remove all lines starting with #') and your name on it, so they can copy it to their package.

Hmm, I think it might just be easier to depend on poorman to be honest. As more features get added, the code is likely to get more complex. Given there will only ever be the one package to depend on with poorman, I don’t think it’s too much of an issue.

msberends · 2020-05-04T13:00:01Z

👍

Closes #17

nathaneastwood added the feature request New feature or request label May 4, 2020

nathaneastwood added this to the Next set of `dplyr` features milestone May 10, 2020

nathaneastwood closed this as completed in b074c7a May 14, 2020

nathaneastwood added a commit that referenced this issue May 14, 2020

feat: Add distinct()

8ac5aef

Closes #17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement distinct() #17

Implement distinct() #17

msberends commented May 4, 2020

nathaneastwood commented May 4, 2020

msberends commented May 4, 2020

Implement distinct() #17

Implement distinct() #17

Comments

msberends commented May 4, 2020

nathaneastwood commented May 4, 2020

msberends commented May 4, 2020