Skip to content

R package to access USPTO bulk data in tidy, rectangular format

License

Unknown, MIT licenses found

Licenses found

Unknown
LICENSE
MIT
LICENSE.md
Notifications You must be signed in to change notification settings

JYProjs/patentr

Repository files navigation

patentr: Access USPTO Bulk Data in Tidy Rectangular Format

Travis-CI Build Status AppVeyor Build Status Coverage Status

License: MIT CRAN version CRAN Downloads

Overview

The patentr R package allows easy access to USPTO (United States Patent and Trademark Office) bulk patent data in tidy, rectangular format. By downloading, converting, and storing patent data directly from the USPTO website, patentr minimizes the work needed to acquire usable data, allowing users to focus on analyzing the data.

Installation

To install patentr, run the following R code:

# install from CRAN
install.packages("patentr")

# install development version from GitHub
devtools::install_github("JYProjs/patentr")

Sample code

Bulk patent data can be downloaded using the year and week (within each year) as follows:

# load patentr
library("patentr")

# download patents from the first week of 1976
get_bulk_patent_data(year = 1976,
                     week = 1,
                     output_file = "patent_output1.csv")

# download patents from:
#   1. week 1 of 1976 (TXT format in USPTO)
#   2. week 48 of 2002 (XML format 1 in USPTO)
#   3. week 19 of 2006 (XML format 2 in USPTO)
# N.B. it will take a few minutes to run the next line
get_bulk_patent_data(year = c(1976, 2002, 2006),
                     week = c(1, 48, 19),
                     output_file = "patent_output2.csv")

Data collected for each patent

  • patent title
  • application date
  • issue date
  • inventor name(s)
  • assignee name(s)
  • ICL classification
  • unique identifier (WKU)
  • referenced patent numbers
  • claims

Contribute

To contribute to patentr, you can create issues for any bugs/suggestions on the issues page. You can also fork the patentr repository and create pull requests to add features you think will be useful for users.

Citation

Wadhwa RR, Yu J, Beltz H, Desai MY, Érdi P, Scott JG. patentr: Access USPTO Bulk Data in Tidy Rectangular Format. 2021; R package version 0.1.0. URL https://github.com/JYProjs/patentr.

About

R package to access USPTO bulk data in tidy, rectangular format

Topics

Resources

License

Unknown, MIT licenses found

Licenses found

Unknown
LICENSE
MIT
LICENSE.md

Stars

Watchers

Forks

Packages

No packages published