Skip to contents

Get bioRxiv preprints

Usage

get_preprints(
  subject = "all",
  baseurl = "https://connect.biorxiv.org/biorxiv_xml.php?subject=",
  clean = TRUE
)

Arguments

subject

A character vector of valid biorxiv subjects. See subjects.

baseurl

The base URL for the biorxiv RSS feed. Default is https://connect.biorxiv.org/biorxiv_xml.php?subject=. Do not change unless you know what you are doing.

clean

Logical; try to strip out graphical abstract information? If TRUE, this strips away any text between O_FIG and C_FIG, and the words graphical abstract from the abstract text in the RSS feed.

Value

A data frame of bioRxiv preprints.

Examples

preprints <- get_preprints(subject=c("bioinformatics", "genomics"))
preprints
#> # A tibble: 60 × 4
#>    subject        title                                           url   abstract
#>    <chr>          <chr>                                           <chr> <chr>   
#>  1 bioinformatics "PHIStruct: Improving phage-host interaction p… http… Recent …
#>  2 bioinformatics "BGC Atlas: A Web Resource for Exploring the G… http… Seconda…
#>  3 bioinformatics "Enrichment analysis for spatial and single-ce… http… Imaging…
#>  4 bioinformatics "SpatialLeiden - Spatially-aware Leiden cluste… http… Cluster…
#>  5 bioinformatics "Protein sequence classification using natural… http… Protein…
#>  6 bioinformatics "A multi-omics approach to identify deleteriou… http… Crops l…
#>  7 bioinformatics "Automatic crystal identification for crystall… http… Crystal…
#>  8 bioinformatics "Solving the \"Blind men and the elephant prob… http… Biologi…
#>  9 bioinformatics "Evolutionary mismatch between nuclear and mit… http… Serrano…
#> 10 bioinformatics "Predicting small-molecule inhibition of prote… http… Protein…
#> # ℹ 50 more rows