Skip to content
This repository has been archived by the owner on Feb 3, 2020. It is now read-only.
/ whitebread Public archive

A simple Python library for mundane Fedora 3.8 tasks.

Notifications You must be signed in to change notification settings

markpbaggett/whitebread

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Whitebread Fedora

Note: This project has been deprecated and is now archived in favor of a statically compiled application built in nim. For an actively developed and maintained project that has 95% of the functionality here plus much more, please see moldybread. Compilied binaries for Linux x86 can be found in the releases section. If you're using another another architecture instructions for compiling can be found in the README. All other documentation can be found online in the project's documenation respository.

Simple python scripts for those everyday, mundane Fedora tasks.

Count Matching Objects

>>> python run.py -o count_objects -dc type -dcs StillImage
>>> python run.py -o count_objects -p vanvactor

Harvest Metadata!

By Matching Parent Namespace:

>>> python run.py -p smhc -o harvest_metadata

By Dublincore Field Matching:

>>> python run.py -o harvest_metadata -dc rights -dcs "In Copyright"

Override Default DSID without Touching YAML:

>>> python run.py -o harvest_metadata -p smhc -ds DC

Harvest Metadata but Ignore Records for Pages

>>> python run.py -o harvest_metadata_no_pages -p smhc -ds MODS

Grab Images

Like above, but for image mime types

>>> python run.py -p smhc -o grab_images -ds JP2

Download Binaries!

Just like above examples but with a different operator (use for things that aren't images or test).

>>> python run.py -o grab_other -p smhc -ds PDF

Update GSearch!

You guessed it!

>>> python run.py -o update_gsearch -p smhc

Update GSearch for all objects that aren't pages

>>> python run.py -o update_gsearch_no_pages -p smhc

Update fgsLabel!

Like above, but drop in an xpath value to match on. Only works with mods right now."

>>> python run.py -o update_labels -p swim -xp "//mods:titleInfo[@supplied='yes']/mods:title"

Purge All But the Newest Version of a Datastream

>>> python run.py -o purge_old_dsids -p vanvactor -ds MODS

Check MimeType of the Preservation Object

>>> python run.py -o test_obj_mimes -p vanvactor

Find matching objects missing a specific datastream

>>> python run.py -o find_missing -p vanvactor -ds PDF

Download FOXML for matching documents

>>> python run.py -o grab_foxml -p vanvactor

Find Books that are bad or aren't done processing

>>> python run.py -o find_bad_books -p vanvactor

Get the History of a Datastream

>>> python run.py -o get_history -ds MODS -p vanvactor

Get Version of Datastream at a Specfic Date

>>> python run.py -o get_datastream_at_date -ds MODS -p vanvactor -d 2017-09-01

Get All Versions of a Datastream

>>> python run.py -o get_all_versions_of_datastream -ds MODS -p vanvactor

Get Datastream Report

>>> python run.py -o get_datastream_report -p test

About

A simple Python library for mundane Fedora 3.8 tasks.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages