This document describes the setup of the "dpiprocessing" service account on the "libdpiprocessing.lib.umd.edu" server for use in the Plant Patents Ingest process.
- Git
- Docker
The "docker-ce" and "docker-compose" clients should be installed and usable by the "dpiprocessing" service account.
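As a quick check of these prerequisites, the following is a minimal sketch that reports whether each tool is on the PATH of the current user. The "check_tools" function name is hypothetical and not part of any existing script; run it after switching to the "dpiprocessing" account.

```shell
#!/bin/sh
# Hypothetical helper: print one line per tool and return non-zero
# if any of the named tools cannot be found on PATH.
check_tools() {
    missing=0
    for tool in "$@"; do
        if command -v "$tool" >/dev/null 2>&1; then
            echo "found: $tool"
        else
            echo "MISSING: $tool"
            missing=1
        fi
    done
    return "$missing"
}

# Run as the "dpiprocessing" user (after "su - dpiprocessing").
check_tools git docker docker-compose || echo "Install the missing tools before continuing."
```

This only confirms that the commands exist. Whether the account can actually reach the Docker daemon is a separate question; running "docker info" as that user is one way to check.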
- In the "dpiprocessing" service account, create a public/private keypair:
> su - dpiprocessing
> ssh-keygen -t rsa
(Press "Enter" to accept all the defaults. This creates the "~/.ssh/id_rsa" and "~/.ssh/id_rsa.pub" files.)
- Add the public key to the "lib-ssdr-service" account on GitHub.
- In the "dpiprocessing" service account home directory, edit the ".bashrc" file and add a "git_authors.sh" file, as outlined in https://umd-dit.atlassian.net/wiki/spaces/LIB/pages/38404219/Git+Service+Account+Commits
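The keypair step above can also be scripted so that it is safe to re-run. This is a sketch only: the "make_keypair" function name is hypothetical, and the empty passphrase passed with -N is an assumption that mirrors accepting the interactive defaults.

```shell
#!/bin/sh
# Hypothetical helper: create an RSA keypair at <dir>/id_rsa unless one
# already exists. The empty passphrase (-N "") is an assumption that
# matches pressing "Enter" at every ssh-keygen prompt.
make_keypair() {
    key_dir=$1
    key_file="$key_dir/id_rsa"
    if [ -f "$key_file" ]; then
        echo "exists: $key_file"
        return 0
    fi
    mkdir -p "$key_dir" && chmod 700 "$key_dir"
    ssh-keygen -q -t rsa -f "$key_file" -N "" && echo "created: $key_file"
}
```

Calling make_keypair "$HOME/.ssh" as the "dpiprocessing" user leaves an existing key untouched. The contents of "~/.ssh/id_rsa.pub" are what get added to the GitHub account.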
The following steps create this subdirectory hierarchy in the "dpiprocessing" service account home directory ("/home/dpiprocessing/"):
plant_patents/
|-- docker_mount/
|   |-- box/
|   |   |-- metadata (directory holding "patents_metadata.csv" from Box)
|   |   |-- PDFs (directory holding Plant Patents PDF scans from Box)
|   |
|   |-- solr-plant-patents/ (clone of the "solr-plant-patents" GitHub repository)
|-- plastron/ (directory to hold Plastron configuration file and logs)
- SSH to the "libdpiprocessing" server:
ssh libdpiprocessing.lib.umd.edu
- Switch to the "dpiprocessing" service account:
su - dpiprocessing
- Switch to the "dpiprocessing" service account home directory:
cd ~
- Construct the directories:
mkdir plant_patents
mkdir -p plant_patents/docker_mount/box/metadata
mkdir -p plant_patents/docker_mount/box/PDFs
mkdir -p plant_patents/plastron
- Switch to the "plant_patents/docker_mount" directory and clone the "umd-lib/solr-plant-patents" GitHub repository:
cd plant_patents/docker_mount
git clone git@github.com:umd-lib/solr-plant-patents.git
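The directory and clone steps above could be collected into a re-runnable script along these lines. This is a sketch under stated assumptions: the "create_layout" and "clone_if_missing" function names are hypothetical, and the repository URL is passed in as an argument rather than hard-coded.

```shell
#!/bin/sh
# Hypothetical helper: create the plant_patents directory layout under
# the given base directory. "mkdir -p" makes this safe to repeat.
create_layout() {
    base=$1
    mkdir -p "$base/plant_patents/docker_mount/box/metadata" \
             "$base/plant_patents/docker_mount/box/PDFs" \
             "$base/plant_patents/plastron"
}

# Hypothetical helper: clone <url> into <dest> unless <dest> already
# holds a Git working copy.
clone_if_missing() {
    url=$1
    dest=$2
    if [ -d "$dest/.git" ]; then
        echo "already cloned: $dest"
    else
        git clone "$url" "$dest"
    fi
}
```

As the "dpiprocessing" user, create_layout "$HOME" builds the directories, and clone_if_missing, given the repository's SSH URL and "$HOME/plant_patents/docker_mount/solr-plant-patents", performs the clone only on the first run.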