Skip to content

Latest commit

 

History

History
111 lines (58 loc) · 2.11 KB

README.md

File metadata and controls

111 lines (58 loc) · 2.11 KB

DRAX (DataCite Rest Api for Xml)

DRAX is a Python program for posting bulk DOIs metadata in XML format using DataCite REST API

If you provide XML files, it will convert them to JSON files and post with REST API


Getting Started

xml_post_parser.py is the program you will call to do the task

It will call two python classes:

  • class_xml_encode.py: encode XML files to JSON envelope

  • class_json_post.py: post JSON envelopes to server port


Usage

Check the configuration file xml_post_parser_config.json and start configure your program:


Important fields

"server_port":

the location you want to post your files to

"bearer_auth_key":

your Bearer Auth key to access DataCite REST API

"prefix":

namespace of the DOIs you are using

"schema_url":

schema for the DOIs

Folder names

"xml_folder":

the directory for your input XML files

"json_folder":

the directory for your JSON envelopes to be stored at

"processed_xml_folder":

the directory for your processed XML files

"posted_json_folder":

the directory for your successfully posted JSON envelopes

"post_failed_folder":

the directory for your JSON envelopes that failed to post



Then from the config file, use the value of "xml_folder" as the name to create a folder in the same directory

Store your xml files to be posted to the folder


Last, run xml_post_parser.py

python xml_post_parser.py

XML files will begin to be encoded into JSON files and then posted to the server port

Successfully posted files will be stored at "posted_json_folder"

Failed files will be stored at "post_failed_folder"


Additional file for testing

tests/test_server.py:

construct a dummy server at http://localhost:5000

you can post data to http://localhost:5000 and view the posted data on http://localhost:5000/data

Remember to change "server_port" in the config file to http://localhost:5000/



This work was carried out under the ITC STEM Internship Scheme from the The Hong Kong University of Science and Technology in collaboration with GigaScience - BGI.