It's an effort to convert existing data from CEUR-WS.org to Linked Data dataset. The work is done in the framework of Semantic Publishing Challenge 2015 co-located with ESWC 2015 conference.
The goal is to extract information from a set of HTML tables of contents published in the CEUR-WS.org workshop proceedings. The extracted information is expected to answer queries about the quality of these workshops, for instance by measuring growth, longevity, etc.
The source code is available in ceur-ws-crawler folder.
The goal is to extract information from the textual content of the papers (in PDF). That information should provide a deeper understanding of the context in which it was written. In particular, the extracted information is expected to answer queries about authors’ affiliations and research institutions, research grants and funding bodies, and related works (papers presented in the same venue, addressing similar issues, etc.).
The source code is available in ceur-ws-pdfs folder.
The source code and all data is licensed under the MIT License.