You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This seems to happen in the following pipeline (cf. https://github.com/acoli-repo/germhist, analyses/scrambling/Makefile, slightly modified version of make chlg.tsv: added tee commands)
cat $file | \
# split into columns WORD POS PARSE \
perl -pe 's/^(.*)\(([^\(\) ]+) +([^\(\)]+)\)/\3\t\2\t\1*/;' | \
# convert to CoNLL-RDF with tree extensions \
tee chlg.1.tmp| \
conll-rdf/run.sh CoNLLBrackets2RDF file:`realpath $file`'#' WORD POS PARSE | \
tee chlg.2.ttl| \
# extrapolate edge labels \
conll-rdf/run.sh CoNLLRDFUpdater -custom -updates enhgprep.sparql |\
tee chlg.3.ttl |\
conll-rdf/run.sh CoNLLRDFFormatter -query enhgquery.sparql | \
egrep '.' | grep -v '^#' | \
sed s/'$$'/'\t'$$c'\t'$$h'\t'$$genre'\t'$$region/|\
tee chlg.tsv
When tracking the progress of the script with ls -l chlg*, it seems that chlg3.ttl remains empty until chlg2.ttl is no longer written into. Maybe, CoNLLBrackets2RDF doesn't produce the right sentence delimiter symbol.
The same thing seems to happen between writing chlg.3.ttl and chlg.tsv (also checked with ls -l *).
The text was updated successfully, but these errors were encountered:
I can confirm that CoNLLRDFUpdater doesn't work in streaming mode. Looking at chldg.3.ttl, I can indeed confirm that this is processed en block, but not streamed: chldg.3.ttl is not grouped by sentences, but has random order from all over the the input sentence data.
This seems to happen in the following pipeline (cf. https://github.com/acoli-repo/germhist, analyses/scrambling/Makefile, slightly modified version of
make chlg.tsv
: addedtee
commands)ls -l chlg*
, it seems thatchlg3.ttl
remains empty untilchlg2.ttl
is no longer written into. Maybe,CoNLLBrackets2RDF
doesn't produce the right sentence delimiter symbol.ls -l *
).The text was updated successfully, but these errors were encountered: