Collect neutral citations for vt #1150

Open
grossir opened this issue Aug 31, 2024 · 2 comments

@grossir
Contributor

grossir commented Aug 31, 2024

They are inside the documents

vt example: https://www.vermontjudiciary.org/sites/default/files/documents/op24-015.pdf

vt_criminal example: https://www.vermontjudiciary.org/sites/default/files/documents/eo22-076_0.pdf

grossir added a commit to grossir/juriscraper that referenced this issue Sep 4, 2024
Implements freelawproject#1150

- implements extract_from_text to collect neutral citations
- updates test_ScraperExtractFromText
- updates scraper files, makes `vtsuperct_*` not inherit extract_from_text from `vt`
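A minimal sketch of what collecting a neutral citation from opinion text could look like, assuming the citation appears in the document as a year, the literal `VT`, and an opinion number (for example `2024 VT 15`). The function name, the regex, and the returned dictionary shape are hypothetical and are not taken from the commit; the actual `extract_from_text` implementation may differ.

```python
import re

# Assumed citation shape: four-digit year, "VT", opinion number.
# This pattern is an illustration, not the one used in juriscraper.
NEUTRAL_CITE = re.compile(
    r"\b(?P<volume>(?:19|20)\d{2})\s+VT\s+(?P<page>\d+)\b"
)


def find_neutral_citation(text: str) -> dict:
    """Return the first VT neutral citation found in `text`, or {}.

    The keys mirror the `volume` and `reporter` columns shown in the
    query output in this thread; `page` is an assumed name for the
    opinion number.
    """
    match = NEUTRAL_CITE.search(text)
    if match is None:
        return {}
    return {
        "volume": match.group("volume"),
        "reporter": "VT",
        "page": match.group("page"),
    }


# Hypothetical first-page text, not copied from the linked PDFs.
sample = "NOTICE: This opinion is subject to revision.\n2024 VT 15\nNo. 23-AP-001"
result = find_neutral_citation(sample)
```

Returning an empty dict when nothing matches keeps the caller from storing a partial citation for documents that lack one.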
@flooie
Contributor

flooie commented Sep 5, 2024

This should be taken care of, thanks @grossir

@flooie flooie closed this as completed Sep 5, 2024
@grossir
Contributor Author

grossir commented Nov 26, 2024

Ran the recently merged command

./manage.py update_from_text --courts juriscraper.opinions.united_states.state.vt --cluster-status Published --date-filed-gte 2017-01-01 --date-filed-lte 2024-01-01

From:

courtlistener=> select volume, count(*) from search_citation where reporter = 'VT' group by volume order by 1 desc
 volume | count 
--------+-------
   2024 |    13
   2017 |    43
   2016 |   143
   2015 |   149
   2014 |   131
....  more years ...

To:

 volume | count 
--------+-------
   2024 |    63
   2023 |    11
   2022 |    39
   2021 |   100
   2020 |    98
   2019 |    88
   2018 |   137
   2017 |   109
   2016 |   143
   2015 |   149
   2014 |   131
   2013 |   115
   2012 |   105
   2011 |   142
   2010 |   123
   2009 |   127
   2008 |   139
   2007 |   200
   2006 |   147
   2005 |   135
   2004 |   128
   2003 |   110


This is a gain of 589 citations for the years 2017 to 2024 (from 56 to 645). However, there are still big gaps in 2022 and 2023, and probably some missing records in the other years.

UPDATE: after filling the gaps and collecting 201 new opinions, we have the following citation counts for 2023 and 2022, for a total of 658 new VT citations:

   2023 |    58
   2022 |    61
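The totals quoted in this comment can be cross-checked with a short script. This is only an arithmetic sketch over the counts copied from the tables in this thread; the variable names are made up for illustration.

```python
# Counts per volume (year) for 2017 to 2024, copied from the tables above.
before = {2024: 13, 2017: 43}
after = {
    2024: 63, 2023: 11, 2022: 39, 2021: 100,
    2020: 98, 2019: 88, 2018: 137, 2017: 109,
}
# Counts after filling the 2022 and 2023 gaps (the UPDATE figures).
updated = {**after, 2023: 58, 2022: 61}

total_before = sum(before.values())    # 56
total_after = sum(after.values())      # 645
total_updated = sum(updated.values())  # 714

first_gain = total_after - total_before    # 589
final_gain = total_updated - total_before  # 658

print(first_gain, final_gain)
```

Both gains match the figures reported above: 589 after the first run and 658 after filling the gaps.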

@grossir grossir reopened this Nov 26, 2024
@grossir grossir self-assigned this Nov 26, 2024
@flooie flooie moved this to Feb 10 to Feb 21 in Case Law Sprint Jan 14, 2025