Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A modular rewrite featuring numerous enhancements and bug fixes #3

Open
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

9ao9ai9ar
Copy link

@9ao9ai9ar 9ao9ai9ar commented Aug 6, 2024

The only downsides of my rewrite that I can think of are the higher memory usage (by about 50%) and increased dependencies.

9ao9ai9ar and others added 2 commits August 1, 2024 04:24
This commit also gives more consideration to rate limiting and fixes
some bugs like posts on meta sites weren't being backed up, as well as
the issue at mhdadk#1.
Filters are now baked in, following the official doc's recommendation.
A few CLI options are also added.
@mhdadk
Copy link
Owner

mhdadk commented Aug 22, 2024

Hey @9ao9ai9ar, thank you very much for this PR! It seems like you really cleaned up the code. Adding tests was an especially nice touch.

Before merging this PR, because it is quite large (49 files changed), I'd like to review it in more detail to make sure it is OK. Unfortunately, I'm not sure when I will have the time in the near future to do this. I'm going to keep this PR open until I do so.

Nevertheless, I wanted to write to you to give you some positive feedback on your work.

@9ao9ai9ar
Copy link
Author

Thanks for your words of encouragement. I will be pushing out minor commits in the meantime, like updating the README and upgrading dependencies (Pydantic v2.9.0 is just around the corner, promising to fix the high memory consumption).

I do have an idea to improve the output format, but I will withhold work on that until next year. The idea is to first move all non-body fields of the posts into the (non-standard but widely supported) YAML metadata section at the beginning of the Markdown files. Secondly, individual answers should probably be separated into individual files to keep them as close to the original as possible without modifying the Markdown content by inserting the section markers. Lastly, the licenses of the posts (currently there are 3 versions of CC in use) should be included and linked. I can open an issue if you're interested in further discussion.

@mhdadk
Copy link
Owner

mhdadk commented Aug 23, 2024

All these changes sounds good to me. I can have a look at them as well when you push them (and when I have some more time).

1. Fix filter.
2. Add `--no-meta` option.
3. Update README.
4. Other minor adjustments.
@9ao9ai9ar 9ao9ai9ar changed the title A complete rewrite, but keeping the format you've defined for now A modular rewrite bringing numerous enhancements and bug fixes Nov 8, 2024
@9ao9ai9ar 9ao9ai9ar changed the title A modular rewrite bringing numerous enhancements and bug fixes A modular rewrite featuring numerous enhancements and bug fixes Nov 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants