

manalshaikh/wpextractor

 
 


WP-Extractor - WordPress Blog Post Extractor in JSON Format

WPExtractor is a Python-based tool for building datasets for artificial intelligence projects. It collects the content of a WordPress blog so that the data can be used to train a bot.

Features

  • Automatically extracts all posts from a WordPress website within seconds.
  • Saves the data as a JSON file in the directory for you.
  • Easy-to-understand JSON format to make your life easier :D
  • Responsive developers. Just open an issue and we'll fix it for you :)
  • Graphical user interface option (in beta)
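
As a rough illustration of the save step, the sketch below writes a list of post records to a JSON file and reads it back. The field names (`title`, `link`, `content`), the file name `posts.json`, and the functions `save_posts` and `load_posts` are assumptions made for this example. The README does not document the tool's actual output schema.

```python
import json
from pathlib import Path


def save_posts(posts, path):
    """Write a list of post records to a UTF-8 JSON file and return its path."""
    path = Path(path)
    with path.open("w", encoding="utf-8") as fh:
        # ensure_ascii=False keeps non-English text readable in the file.
        json.dump(posts, fh, ensure_ascii=False, indent=2)
    return path


def load_posts(path):
    """Read the records back, e.g. when preparing a training dataset."""
    with Path(path).open(encoding="utf-8") as fh:
        return json.load(fh)


# Hypothetical records; the real tool may use different keys.
sample_posts = [
    {"title": "Hello world", "link": "https://example.com/hello", "content": "First post."},
    {"title": "Second post", "link": "https://example.com/second", "content": "More text."},
]
```

Calling `save_posts(sample_posts, "posts.json")` followed by `load_posts("posts.json")` should return the same two records.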

Usage

Run the script with the URL of the WordPress site:
python main.py -u https://csrockers.in
By default, this fetches the site's posts. To fetch pages instead, add the --pages flag:
python main.py -u https://fulltimehosting.net --pages
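
The README does not say how posts and pages are retrieved. One plausible approach, sketched here as an assumption rather than a description of `main.py`, is the WordPress REST API, which serves posts at `/wp-json/wp/v2/posts` and pages at `/wp-json/wp/v2/pages`, with at most 100 items per request. The names `endpoint_for`, `http_fetch_page`, `fetch_all`, and the `fetch_page` parameter are hypothetical.

```python
import json
import urllib.error
import urllib.parse
import urllib.request


def endpoint_for(base_url, pages=False):
    """Return the REST collection URL for posts (default) or pages."""
    kind = "pages" if pages else "posts"
    return base_url.rstrip("/") + "/wp-json/wp/v2/" + kind


def http_fetch_page(url, page, per_page=100):
    """Fetch one page of results; return [] once past the last page."""
    query = urllib.parse.urlencode({"per_page": per_page, "page": page})
    try:
        with urllib.request.urlopen(url + "?" + query, timeout=30) as resp:
            return json.load(resp)
    except urllib.error.HTTPError as err:
        # WordPress answers 400 when the page number is out of range.
        if err.code == 400:
            return []
        raise


def fetch_all(base_url, pages=False, fetch_page=http_fetch_page):
    """Collect every item by requesting pages until one comes back empty."""
    url = endpoint_for(base_url, pages)
    items = []
    page = 1
    while True:
        batch = fetch_page(url, page)
        if not batch:
            break
        items.extend(batch)
        page += 1
    return items
```

Under these assumptions, `fetch_all("https://csrockers.in")` would correspond to the default command and `fetch_all("https://fulltimehosting.net", pages=True)` to the `--pages` variant, provided the site exposes its REST API. Passing the fetcher in as a parameter keeps the pagination loop testable without network access.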

Credits

This project was forked from the original repository because Somil stopped working on it and refuses to transfer ownership.
