2017.11.29
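- Building on the original author's work, added a POST endpoint that accepts the page content directly; see item 2 of the API (POST Extract) for details.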
A micro-service for extracting the main content from a URL.
-
Package
$ mvn clean package
-
Run
$ java -jar target/smart-extractor.jar
$ open http://localhost:8080
-
GET Extract
http://localhost:8080/extract?url={url}
$ curl -i -X GET http://localhost:8080/extract\?url\=https://medium.com/@benjaminhardy/8-things-every-person-should-do-before-8-a-m-cc0233e15c8d
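The same GET call can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_extract_url` and `fetch_extract` are hypothetical, and it assumes the service is listening on localhost:8080 as in the Run step. Percent-encoding the target URL keeps a query string of its own from being read as parameters of the service. The response format is not assumed here, so the body is returned as plain text.

```python
import urllib.parse
import urllib.request


def build_extract_url(base, page_url):
    """Return the GET endpoint URL with page_url percent-encoded as the url parameter."""
    query = urllib.parse.urlencode({"url": page_url})
    return base.rstrip("/") + "/extract?" + query


def fetch_extract(base, page_url, timeout=30):
    """Call the GET endpoint and return the raw response body as text."""
    request_url = build_extract_url(base, page_url)
    with urllib.request.urlopen(request_url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")
```

With the service running, `print(fetch_extract("http://localhost:8080", "https://www.baidu.com"))` should print whatever the endpoint returns for that page.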
-
POST Extract
http://localhost:8080/extract
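Python example code:
    import requests

    # Download the page, then hand its HTML to the service for extraction.
    url = "https://www.baidu.com"
    r = requests.get(url)
    payload = {"content": r.content}  # form field carrying the page HTML
    r_e = requests.post("http://localhost:8080/extract", data=payload)
    json_str = r_e.text
    print(json_str)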
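A variant of the POST call that needs no third-party package can be sketched with the standard library. It assumes, as in the example above, that the page HTML is sent as a form field named `content`; the helper names `build_post_request` and `post_extract` are hypothetical. The fields of the response are not documented here, so the sketch returns the body as raw text rather than parsing it.

```python
import urllib.parse
import urllib.request


def build_post_request(base, html):
    """Build a form-encoded POST request carrying the page HTML in the content field."""
    body = urllib.parse.urlencode({"content": html}).encode("ascii")
    # Supplying data makes urllib issue a POST with a form-urlencoded body.
    return urllib.request.Request(base.rstrip("/") + "/extract", data=body)


def post_extract(base, html, timeout=30):
    """Send the HTML to the POST endpoint and return the raw response body as text."""
    request = build_post_request(base, html)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")
```

Separating request construction from sending makes it possible to inspect the encoded body without a running server.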
-
Docker
$ mvn clean package
$ mvn package docker:build