-
Notifications
You must be signed in to change notification settings - Fork 371
Commit
- Loading branch information
There are no files selected for viewing
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Large diffs are not rendered by default.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
|
||
# Live Streaming word-counter using PySpark | ||
|
||
I have taken the text file format of The Pride and Prejudice novel as an input for the word counter. This is done using live streaming. Other inputs like tweets from twitter and facebook can also be integrated in the code. It is made efficient by using spark which uses in-memory parallel processing. Thus, it can handle large streaming data and can deliver outputs faster than other methods. | ||
|