New Jersey Government News

Scrape news feeds from the New Jersey government

Punch List

Known bugs befor v1.0

GitHub Install
Encoding

Operation

Install

You can install the package using the following steps:

pip install using an admin prompt

pip uninstall NJGovNews
pip install -v git+https://github.com/TextCorpusLabs/NJGovNews.git

Run

You can run the package as follows:

 NJGovNews SITE -out FILE_OUT

The scraper currently supports the following SITEs:

The Department of the Treasury. I.E. NJGovNews treasury -out "c:/data/news/nj_treasury.csv"

Cache

This scraper uses requests-cache to improve performance. If you want to force a full reload of all the data, delete the file called 'SITE.cache.sqlite'. It will be in the same folder as the .csv the scraper created.

Development

Prerequisites

You can install the package for development using the following steps:

Note: You can replace steps 1-3 using the VSCode Git:Clone command

Download the project from GitHub
- Click the green "Code" button on the right. Select "Download Zip"
Remove zip protections by right-clicking on the file, selecting properties, and checking "security: unblock"
Unzip the folder. I recommend using the folder c:/repos/TextCorpusLabs/NJGovNews

Run pip's edit install using an admin prompt

pip uninstall NJGovNews
pip install -v -e c:/repos/TextCorpusLabs/NJGovNews

Install the nltk add-ons using an admin prompt

python -c "import nltk;nltk.download('punkt')"

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

New Jersey Government News

Punch List

Operation

Install

Run

Cache

Development

Prerequisites

About

Languages

License

TextCorpusLabs/NJGovNews

Folders and files

Latest commit

History

Repository files navigation

New Jersey Government News

Punch List

Operation

Install

Run

Cache

Development

Prerequisites

About

Topics

Resources

License

Stars

Watchers

Forks

Languages