kwic

Display the key words in context

Installation

git clone https://github.com/JeffreyMFarley/kwic.git
cd kwic
<set up your virtual python environment>
pip install -r requirements.txt
python -m spacy download en_core_web_sm

Pre-requistes

You will need a corpus.json that contains the document frequency for each lemma in English. See this commit for more details

Running the program

$ python kwic.py --help                                                 
usage: kwic [-h] [--top TOP] [--min-occ MIN_OCC]
            [--handle-newline {keep,ignore,space}]
            document

shows key words in context

optional arguments:
  -h, --help            show this help message and exit

Analysis:
  --top TOP             how many high scoring nouns and verbs are keywords
  --min-occ MIN_OCC     number of key words in an important sentence

Display:
  --handle-newline {keep,ignore,space}
                        decide how newlines should be handled

I/O:
  document              the file to analyze

Example

$ python kwic.py --handle-newline space sample_texts/moby-dick.txt

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
sample_texts		sample_texts
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
build_corpus.py		build_corpus.py
extract_coca_corpus.py		extract_coca_corpus.py
kwic.py		kwic.py
requirements.txt		requirements.txt
to_text.py		to_text.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

kwic

Installation

Pre-requistes

Running the program

Example

About

Releases

Packages

Languages

JeffreyMFarley/kwic

Folders and files

Latest commit

History

Repository files navigation

kwic

Installation

Pre-requistes

Running the program

Example

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages