KafNafParserPy

Description

Parser for KAF or NAF files in python. The documentation for all methods and API of this parser can be found at:

You can also take a look at this presentation on slideshare about this library.

Quick Installation

The KafNafParserPy is from Feb 10th available in the Python Package Index, so you can easily install it (and its dependencies), by running:

pip install KafNafParserPy

Installation

Clone the repository from github

git clone https://github.com/cltl/KafNafParserPy.git

You will need to have installed the lxml library for python (http://lxml.de/). Usually just by runningpip install --user lxml should be enough for getting lxml installed. In some cases there can be problems with the libraries libxml and libxslt. In this case (considering you have no root access for the machine), you can try to do the following:

wget http://xmlsoft.org/sources/libxml2-sources-2.7.7.tar.gz
gzip -dc libxml2-sources-2.7.7.tar.gz | tar xvf -
cd libxml2-2.7.7
./configure --prefix=/home/ruben/lib
make
make install
wget http://xmlsoft.org/sources/libxslt-1.1.26.tar.gz
gzip -dc libxslt-1.1.26.tar.gz | tar xvf -
cd libxslt-1.1.26
./configure --prefix=/home/ruben/lib --with-libxml-prefix=/home/ruben/lib
make
make install
PATH=$PATH:/home/ruben/lib/bin/
pip install --user lxml

Of course replace /home/ruben/lib by the folder where you want to install the libraries, and check the corresponding websites for newer versions of the libraries.

Usage

This library is a python module, that reads a KAF or NAF file and parses it. It basically parses one KAF/NAF file and allows to access to all the layers through different methods and functions. This is one example of usage:

python
>>> from KafNafParserPy import KafNafParser
>>> my_parser = KafNafParser('myfile.kaf')
>>> for token_obj in my_parser.get_tokens():
>>>     print 'Token id',token.get_id()
>>>     print 'Token text',token.get_text()
>>>
>>> for term_obj in my_parser.get_terms():
>>>    print 'Lemma',term_obj.get_lemma()
>>>    print 'Ids:',term_obj.get_span().get_span_ids()
>>>
>>> for prop in my_paser.get_properties():
>>>    print 'Id',prop.get_id()
>>>    for reference in prop.get_references():
>>>        for span_obj in reference: ##Iterator over Creference object
>>>            print 'span ids',span_obj.get_span_ids()

You can find some examples of usage of this parser in the subfolder examples.

Documentation

The documentation can be generated automatically by running:

epydoc --config documentation.cfg

This will call to the external program epydoc (http://epydoc.sourceforge.net/) with the provided configuration file, and will create the HTML documents for the API in the folder apidocs. As said before the already generated documentation can be seen at http://kyoto.let.vu.nl/~izquierdo/api/KafNafParserPy

Contact

Ruben Izquierdo Bevia
[email protected]
http://rubenizquierdobevia.com/
Vrije University of Amsterdam

License

Sofware distributed under GPL.v3, see LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 163 Commits
examples		examples
feature_extractor		feature_extractor
.gitignore		.gitignore
KafNafParserMod.py		KafNafParserMod.py
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
causal_data.py		causal_data.py
constituency_data.py		constituency_data.py
coreference_data.py		coreference_data.py
dependency_data.py		dependency_data.py
documentation.cfg		documentation.cfg
entity_data.py		entity_data.py
external_references_data.py		external_references_data.py
factuality_data.py		factuality_data.py
features_data.py		features_data.py
header_data.py		header_data.py
kaf_example.xml		kaf_example.xml
markable_data.py		markable_data.py
naf.dtd		naf.dtd
naf_example.xml		naf_example.xml
opinion_data.py		opinion_data.py
references_data.py		references_data.py
setup.py		setup.py
span_data.py		span_data.py
srl_data.py		srl_data.py
temporal_data.py		temporal_data.py
term_data.py		term_data.py
term_sentiment_data.py		term_sentiment_data.py
test.py		test.py
text_data.py		text_data.py
time_data.py		time_data.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KafNafParserPy

Description

Quick Installation

Installation

Usage

Documentation

Contact

License

About

Releases

Packages

Languages

License

amcat/KafNafParserPy

Folders and files

Latest commit

History

Repository files navigation

KafNafParserPy

Description

Quick Installation

Installation

Usage

Documentation

Contact

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages