Legal Document Precedent Extraction and Classification

This repository contains the code for the extraction and classification of precedents in legal documents. The project involves working with various types of legal cases such as 'Writ Petition,' 'Civil Appeal,' and 'Twenty_six.' The output for each case is available in the respective folders named 'precedents classified output.' Individual case outputs, corresponding to files like filename.xml or filename.txt, can be found in folders named 'filename output.txt.'

Project Structure:

Writ Petition
Civil Appeal
Twenty_six
precedents classified output
filename output.txt

Code Execution:

The main Python code file for preprocessing, extraction, and classification is Data Analysis.ipynb. Each block in the file is thoroughly commented with its functionality at the beginning.

Project Highlights:

Extracted and classified precedents and statutes from 150+ legal documents with a high accuracy of 90% using rule-based approaches.
Conducted sentiment analysis using nltk and spaCy libraries to categorize precedents into Relied, Distinguished, and Overruled.
Engineered a deep learning model incorporating Bert Tokenizer, bi-Directional LSTM Encoder-Decoder, Attention Pooling, and Conditional Random Field Layer with Adam Optimizer.
Achieved a test weighted F1 score of 82.61% using the Negative Log-Likelihood loss function on a Gold Dataset containing 2471 RR labeled cases.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.ipynb_checkpoints		.ipynb_checkpoints
Annotated Data_with xml and original PDF 2		Annotated Data_with xml and original PDF 2
Annotated Data_with xml and original PDF output		Annotated Data_with xml and original PDF output
Annotated Data_with xml and original PDF		Annotated Data_with xml and original PDF
Civil Appeal output		Civil Appeal output
Civil Appeal precedents classified output		Civil Appeal precedents classified output
Civil Appeal precedents output		Civil Appeal precedents output
Civil Appeal		Civil Appeal
Twenty_six output		Twenty_six output
Twenty_six		Twenty_six
Writ Petition output		Writ Petition output
Writ Petition precedents classified output		Writ Petition precedents classified output
Writ Petition precedents output		Writ Petition precedents output
Writ Petition		Writ Petition
abc 2		abc 2
abc		abc
abc_output		abc_output
new_case_json_100		new_case_json_100
new_case_json_100_output		new_case_json_100_output
.DS_Store		.DS_Store
220608.Legislation List_consolidated.xlsx		220608.Legislation List_consolidated.xlsx
220608.Precedents.Note.docx		220608.Precedents.Note.docx
220704.PrecedentDistinguished.pdf		220704.PrecedentDistinguished.pdf
Annotated sample data.txt		Annotated sample data.txt
Data Analysis.ipynb		Data Analysis.ipynb
Info.txt		Info.txt
LICENSE		LICENSE
Precedents not relied output.txt		Precedents not relied output.txt
Precedents output.txt		Precedents output.txt
Precedents relied output.txt		Precedents relied output.txt
README.md		README.md
abc_output.zip		abc_output.zip
case.xml		case.xml
correct_data.py		correct_data.py
first.cpp		first.cpp
jsonformatter.txt		jsonformatter.txt
modifier.py		modifier.py
newfile.txt		newfile.txt
output.txt		output.txt
regexTask.py		regexTask.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Legal Document Precedent Extraction and Classification

Project Structure:

Code Execution:

Project Highlights:

About

Releases

Packages

Languages

License

Devesh1602/Legal-Document-Precedent-Extraction-and-Classification

Folders and files

Latest commit

History

Repository files navigation

Legal Document Precedent Extraction and Classification

Project Structure:

Code Execution:

Project Highlights:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages