SAPG: Semantically-Aware Paraphrase Generation with AMR Graphs

This repo contains the code for the paper SAPG: Semantically-Aware Paraphrase Generation with AMR Graphs, by Afonso Sousa & Henrique Lopes Cardoso (accepted at ICAART 2025).

Automatically generating paraphrases is crucial for various natural language processing tasks. Current approaches primarily try to control the surface form of generated paraphrases by resorting to syntactic graph structures. However, paraphrase generation is rooted in semantics, and almost no work leverages semantic structures as inductive biases for the task. We propose SAPG, a semantically-aware paraphrase generation model, which encodes Abstract Meaning Representation (AMR) graphs into a pretrained language model using a graph neural network-based encoder. We demonstrate that SAPG enables the generation of more diverse paraphrases by transforming the input AMR graphs, allowing for semantically grounded control over the surface form of the generated output. This approach ensures that the meaning is preserved, offering flexibility in paraphrase generation without sacrificing fluency or coherence. Our extensive evaluation on two widely-used paraphrase generation datasets confirms the effectiveness of this method.

Installation

First, create a fresh conda environment with all the required dependencies:

conda env create -f environment.yml
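
The name of the created environment is set by the `name:` field of environment.yml, which is not shown in this README. A minimal sketch for reading it, assuming the file has a top-level `name:` entry (the `env_name` helper is hypothetical and not part of this repo):

```shell
# Print the environment name declared in a conda environment file.
# Defaults to ./environment.yml when no path is given.
# Assumption: the file contains a top-level "name:" line.
env_name() {
  sed -n 's/^name:[[:space:]]*//p' "${1:-environment.yml}" | head -n 1
}
```

With conda's shell integration set up, `conda activate "$(env_name)"` should then activate the new environment.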

Preprocess data

SAPG requires AMR data. To extract it, run:

sh ./scripts/extract_amr.sh

Train and test models

To train or test SAPG, or any other model referred to in the paper, run the corresponding script. For example:

sh ./scripts/train_graph_amr.sh

sh ./scripts/test_graph_amr.sh
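
The scripts above can also be chained so that testing only starts once training has finished successfully. The following is a small sketch of that idea, not something shipped with this repo: the `run_steps` helper is hypothetical, and it assumes each script returns a non-zero exit status on failure.

```shell
# Run each script in order; stop at the first one that is missing or fails.
# run_steps is a hypothetical helper, not part of this repository.
run_steps() {
  for step in "$@"; do
    if [ ! -f "$step" ]; then
      echo "missing script: $step" >&2
      return 1
    fi
    echo "running $step"
    sh "$step" || { echo "failed: $step" >&2; return 1; }
  done
}

# Example invocation from the repository root (left commented out so that
# sourcing this snippet does not start a training run):
# run_steps ./scripts/train_graph_amr.sh ./scripts/test_graph_amr.sh
```

Stopping at the first failure avoids evaluating against a checkpoint from an incomplete training run.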
