-
Notifications
You must be signed in to change notification settings - Fork 8
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
sort files and separate factor with datagen (#111)
* sort files * update tools markdown doc * seperate factor and generate * add note
- Loading branch information
1 parent
7599200
commit 946fc48
Showing
117 changed files
with
281 additions
and
266 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
This file was deleted.
Oops, something went wrong.
This file was deleted.
Oops, something went wrong.
This file was deleted.
Oops, something went wrong.
This file was deleted.
Oops, something went wrong.
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,62 @@ | ||
# Tools | ||
|
||
Here lists some tools for graph data processing. | ||
- dataprofiler: a tool for profiling graph data, including degree distribution, etc. | ||
- graphgen: a simple tool/example code to generate power-law distributed graph data. | ||
- paramgen: a Parameter Search tool to generate parameters for queries using TuGraph. | ||
- paramgen: | ||
- parameter_curation: a tool for generating parameters for finbench queries | ||
- check_*.py: python scripts used for check the data features like consistency, distribution | ||
- merge_cluster_output.py: a python script to merge the output in cluster mode | ||
- statistic.py: a python script to calculate the statistics of the data | ||
- legacy: some legacy tools | ||
- dataprofiler: a tool for profiling graph data, including degree distribution, etc. | ||
- graphgen: a simple tool/example code to generate power-law distributed graph data. | ||
- factorgen: factor table generators in python version | ||
|
||
|
||
## ParamsGen | ||
|
||
`params_gen.py` uses the CREATE_VALIDATION feature to generate parameters. | ||
|
||
The specific steps are as follows: | ||
|
||
1. Select vertices of type Account, Person, and Loan from the dataset, and generate a parameter file that meets the input specifications for ldbc_finbench_driver. | ||
2. Execute CREATE_VALIDATION to generate validation_params.csv. | ||
3. Select non-empty results from validation_params.csv. | ||
|
||
Example: | ||
|
||
```bash | ||
python3 params_gen.py 1 # gen tcr1 params | ||
``` | ||
|
||
Other notes: | ||
|
||
1. The generated start_timestamp and end_timestamp in the current version are fixed values. | ||
2. For tcr4 and tcr10, this method is not efficient enough. Use the following Cypher query to search for parameters: | ||
|
||
```Cypher | ||
// tcr4 | ||
MATCH | ||
(n1:Account)-[:transfer]-> | ||
(n2:Account)-[:transfer]-> | ||
(n3:Account)-[:transfer]->(n4:Account) | ||
WHERE | ||
n1.id = n4.id AND n1.id > n2.id AND n2.id > n3.id | ||
WITH | ||
n1.id as n1id, | ||
n2.id as n2id, | ||
n3.id as n3id, | ||
n4.id as n4id | ||
LIMIT 1000 | ||
RETURN DISTINCT toString(n1id)+"|"+toString(n2id) | ||
// tcr10 | ||
MATCH | ||
(c:Company)<-[:invest]-(p:Person) | ||
WITH | ||
c.id as cid, | ||
count(p.id) as num, | ||
collect(p.id) as person | ||
WHERE num >= 2 | ||
RETURN | ||
tostring(person[0])+"|"+tostring(person[1]) | ||
LIMIT 1000 | ||
``` |
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
This file was deleted.
Oops, something went wrong.
Oops, something went wrong.