1.5 KiB
Document Merging
The use case in this directory generates new Non-Disclosure Agreement (NDA) based on several input ones that partially overlap in terms of their contents. We provide implementations of five different approaches:
- IO
- Chain-of-Thought (CoT)
- Tree of Thought (ToT)
- Graph of Thoughts (GoT):
- GoT: aggregation of fully merged NDAs
- GoT2: aggregation of partially merged NDAs
Data
We provide an input file with 50 samples: documents.csv.
Execution
The file to execute the use case is called
doc_merge.py. In the main body, one can
select the specific samples to be run (variable samples) and the
approaches (variable approaches). It is also possible to set a budget in
dollars (variable budget).
The Python scripts will create the directory result, if it is not
already present. In the result directory, another directory is created
for each run: {name of LLM}_{list of approaches}_{day}_{start time}.
Inside each execution specific directory two files (config.json,
log.log) and a separate directory for each selected approach are
created. config.json contains the configuration of the run: input data,
selected approaches, name of the LLM, and the budget. log.log contains
the prompts and responses of the LLM as well as additional debug data.
The approach directories contain a separate json file for every sample
and the file contains the Graph Reasoning State (GRS) for that sample.
Plot Data
Change the results directory in line 158 of plot.py and run python3 plot.py to plot your data.