Skip to content

pegasi-ai/fine-grained-editting

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

fine-grained-editting

made-with-python

Intro

This is a repo for our hallucination detection and editing work in the finance domain. This repo includes information on synthetic data generation for training and evaluating our fine-tuned model on FinQA+TATQA.

Overview

  1. Data Preparation
  2. Inference
  3. Detection Evaluation
  4. Editing Evaluation
## Data Preparation

Step 1: Error Insertion

cd data_preparation
python insert_errors.py \
--input_file {input_file_path} \
--output_file {output_file_path} \
--api_key {your_openai_key}

Step 2: Filtering and Correction

cd data_preparation
python verify_responses.py \
--input_file {input_file_path} \
--output_file {output_file_path} \

Step 3: Training Data Preparation

cd data_preparation
python convert_format.py \
--input_file {input_file_path} \
--output_file {output_file_path} \
## Inference

Step 1: Model Inference

cd evalution
python phi_4_inference.py \
--input_file {input_file_path} \
--output_file {output_file_path} \

Step 2: Postprocessing

cd evalution
python postprocess.py \
--input_file {input_file_path} \
--output_file {output_file_path} \

Evaluations

Step 1: Detection

cd evalution
python eval_detection.py \
--input_file {input_file_path} \
--output_file {output_file_path} \

Step 2: Editing

cd evalution
python eval_factscore.py \
--input_file {input_file_path} \

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages