To activate the conda environment for installing the requirements, run:
```shell
conda activate <env-name>
```
#### Deactivating the conda environment
After running all the experiments, deactivate the conda environment by running:
```shell
conda deactivate
```
The next step is to set the environment variables that contain the paths to the pre-trained model, saved models, and output directories.
Edit the [set_env.sh](set_env.sh) file (with `nano` or `vim`) and set the paths for:
- `PRE_TRAINED_SOLOIST` - Path to the extracted pre-trained SOLOIST model
- `SAVED_MODELS_BASELINE` - Path for saving the trained models at checkpoints
- `OUTPUTS_DIR_BASELINE` - Path for storing the outputs of belief state predictions
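For illustration, the edited set_env.sh would contain export lines along these lines (the paths below are placeholders, not actual locations from this repository):

```shell
# Illustrative contents of set_env.sh; all paths are placeholders,
# adjust them to your own setup
export PRE_TRAINED_SOLOIST=/data/models/pretrained-soloist
export SAVED_MODELS_BASELINE=/data/models/baseline-checkpoints
export OUTPUTS_DIR_BASELINE=/data/outputs/baseline
```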
Run the line below to unset the environment variables:
```shell
sh unset_env.sh
```
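For reference, unset_env.sh is expected to simply clear the variables that set_env.sh exports; a minimal sketch (not necessarily the actual file contents):

```shell
# Sketch of unset_env.sh: clear the variables exported by set_env.sh
unset PRE_TRAINED_SOLOIST
unset SAVED_MODELS_BASELINE
unset OUTPUTS_DIR_BASELINE
```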
## Baseline Experiments
SOLOIST ([Peng et al., 2021](https://arxiv.org/abs/2005.05298)), the baseline model for this thesis, is a task-oriented dialog system that uses transfer learning and machine teaching to build task bots at scale. SOLOIST follows the pre-train, fine-tune paradigm for building end-to-end dialog systems on top of GPT-2, a transformer-based auto-regressive language model. In the pre-training stage, SOLOIST is initialized with the 12-layer GPT-2 (117M parameters) and further trained on two task-oriented dialog corpora for the *belief state prediction* task. In the fine-tuning stage, the pre-trained SOLOIST is fine-tuned on the MultiWOZ 2.1 dataset to perform the belief state prediction task.
### Install the requirements
After following the environment setup steps in the previous [section](#environment-setup), install the required Python modules for baseline model training.
Change directory to `baseline` and install the requirements. Make sure the correct baseline conda environment is activated before installing.
```shell
cd baseline
pip install -r requirements.txt
```
### Train the baseline model
Train a separate model for each data split. Edit the [train_baseline.sh](baseline/train_baseline.sh) file to modify the training hyperparameters (learning rate, number of epochs). Use `CUDA_VISIBLE_DEVICES` to specify which CUDA device (GPU) to train on.
```shell
sh train_baseline.sh -d <data-split-name>
```
Pass the data split name to the `-d` flag. Possible values are: `50-dpd`, `100-dpd`, `125-dpd`, `250-dpd`
Example training command: `sh train_baseline.sh -d 50-dpd`
### Belief State Prediction
Choose a checkpoint of the saved baseline model to generate belief state predictions.
Set the `MODEL_CHECKPOINT` environment variable to the path of the chosen model checkpoint. The path should start from the `experiment-{datetime}` folder.
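For example (the experiment folder and checkpoint names below are illustrative placeholders, not actual checkpoints from this repository):

```shell
# Example: point MODEL_CHECKPOINT at a chosen checkpoint, starting
# from the experiment-{datetime} folder; names here are placeholders
export MODEL_CHECKPOINT=experiment-2022-05-01-10-30-00/checkpoint-4000
```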
The generated predictions are saved under the `OUTPUTS_DIR_BASELINE` folder. Some of the generated belief state predictions are uploaded to this repository and can be found under the [outputs](outputs) folder.
### Baseline Evaluation
The standard Joint Goal Accuracy (JGA) metric is used to evaluate the belief state predictions. For each turn, all predicted belief states are compared to the ground-truth states; the prediction is considered correct only if every predicted belief state, both slot and value, matches the ground truth.
Edit [evaluate.py](baseline/evaluate.py) to set the predictions output file before running the evaluation.