| Model | Features | Scores | Grid search | Precision-recall | Random Forest model | Sample tree | |
|---|---|---|---|---|---|---|---|
| without_aae | .txt | .txt | .tsv | .png | .pkl | .json | .svg |
| simple_aae | .txt | .txt | .tsv | .png | .pkl | .json | .svg |
| complex_aae | .txt | .txt | .tsv | .png | .pkl | .json | .svg |
| 3utr | .txt | .txt | .tsv | .png | .pkl | .json | .svg |
| 5utr | .txt | .txt | .tsv | .png | .pkl | .json | .svg |
| All models | .tar.gz | .tar.gz | .tar.gz | .tar.gz | .tar.gz | .tar.gz | .tar.gz |

For gnomAD and ClinVar, training and test sets have the format:

`chr pos ref alt transcript`

HGMD sets have the format:

`HGMD ID transcript`

Training and test sets were generated at the transcript level. Thus, the same variant can appear in both the training and test sets if it affects more than one transcript.
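
To see how often a variant is shared between the two sets, the gnomAD and ClinVar files can be grouped by variant. The following is a minimal sketch, not part of the released code. It assumes the files are whitespace-delimited with no header line, which the format description above does not confirm, and the names `Record`, `parse_records`, and `shared_variants` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple, Set, Tuple


class Record(NamedTuple):
    """One line of a gnomAD/ClinVar set: chr pos ref alt transcript."""
    chrom: str
    pos: int
    ref: str
    alt: str
    transcript: str


Variant = Tuple[str, int, str, str]


def parse_records(lines: Iterable[str]) -> List[Record]:
    """Parse 'chr pos ref alt transcript' lines.

    Assumption: fields are separated by whitespace. Lines that do not have
    exactly five fields, or whose position is not an integer (for example a
    header line), are skipped.
    """
    records = []
    for line in lines:
        fields = line.split()
        if len(fields) != 5 or not fields[1].isdigit():
            continue
        chrom, pos, ref, alt, transcript = fields
        records.append(Record(chrom, int(pos), ref, alt, transcript))
    return records


def shared_variants(
    train: Iterable[Record], test: Iterable[Record]
) -> Dict[Variant, Tuple[Set[str], Set[str]]]:
    """Return variants present in both sets, with their transcripts in each."""

    def by_variant(records: Iterable[Record]) -> Dict[Variant, Set[str]]:
        grouped: Dict[Variant, Set[str]] = defaultdict(set)
        for r in records:
            grouped[(r.chrom, r.pos, r.ref, r.alt)].add(r.transcript)
        return grouped

    train_map, test_map = by_variant(train), by_variant(test)
    common = train_map.keys() & test_map.keys()
    return {v: (train_map[v], test_map[v]) for v in common}
```

Passing the lines of a training file and a test file through `parse_records` and then `shared_variants` yields a dictionary keyed by `(chr, pos, ref, alt)`. Each value holds the transcripts under which that variant appears in the training set and in the test set.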
24 March 2021