DNABERT Fine-Tuning#

Zeeshan Siddiqui

Nov 7, 2023

6 min read

On this page, we will show and explain the use of DNABERT. As well as document the BioLM API for fine-tuning, demonstrate no-code and code interfaces.

Description#

The gene regulatory code is governed not only by individual DNA sequences, but also by intricate interactions between regulatory elements and other cellular components. Elucidating these complex relationships is imperative for deciphering genomic regulation. While substantial data exists for protein-coding regions, annotations for non-coding regulatory regions can be sparse, presenting modeling challenges. Furthermore, non-coding DNA may exhibit polysemy, with a single sequence associated with multiple functions, alongside distant semantic ties.

Standard bioinformatics tools often struggle to capture such intricacies, necessitating advanced computational methods to model the multidimensional connections within genomics data. As described by Li et al. (2021), The DNABERT model aims to address these needs by learning meaningful representations of non-coding DNA for predictive tasks. DNABERT implements the Transformer architecture utilized in BERT with 12 layers, 768 hidden units, and 12 attention heads. The same model topology and training methodology are consistently applied across DNABERT variants. Through easy fine-tuning, DNABERT (a pre-trained bidirectional encoder representation model) achieved state-of-the-art performance on diverse regulatory predictions (promoters, splice sites and transcription factor binding sites), highlighting the power of pretraining on the complex patterns within non-coding DNA. In addition, the researchers showed that DNABERT, originally pretrained on the human genome, achieved excellent performance when fine-tuned and applied to model non-human genomic sequences ( cross-organism transferability).

API Usage#

The endpoint to Finetune DNABERT Classifier: https://biolm.ai/api/v1/finetune_run/.

Making Requests#

curl --location 'https://biolm.ai/api/v1/finetune_run/' \
--header "Authorization: Token $BIOLMAI_TOKEN" \
--header 'Content-Type: application/json' \
--data '{
"pipeline": "finetune_DNABERT_classifier",
"hyperopt": false,
"input_json": {
    "max_train": 40000,
    "max_validate": 20000,
    "train": [{"seq":"CACAGCACAGCCCAGCCAAGCCAGGCCAGCCCAGCCCAGCCAAGCCACGCCACTCCACTACACTAGACTAGGCTAGGCTAGGCCAGGCCCGGCCCTGCCCTGCCCTGTCCTGTCCTGTCCTGTCCTGTCCTGTCCTGCCCTGCACTGCAGTGCAGCGCAGCCCAGCCCAGCCCCGCCCCCCCCCCTCCCCTGCCCTGTCCTGTACTGTAGTGTAGGGTAGGGTAGGGGAGGGGTGGGGTCGGGTCTGGTCTGGTCTGGTCTGGACTGGAATGGAACGGAACAGAACAGAACAGCACAGCCCAGCCAAGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTGGAGTGGAGTGGTGTGGTTTGGTTTGGTTTAGTTTAATTTAAGTTAAGATAAGAGAAGAGGAGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGGAGGGGAGGGGAGGGGAGTGGAGTCGAGTCGAGTCGCGTCGCCTCGCCTCGCCTTGCCTTGCCTTGCCTTGCCTTGCCCTGCCCTGCCCTGCCCTGTCCTGTGCTGTGCTGTGCCGTGCCATGCCACGCCACACCACAC","label":"non-promoter"},{"seq":"CTAATCTAATCTAATCTAATCTAGTCTAGTCTAGTATAGTAAAGTAATGTAATGTAATGCAATGCCATGCCGTGCCGCGCCGCGCCGCGTCGCGTTGCGTTGCGTTGGGTTGGTTTGGTGTGGTGGGGTGGAGTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACATGACATGACATGACATGACATGACATGACATGACATAACATACCATACCATACCTTACCTCACCTCACCTCAACTCAAATCAAACCAAACAAAACAGAACAGCACAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCCGCGCCACGCCATGCCATGCCATGCCATGCGATGCGCTGCGCCGCGCCACGCCAAGCCAAGCCAAGCCAAGCCAAGCCCAGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAGCAGAGGAGAGGGGAGGGTAGGGTTGGGTTGGGTTGTGTTGTCTTGTCCTGTCCAGTCCAATCCAACCCAACTCAACTCAACTCCACTCCTCTCCTATCCTATCCTATTCTATTCTATTCCATTCCT","label":"promoter"},{"seq":"GGAAGAGAAGAGAAGAGGAGAGGGGAGGGAAGGGAAGGGAAGGGAAGGGAAGGAAAGGAAAGGAAAGGAAATGAAATGAAATGCAATGCCATGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTCGGGTCCGGTCCCGTCCCATCCCAGCCCAGGCCAGGCCAGGCGAGGCGGGGCGGGGCGGGGCGGGGCGGGGCCGGGCCTGGCCTCGCCTCGCCTCGACTCGAGTCGAGCCGAGCGGAGCGTAGCGTGGCGTGCCGTGCCGTGCCCTGCCCAGCCCACCCCACGCCACGCCACGCCACGCCGCGCCGCGCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGCCTGGCCTGGCCAGGCCACGCCACGCCACGCCACGCCACGCCTCGCCTGGCCTGGCCTGGACTGGAGTGGAGTGGAGTTGAGTTGAGTTGCGTTGCATTGCAGTGCAGGGCAGGACAGGAAAGGAACGGAACCGAACCGAACCGGACCGGGCCGGGCCGGGCGGGGCGCGGCGCCGCGCCGCGCCGGGCCGGG","label":"promoter"},{"seq":"CGAAAGGAAAGCAAAGCAAAGCAAAGCAATGCAATCCAATCAAATCAGATCAGTTCAGTGCAGTGGAGTGGCGTGGCCTGGCCTGGCCTGGCCTGGCCTGGACTGGACTGGACCGGACCAGACCATACCATGCCATGTCATGTGATGTGTTGTGTAGTGTAGTGTAGTGTAGTATAGTATAGTATAGTATAGTATAGAATAGAGTAGAGAAGAGAGGAGAGCAGAGCAGAGCAAAGCAACGCAACACAACAGAACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCATGCCATCCCATCTCATCTAATCTATTCTATGCTATGCTATGCTATGCTTTGCTTAGCTTAACTTAATTTAATTTAATTTAATTTGATTTGGTTTGGCTTGGCATGGCAAGGCAACGCAACACAACATAACATTACATTACATTACATTACATTACATTACATGACATGTCATGTAATGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCGTCCCGGCCCGGACCGGAACGGAAAGGAAAAGAAAATAAAATCAAATCTAATCTTATCTTTTCTTTTCTTTTATTTTAA","label":"promoter"},{"seq":"TGACTCGACTCCACTCCCCTCCCATCCCAACCCAAACCAAACCAAACCAAACCAAACCAAACCAACCCAACACAACAAAACAAAACAAAACAAAAGAAAAGGAAAGGGAAGGGGAGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGTGGAGTTGAGTTCAGTTCAGTTCATTTCATCTCATCACATCACATCACCTCACCACACCACACCACTCCACTACACTAGACTAGACTAGACTAGACTAGACTTGACTTTACTTTCCTTTCCTTTCCTTTCCTTTCCTTACCTTATCTTATATTATAATATAAAATAAAATAAAAAAAAAAAAAAAACAAAACAAAACACAACACTACACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGAAGGGAGGGGAGTGGAGTGGAGTGCAGTGCTGTGCTTTGCTTAGCTTAACTTAAGTTAAGCTAAGCAAAGCAGAGCAGAGCAGAACAGAAAAGAAAGGAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAAAAAAATAAAATAAAATACAATACTATACTATACTAA","label":"promoter"},{"seq":"AAGCATAGCATGGCATGACATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAAGGAAAGAAAAGACAAGACTAGACTGGACTGGACTGGGCTGGGCTGGGCTGGGCTAGGCTAGGCTAGGCTAGGCTAGGCAAGGCACGGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACATCACACCACACCACACCTCACCTGACCTGTCCTGTACTGTAATGTAATGTAATCTAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACAAGACAGGACAGGACAGGCCAGGCAAGGCAGGGCAGAGCAGATCAGATGAGATGAGATGACATGACCTGACCTGACCTGACCTGACCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTG","label":"non-promoter"},{"seq":"GGCTTTGCTTTGCTTTGTTTTGTTTTGTTTTGTTTTGTTTTCTTTTCTTTTCTGTTCTGTTCTGTGCTGTGATGTGAGGTGAGTTGAGTTGAGTTAAGTTACGTTACGTTACGGTACGGGACGGGGCGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCATGCCACGCCACCCCACCTCACCTGACCTGCCCTGCACTGCAGTGCAGGGCAGGTCAGGTAAGGTAAGGTAAAGTAAAATAAAATAAAATCAAATCTAATCTGATCTGGTCTGGACTGGACTGGACAGGACATGACATTACATTGCATTGCATTGCCTTGCCCTGCCCTGCCCTGCCCTGACCTGAACTGAAATGAAATGAAATTAAATTGAATTGAATTGACTTGACCTGACCGGACCGAACCGAACCGAACCGAACCGAACCTAACCTTACCTTGCCTTGGCTTGGATTGGATTGGATAGGATACGATACAATACAATACAAAACAAACCAAACCAAACCCAACCCGACCCGGCCCGGCCCGGCCCGGCCTGGCCTGGCCTGACCTGACCTGACATGACAGGACAGTACAGTG","label":"promoter"},{"seq":"TCACCGCACCGTACCGTTCCGTTACGTTACGTTACTTTACTGTACTGCACTGCCCTGCCTTGCCTCGCCTCCCCTCCTCTCCTATCCTAGCCTAGTCTAGTGTAGTGGAGTGGCGTGGCGTGGCGGGGCGGAGCGGATCGGATAGGATACGATACGATACGGTACGGCACGGCGCGGCGGGGCGGCGCGGCACGGCAAGGCAATGCAATACAATAGAATAGTATAGTGTAGTGGAGTGGCGTGGCGTGGCGCGGCGCAGCGCACCGCACAGCACATCACATTACATTCCATTCAATTCAATTCAAGTCAAGGCAAGGCAAGGCAAGGCAGGGCAGGGCAGGACAGGAAAGGAAGGGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGAGAAGAGGAGAGGAGAGGAAAGGAACGGAACAGAACAGAACAGAACAGAGCAGAGCAGAGCCGAGCCAAGCCACGCCACCCCACCACACCAGACCAGCCCAGCACAGCAGAGCAGGGCAGGTCAGGTTAGGTTTGGTTTGGTTTGGTTTGGCTTGGCCTGGCCCGGCCCAGCCCAGCCCAGTCCAGTG","label":"promoter"},{"seq":"AGAAAAGAAAACAAAACAAAACAAAACAAAACAAAACAAAAGAAAAGCAAAGCTAAGCTCAGCTCCGCTCCGCTCCGGTCCGGACCGGAGCGGAGTGGAGTAGAGTAGAGTAGGGTAGGATAGGAAAGGAAAGGAAAGGAAAGTAAAGTGAAGTGAAGTGACGTGACATGACACGACACAACACAGCACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCACGCCACCCCACCTCACCTCACCTCCCCTCCCCTCCCGTCCCGGCCCGGTCCGGTACGGTAGGGTAGCGTAGCCTAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCGGGCCGGGCCGGCCCGGCCCGGCCAGGCCAAGCCAAGCCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCTTGGCTTGGCTTAGCTTAACTTAAGTTAAGCTAAGCGAAGCGGAGCGGGGCGGGCCGGGCCGGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCCTGGCCTGGCCTGGCCTGCCCTGCCCTGCCATGCCAAGCCAAACCAAAA","label":"promoter"},{"seq":"AAGTAGAGTAGAGTAGAGTAGAGGAGAGGCGAGGCCAGGCCTGGCCTCGCCTCCCCTCCTCTCCTGTCCTGCCCTGCTCTGCTTTGCTTCGCTTCACTTCAGTTCAGGTCAGGGCAGGGAAGGGAAGGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGAGTAGAGCAGAGCCGAGCCGAGCCGGGCCGGTCCGGTGCGGTGTGGTGTCGTGTCTTGTCTCGTCTCGTCTCGCCTCGCATCGCACCGCACCGCACCACACCAGACCAGACCAGAGCAGAGCAGAGCCGAGCCCAGCCCCGCCCCACCCCAGCCCAGACCAGATCAGATGAGATGGGATGGAATGGAATGGAACGGAACTGAACTCAACTCTACTCTGCTCTGTTCTGTCCTGTCCTGTCCCGTCCCATCCCATCCCATTCCATTCCATTCAATTCACTTCACATCACATCACATTACATTACATTAAATTAATTTAATTTAATTGAATTGAATTGAATTGAATTGAATCGAATCCAATCCAATCCAGTCCAGTCCAGTACAGTACAGTACTGTACTTTACTTTACTTTGCTTTGA","label":"non-promoter"},{"seq":"GGTGCAGTGCAATGCAAGGCAAGGCAAGGAAAGGAAAGGAATGGAATGGAATGAAATGAAATGAAGTGAAGCGAAGCCAAGCCAAGCCAAGCCAATCCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAACTCAATTCAATCCAATCAAATCATATCATCTCATCGCATCGAATCGAGTCGAGGCGAGGCGAGGCTAGGCTAGGCTACGCTACCCTACCCTACCCTACCCTGCCCTGCCCTGCCCTGCCATGCCATGCCATCCCATCTCATCTTATCTTGTCTTGTCTTGTGTTGTGGTGTGGCGTGGCCTGGCCAGGCCATGCCATGCCATGTCATGTGATGTGATGTGAGGTGAGGTGAGGGGAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACGACACGTCACGTGACGTGTCGTGTAGTGTAGTGTAGAGTAGATTAGATCAGATCAGATCAAATCAATTCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAGCTCAGCTCAGCACAGCATAGCATCGCATCACATCACATCACA","label":"promoter"},{"seq":"AGAGACGAGACTAGACTGGACTGGACTGGCCTGGCATGGCAAGGCAAGGCAAGGCAAGGAAAGGACAGGACAGGACAGGACAGGACAGGCCAGGCTAGGCTCGGCTCGGCTCGCCTCGCCTCGCCCCGCCCTGCCCTTCCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTAGTGTAGAGTAGAGTAGAGCAGAGCCGAGCCTAGCCTCGCCTCGCCTCGCCTCGCATCGCATCGCATTGCATTGCATTGGATTGGCTTGGCCTGGCCAGGCCACGCCACCCCACCACACCAGACCAGGCCAGGACAGGAGAGGAGGGGAGGCGAGGCAAGGCAGGGCAGTGCAGTGCAGTGTAGTGTTGTGTTGTGTTGTGTTGTCTTGTCTTGTCTGGTCTGCTCTGCCCTGCCTTGCCTCGCCTCTCCTCTCCTCTCGTCTCGACTCGAATCGAACCGAACTGAACTTAACTTGACTTGGCTTGGCTTGGCTTGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAACCCAAGCCAAGGCAAGGTAAGGTGAGGTGAGGTGAGGTGAGATGAGAAGAGAAG","label":"promoter"},{"seq":"TGGCGAGGCGACGCGACCCGACCCGACCCCACCCCACCCCAACCCAACCCAACCCAACCTAACCTGACCTGCCCTGCCCTGCCCTGCCCTGCCCTTCCCTTGCCTTGCCTTGCTTTGCTTTGCTTCGCTTCGCTTCGGTTCGGATCGGACCGGACAGGACACGACACTACACTGCACTGCACTGCACTGCAGTGCAGCGCAGCACAGCACAGCACCGCACCCCACCCAACCCAACCCAATCCAATGCAATGGAATGGCATGGCGTGGCGCGGCGCCGCGCCCCGCCCAGCCCAGCCCAGACCAGAACAGAACAGAACCGAACCCAACCCGACCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCCCTGCCGTGCCGCGCCGCGCCGCGGCGCGGGGCGGGCCGGGCAGGGCAGGGCAGTGCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCCAGCCCGGCCCGGCCCGGGCCGGGACGGGATGGGATAGGATAGGATAGCATAGCGTAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCACCCCAA","label":"non-promoter"},{"seq":"CTGTGTTGTGTAGTGTATTGTATAGTATATTATATCATATCTTATCTGATCTGTTCTGTACTGTAATGTAAAGTAAAGTAAAGTAAAGTTAAGTTAAGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGTCCAGTCCAGTCAAGTCAAGTCAACTCAACGCAACGCAACGCTACGCTACGCTAGGCTAGGCTAGGGTAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACTACACTCCACTCTACTCTGCTCTGCTCTGCACTGCAATGCAACGCAACACAACACAACACTACACTCCACTCTACTCTACTCTAGTCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCCTGGCCTGGCCTTGCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTATTGTATAGTATATTATATAATATATTATATGATATGGTATGGCATGGCATGGCAGGGCAGAGCAGAACAGAAAAGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAAAAGAAAGGAAAGTAAAGTAAAGTAAAGTAAAGTAAAT","label":"non-promoter"},{"seq":"CTATATTATATTATATTTTATTTGATTTGGTTTGGATTGGACTGGACAGGACAAGACAATACAATCCAATCGAATCGCATCGCCTCGCCGCGCCGTGCCGTGCCGTGACGTGATGTGATTTGATTAGATTAAATTAAATTAAACTAAACGAAACGAAACGAGACGAGTCGAGTGGAGTGTAGTGTAGTGTATTGTATGGTATGATATGAAATGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGTGGCGTCGCGTCTCGTCTGGTCTGATCTGAACTGAAGTGAAGCGAAGCTAAGCTAAGCTAGGCTAGGCTAGGGTAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTACGCTAGGCTAGACTAGATTAGATAAGATAAGATAAAATAAACTAAACAAAACACAACACTACACTGCACTGAACTGATCTGATTTGATTTGATTTCATTTCCTTTCCCTTCCCCTCCCCTCCCCTTCCCTTTCCTTTACTTTAGTTTAGGTTAGGGTAGGGAAGGGAAGGGAAAGGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAATAGAATG","label":"promoter"},{"seq":"AACTGCACTGCACTGCAGTGCAGGGCAGGACAGGATAGGATGGGATGCGATGCTATGCTCTGCTCTGCTCTTCTCTTGTCTTGGCTTGGATTGGAGTGGAGTGGAGTTGAGTTCAGTTCTGTTCTGTTCTGGTCTGGTCTGGTCTGGTCTGGTCTAGTCTACTCTACTCTACTCTACTCTACTCTGCTCTGCTCTGCGCTGCGATGCGATGCGATGCGATGCGATGCTATGCTTTGCTTGGCTTGTCTTGTTTTGTTTTGTTTGGTTTGCTTTGCATTGCAATGCAAAGCAAAACAAAACAAAACCAAACCCAACCCTACCCTGCCCTGTCCTGTCCTGTCATGTCATGTCATGTCATGACATGAGATGAGATGAGAAGAGAAGAGAAGGGAAGGTAAGGTCAGGTCCGGTCCAGTCCACTCCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGCATGGCATGGCAGGGCAGGGCAGGTCAGGTTAGGTTTGGTTTCGTTTCATTTCAGTTCAGCTCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCTGTGCTATGCTAC","label":"promoter"},{"seq":"GCTTTCCTTTCCTTTCCTTTCCTGTCCTGGCCTGGCCTGGCCTGGCCCGGCCCCGCCCCCCCCCCACCCCAACCCAAGCCAAGACAAGAGAAGAGTAGAGTGGAGTGCAGTGCAGTGCAGTGCAGGGCAGGGCAGGGAAGGGATGGGATGGGATGCGATGCCATGCCCTGCCCAGCCCAGCCCAGGCCAGGTCAGGTCAGGTCTGGTCTGGTCTGCTCTGCACTGCAATGCAACGCAACCCAACCAAACCACACCACCCCACCACACCACACCACTCCACTGCACTGGACTGGGCTGGGTTGGGTGGGGTGGGGTGGCGTGGCTTGGCTGGGCTGCGCTGCACTGCAGTGCAGCGCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCCGTGCCCTGCCCAGCCCAGCCCAGGCCAGGACAGGAGAGGAGGGGAGGGGAGGGTAGGGTGGGGTGGGGTGGGGTGGGATGGGACGGGACTGGACTCGACTCCACTCCTCTCCTGTCCTGCCCTGCCCTGCCCTGCCCCGCCCCACCCCACCCCACCCCACCACACCAAACCAACCCAACTCAACTC","label":"non-promoter"},{"seq":"TGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCCGTGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTG","label":"non-promoter"},{"seq":"GGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGAAGGGAAGGGAAGTGAAGTGAAGTGCAGTGCAGTGCATTGCATCGCATCCCATCCCATCCCTTCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAAAGAAACGAAACAAAACATAACATGACATGGCATGGGATGGGTTGGGTAGGGTAAGGTAAGGTAAGCTAAGCGAAGCGTAGCGTGGCGTGTCGTGTGGTGTGCTGTGCAGTGCAGTGCAGAGCAGACCAGACGAGACGTGACGTGACGTGGCGTGGAGTGGAGTGGAGAGGAGAGGAGAGGAGAGGGGAGGGCAGGGCGGGGCGTGGCGTGGCGTGGCGTGGGGTGGGGTGGGGTGGGGTGGGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGACGGGACGGGACGCGACGCCACGCCCCGCCCAGCCCACCCCACCCCACCCCACCCCACCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAA","label":"non-promoter"},{"seq":"AAGTTTAGTTTTGTTTTTTTTTTCTTTTCCTTTCCATTCCACTCCACCCCACCTCACCTGACCTGCCCTGCCCTGCCATGCCACGCCACTCCACTTCACTTCACTTCACTTCACTTCACATCACAACACAATACAATGCAATGAAATGACATGACCTGACCCGACCCTACCCTCCCCTCCCCTCCACTCCAGTCCAGCCCAGCGCAGCGCAGCGCCGCGCCCCGCCCTGCCCTCCCCTCTCCTCTACTCTACTCTACTCTACTGTACTGGACTGGCCTGGCATGGCAGGGCAGAGCAGAGCAGAGAAGAGACGAGACTAGACTAGACTAGACTAGCCTAGCATAGCATAGCATCGCATCACATCAAATCAAGTCAAGCCAAGCCAAGCCAAGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGCCTGGCATGGCAAGGCAAAGCAAACCAAACCAAACCAAACCAGACCAGACCAGAGCAGAGGAGAGGCGAGGCGAGGCGTGGCGTCGCGTCCCGTCCTGTCCTTTCCTTTCCTTTACTTTAATTTAAGTTAAGGTAAGGTAAGGTCAGGTCC","label":"promoter"},{"seq":"TTTTTTTTTTTTTTTTTGTTTTGCTTTGCGTTGCGGTGCGGGGCGGGGCGGGGCGGGGCGGGGCGCGGCGCAGCGCAGCGCAGTGCAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGTCCTGTACTGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACGAGACGGGACGGGACGGGCCGGGCAGGGCAGGGCAGAGCAGATCAGATCAGATCAGATCACATCACGTCACGACACGAGACGAGGCGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGAGGAGATGAGATCAGATCGGATCGAATCGAGTCGAGACGAGACGAGACTAGACTAGACTATACTATCCTATCCTATCCTATCCTGTCCTGGCCTGGCCTGGCTTGGCTAGGCTAA","label":"non-promoter"},{"seq":"CCGCCTCGCCTTGCCTTCCCTTCCCTTCCCTTCCCTTCCCTCCCCTCTCCTCTGCTCTGTTCTGTTCTGTTTTGTTTTGTTTTTTTTTTGTTTTGGTTTGGCTTGGCATGGCATGGCATAGCATAACATAAGATAAGATAAGAAAAGAAAAGAAACGAAACAAAACAAAACAATACAATTCAATTCAATTCAATTCAGTTCAGGTCAGGTCAGGTTAGGTTTGGTTTAGTTTATTTTATCTTATCATATCAAATCAAGTCAAGGCAAGGAAAGGAGAGGAGAGGAGAGGAGAGTAGAGTCGAGTCCAGTCCAGTCCAGTCCAGGCCAGGGCAGGGTAGGGTCGGGTCAGGTCAGGTCAGATCAGAACAGAATAGAATTGAATTTAATTTTATTTTTTTTTTCTTTTCTTTTCTATTCTAATCTAACCTAACCTAACCAAACCACACCACCCCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCAGCGCAGCGCAGAGCAGAGCAGAGCAGAGCAGAGCAAAGCAAGGCAAGCCAAGCTAAGCTTAGCTTA","label":"promoter"},{"seq":"ACAAAACAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAGAAAAGCAAAGCGAAGCGGAGCGGGGCGGGACGGGAAGGGAAGGGAAGCGAAGCAAAGCAGAGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGGCTTGGGTTGGGTTGGGTTGGGTTAGGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGCCCAGCACAGCACAGCACAGCACACCACACCACACCCCACCCAACCCACCCCACCCCACCACACCAGACCAGACCAGAGCAGAGGAGAGGGGAGGGCAGGGCAGGGCAGGGCAGCGCAGCACAGCAGAGCAGAGCAGACCAGACAAGACACGACACTACACTGCACTGGACTGGCCTGGCTTGGCTAGGCTAAGCTAAACTAAAGTAAAGCAAAGCTAAGCTCAGCTCTGCTCTTCTCTTATCTTAGCTTAGTTTAGTCTAGTCAAGTCATGTCATATCATAACATAAGATAAGTTAAGTCAAGTCCAGTCCTGTCCTGTCCTGACCTGAGCTGAGTTGAGTGGAGTGCAGTGCT","label":"promoter"},{"seq":"ACTGGACTGGAATGGAAAGGAAAAGAAAATAAAATTAAATTTAATTTTATTTTATTTTAATTTAAATTAAATTAAATGAAATGAAATGAAATGAATTGAATGGAATGAAATGATATGATGTGATGTGATGTGATGTGATGTGATGTGATTTGATTCGATTCTATTCTGTTCTGTTCTGTGCTGTGGTGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCTCTCCTGTCCTGTCCTGTGCTGTGGTGTGGGGTGGGCTGGGCAGGGCAGGGCAGCGCAGCACAGCACAGCACTGCACTGCACTGGACTGGCCTGGCCTGGCCTGGCCTGGCCTGACCTGAACTGAAGTGAAGCGAAGCAAAGCACAGCACAGCACAACACAAAACAAACCAAACCAAACCTAACCTGACCTGGCCTGGACTGGAGTGGAGCGGAGCCGAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGCCCAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAG","label":"promoter"},{"seq":"AGAGCTGAGCTGAGCTGTGCTGTCCTGTCTTGTCTGGTCTGCTCTGCTCTGCTGTGCTGGGCTGGGCTGGGGTGGGGGGGGGGCGGGGCAGGGCAGGGCAGGGCAGGGCAGGGCAGGGCGGGGCGCGGCGCTGCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTGTTCTGGTCTGGGCTGGGCTGGGCAGGGCACGGCACTGCACTGCACTGTACTGTACTGTAGTGTAGGGTAGGATAGGATAGGATGGGATGTGATGTTATGTTATGTTAGGTTAGCTTAGCATAGCAGAGCAGCGCAGCGCAGCGAAGCGACGCGACCCGACCCGACCCTACCCTGCCCTGGCCTGGCCTGGCCTGGCCTGGCCTCGCCTCTCCTCTACTCTACTCTACCCTACCATACCACACCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGCGATGCCATGCCATGCCAGGCCAGTCCAGTACAGTAGAGTAGCGTAGCATAGCACAGCACCGCACCCCACCCTACCCTCCCCTCCCCTCCTCTCCTCTCCTCTCCTCTCCTCTCCTCTCCACTCCAG","label":"promoter"},{"seq":"GCTTTGCTTTGTTTTGTTTTGTTATGTTACGTTACATTACAGTACAGGACAGGTCAGGTGAGGTGTGGTGTCGTGTCTTGTCTGGTCTGTTCTGTTCTGTTATGTTAAGTTAACTTAACATAACATAACATTACATTCCATTCCATTCCATTCCATTCCATGCCATGGCATGGAATGGACTGGACCGGACCAGACCAAACCAAACCAAAACAAAACAAAACAAAACAAAACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAAGCCAAACCAAACCAAACCAAACCCAACCCAACCCAACCCAAACCAAAACAAAATAAAATCAAATCAAATCAAATCAAGTCAAGGCAAGGGAAGGGAAGGGACGGGACAGGACAGGACAGGACAGGACAGGAAAGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGACTAGACTAGACTCGACTCCACTCCACTCCACTCCACCCCACCCCACCCAACCCATCCCATGCCATGCCATGCAATGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGAGTGGAATGGAAGGGAAGGGAAGGGAAGGGGAGGGGA","label":"non-promoter"},{"seq":"GGTGATGTGATGTGATGCGATGCTATGCTATGCTACGCTACACTACAGTACAGGACAGGGCAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCTGCCCTCCCCTCACCTCATCTCATATCATAGCATAGGATAGGATAGGACAGGACAGGACAGGACAGGACAGGTCAGGTGAGGTGCGGTGCTGTGCTCTGCTCAGCTCACCTCACCTCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGAGCGGAGCGGAGGGGAGGAGAGGACAGGACAGGACAAGACAACACAACCCAACCCAACCCGACCCGTCCCGTCCCGTCCCGTCCGGTCCGGTCCGGGCCGGGCCGGGCTGGGCTGGGCTGGGCTGGACTGGAGTGGAGCGGAGCAGAGCAGAGCAGGGCAGGTCAGGTCAGGTCAGGTCAAGTCAAGTCAAGACAAGAGAAGAGGAGAGGCGAGGCTAGGCTCGGCTCTGCTCTGCTCTGGTCTGGGCTGGGATGGGAGGGGAGAGGAGACGAGACAAGACACGACACTACACTTCACTTCACTTCC","label":"non-promoter"},{"seq":"GGGGCAGGGCAGGGCAGGGCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCCAAGCCCAGCCCAGCCCATCCCATCCCATCTCATCTAATCTACTCTACACTACAATACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAGGCCAGTCCAGTGCAGTGGAGTGGCGTGGCTTGGCTTGGCTTTGCTTTTCTTTTCTTTTCCTTTCCCTTCCCCTCCCCCCCCCCACCCCAACCCAACCCAACCCAACCCAACCCAACCCAGCCCAGTCCAGTCCAGTCCAGTCCTGTCCTTTCCTTCCCTTCCCTTCCCTTCCCATCCCAACCCAAACCAAATCAAATTAAATTCAATTCCATTCCCTTCCCATCCCACCCCACACCACAGCACAGCACAGCCCAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGACCAGATCAGATCAGATCCGATCCCATCCCTTCCCTGCCCTGCCCTGCACTGCAATGCAACGCAACCCAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGCCCCGCTCCGCTT","label":"non-promoter"},{"seq":"ATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTCGTTTCCTTTCCGTTCCGTTCCGTGCCGTGGCGTGGGGTGGGATGGGAGGGGAGAGGAGAGGAGAGGAGAGGTGAGGTAAGGTAAGGTAACGTAACATAACACAACACAACACAACACAATACAATACAATAGAATAGCATAGCTTAGCTTAGCTTGGCTTGTCTTGTATTGTATTGTATCGTATCATATCAGATCAGTTCAGTCCAGTCAAGTCATGTCATTTCATTACATTACATTACCTTACCATACCACACCACTCCACTTCACTTGACTTGACTTGAGTTGAGTTGAGTGGAGTGTAGTGTGGTGTGATGTGAAGTGAAGTGAAGCGAAGCAAAGCAGAGCAGTGCAGTTCAGTTAAGTTAGGTTAGTTTAGTCTAGTCAAGTCAAGTCAAATCAAAGCAAAGTAAAGTCAAGTCTAGTCTGGTCTGGTCTGGGCTGGGATGGGAGGGGAGTGGAGTGGAGTGAAGTGAAGTGAATTGAATGGAATGAAATGAGATGAGATGAGAGGAGAGTAGAGTAGAGTAGAGTAGAGTAGAA","label":"non-promoter"},{"seq":"AACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAAAAAAAAAAAAAACAAAACAAAACACAACACAACACAGCACAGCACAGCACAGCAAAGCAAAGCAAACCAAACCAAACCTAACCTGACCTGTCCTGTACTGTATTGTATGGTATGTTATGTTATGTTGTGTTGTGTTGTCTTGTCCTGTCCCGTCCCTTCCCTTCCCTTCCCTTCCCTTCCATTCCAGTCCAGGCCAGGTCAGGTCAGGTCCGGTCCCGTCCCCTCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGGCTGGGCTGGGCAGGGCATGGCATTGCATTTCATTTGATTTGCTTTGCATTGCAGTGCAGAGCAGAACAGAACAGAACCGAACCGAACCGCACCGCACCGCAGCGCAGCGCAGCACAGCATAGCATCGCATCCCATCCCATCCCATCCCAGCCCAGACCAGATCAGATCAGATCAGATCACATCACTTCACTCCACTCGACTCGTCTCGTTTCGTTACGTTAAGTTAAATTAAAATAAAAAAAAAAAAAAAATAAAATT","label":"promoter"},{"seq":"TCCTGACCTGATCTGATATGATAAGATAAAATAAACTAAACCAAACCCAACCCAACCCATCCCATGCCATGGCATGGGATGGGATGGGATGGGATCGGATCTGATCTCATCTCATCTCATCTCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATTGAATTAAATTATATTATTTTATTCTATTCAATTCATTTCATTTCATTACATTATATTATCTTATCATATCATATCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATAGAATAGAATAGTATAGTATAGTATAGTATGGTATGGTATGGGATGGGATGGGAAGGGAAAGGAAAGGAAAGAAAAGACAAGACCAGACCAGACCAGACCAGTCCAGTCCAGTCCAGTCCCGTCCCCTCCCCACCCCATCCCATGCCATGACATGATATGATTTGATTCGATTCAATTCAATTCAATTCAATTCAATTAAATTACATTACCTTACCTTACCTCACCTCCCCTCCCCTCCCCTCCCCCCCCCCTCCCCTGCCCTGGCCTGGGCTGGGTTGGGTCGGGTCCGGTCCCGTCCCT","label":"non-promoter"},{"seq":"GTCATCTCATCGCATCGTATCGTATCGTAGCGTAGTGTAGTATAGTACAGTACTGTACTATACTACACTACACTACATTACATTACATTTCATTTTATTTTATTTTAATTTAAATTAAACTAAACAAAACATAACATGACATGTCATGTAATGTAATGTAAAGTAAAGTAAAGAAAAGAGAAGAGCAGAGCTGAGCTCAGCTCAGCTCAGCTCAGTTCAGTGCAGTGGAGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCACTCCAATCCAAGCCAAGACAAGAAAAGAAGAGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGACAAGACTAGACTTGACTTTACTTTGCTTTGGTTTGGTTTGGTATGGTAGGGTAGAGTAGAGTAGAGAAGAGACGAGACGAGACGGGACGGCACGGCCCGGCCGGGCCGCGCCGCTCCGCTTCGCTTGGCTTGCCTTGCTTTGCTCTGCTCCGCTCCCCTCCCATCCCAACCCAAACCAAATCAAATAAAATATAATATCATATCATATCATATCATGTCATGCCATGCTATGCTGTGCTGA","label":"non-promoter"},{"seq":"ACTGCGCTGCGCTGCGCGGCGCGCCGCGCCGCGCCGCGCCGAGCCGACCCGACGCGACGGGACGGTACGGTGCGGTGGGGTGGGGTGGGCTGGGCTGGGCTGGGCTGGGCTGGCCTGGCGTGGCGGGGCGGGGCGGGACGGGACGGGACCGGACCAGACCAGACCAGGCCAGGACAGGACAGGACAGGACAGGACAGGACAGGACAGGAAAGGAACGGAACAGAACAAAACAATACAATGCAATGGAATGGGATGGGATGGGATGGGATTGGATTCGATTCCATTCCGTTCCGATCCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGCGCCGCACCGCAACGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTCCGCTCCGCTCCTCTCCTTTCCTTCCCTTCTCTTCTGTTCTGCTCTGCGCTGCGGTGCGGGGCGGGTCGGGTTGGGTTGGGTTGGGTTGGGTTGGGGTGGGGTGGGGTGGGGTGCGGTGCGGTGCGATGCGAGGCGAGGCGAGGCGAGGCCAGGCCGGGCCGGGCCGGA","label":"promoter"},{"seq":"TGTGCTGTGCTGTGCTGAGCTGATCTGATGTGATGCGATGCCATGCCTTGCCTGGCCTGTCCTGTGCTGTGGTGTGGTGTGGTTTGGTTTGGTTTGGTTTGGTTTGGTTTGGTGTGGTGGGGTGGGGTGGGGTGGGGCGGGGCTGGGCTAGGCTACGCTACACTACAATACAACACAACACAACAGAACAGGACAGGACAGGAAAGGAAAGGAAATGAAATTAAATTCAATTCCATTCCTTTCCTGTCCTGCCCTGCTCTGCTTTGCTTTGCTTTGCTTTGGTTTGGATTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACAGGACAGAACAGAACAGAAAAGAAAGGAAAGCAAAGCAAAGCAGAGCAGAGCAGATCAGATAAGATAGGATAGCATAGCCTAGCCAAGCCAAGCCAAACCAAATCAAATTAAATTCAATTCTATTCTCTTCTCTTCTCTCCTCTCTTCTCTACTCTACTCTACCCTACCATACCACACCACACCACATCACATTACATTTCATTTTATTTTGTTTTGGTTTGGATTGGAATGGAAAGGAAACGAAACT","label":"non-promoter"},{"seq":"GGTTCCGTTCCCTTCCCGTCCCGCCCCGCTCCGCTTCGCTTCGCTTCCCTTCCATTCCACTCCACCCCACCGCACCGAACCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGAGCCGACCCGACTCGACTGGACTGCACTGCGCTGCGATGCGAGGCGAGGCGAGGTGAGGTGAGGTGCGGTGCAGTGCATTGCATGGCATGCCATGCTATGCTGTGCTGGGCTGGGCTGGGATGGGAGGGGAGTGGAGTCGAGTCGAGTCGTGTCGTATCGTAGCGTAGTGTAGTATAGTACAGTACCGTACCGTACCGCACCGCACCGCACCGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCGGGGCGGGGCGGAGCGGAACGGAACGGAACAGAACAGAACAGCACAGCTCAGCTCAGCTCCGCTCCGCTCCGCTCCGCCCCGCCCCGCCCCGCCCCGCCCCGGCCCGGCCCGGCGCGGCGGGGCGGAGCGGATCGGATGGGATGGGATGGTATGGTGTGGTGTGGTGTTGTGTTTTGTTTCGTTTCCTTTCCATTCCAGTCCAGA","label":"non-promoter"},{"seq":"GCCCGGCCCGGGCCGGGACGGGAGGGGAGCGGAGCGGAGCGTAGCGTCGCGTCGCGTCGCGTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCCCTCCCCTCCCCCCCCCCGCCCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCCCCGCCCCGCCCCGCCCCGCCTCGCCTCGCCTCGCCTCGGCTCGGGTCGGGGCGGGGAGGGGACGGGACTGGACTCGACTCGACTCGTCTCGTCTCGTCCCGTCCCGTCCCTTCCCTCCCCTCCCCTCCACTCCACTCCACACCACAGCACAGCACAGCCCAGCCCAGCCCCGCCCCTCCCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGTCCCGTCCCGTCGCGTCGGGTCGGATCGGAACGGAATGGAATTGAATTCAATTCGATTCGCTTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCGCTCCGCTCCGCCCCGCCGCGCCGTGCCGTTCCGTTCCGTTCTGTTCTT","label":"non-promoter"},{"seq":"CTGGCTTGGCTGGGCTGCGCTGCTCTGCTCTGCTCCGCTCCTCTCCTTTCCTTACCTTACCTTACATTACAATACAAAACAAACCAAACCAAACCTAACCTGACCTGTCCTGTGCTGTGGTGTGGAGTGGAGTGGAGTGGAGTTGAGTTGAGTTGGGTTGGATTGGACTGGACTGGACTTGACTTGACTTGCCTTGCTTTGCTGTGCTGTGCTGTTCTGTTTTGTTTTGTTTTTTTTTTCTTTTCCTTTCCTTTCCTCTCCTCTCCTCTTCTCTTGTCTTGCCTTGCCTTGCCATGCCACGCCACTCCACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGTAGGGTAGGGTAGGGTAGAGTAGAATAGAATAGAATAGAATATAATATGATATGATATGAAATGAAATGAAAAGAAAAGAAAAGAAAAGAAAAGAAGAGAAGAGAAGATAAGATTAGATTAGATTAGATTAGCTTAGCATAGCATAGCATGGCATGTCATGTTATGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATCTCATCCCATCCTATCCTTTCCTTGCCTTGC","label":"promoter"},{"seq":"CCCTGCCCTGCTCTGCTATGCTACGCTACACTACAGTACAGTACAGTTCAGTTTAGTTTTGTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTGTTTTGATTTGATTTGATATGATATGATATAATATACTATACAATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTTGTTTTGTTTTGCTTTGCCTTGCCATGCCAGGCCAGCCCAGCTCAGCTTAGCTTTGCTTTTCTTTTTTTTTTTTTTTTCTTTTCATTTCACTTCACTTCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGTATGGTGTGGTGCGGTGCTGTGCTATGCTAAGCTAATCTAATTTAATTCAATTCCATTCCCTTCCCTTCCCTTCCCTTTCCTTTACTTTAGTTTAGTTTAGTGTAGTGAAGTGACGTGACCTGACCTGACCTAACCTAGCCTAGGCTAGGCTAGGCTAGGCTGGGCTGTGCTGTTCTGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATATCATACCATACGATACGATACGATACGATGCGATGTGATGTTATGTTTTGTTTA","label":"promoter"}],
    "validation": [{"seq":"GTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGGGTGGGCTGGGCCGGGCCTGGCCTGGCCTGTCCTGTACTGTAGTGTAGCGTAGCATAGCAGAGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACCGCACCTCACCTGACCTGACCTGAGCTGAGGTGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGCAGGGCTGGGCTGGGCTGGGCTGGCCTGGCATGGCAGGGCAGCGCAGCCCAGCCCAGCCCCGCCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTG","label":"non-promoter"},{"seq":"GTGTGGTGTGGGGTGGGATGGGATGGGATCGGATCAGATCATATCATGTCATGTCATGTAATGTATTGTATCGTATCATATCAGATCAGTTCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGACTTGACTTGACCTGACCTGACCTCACCTCCCCTCCTCTCCTGTCCTGGCCTGGGCTGGGCTGGGCTGGGCTCGGCTCAGCTCAACTCAAGTCAAGCCAAGCAAAGCATAGCATTGCATTCCATTCTATTCTTTTCTTCTCTTCCCTTCCCTTCCCATCCCACCCCACCCCACCTCACCTCACCTCACCTCAACTCAACTCAACCCAACCTAACCTCACCTCTCCTCTTCTCTTGTCTTGACTTGAGTTGAGTTGAGTAGAGTAGAGTAGCGTAGCTTAGCTGAGCTGAGCTGAACTGAAATGAAATGAAATTAAATTAAATTACATTACATTACAGTACAGGACAGGACAGGAAAGGAACGGAACAGAACATAACATGACATGCCATGCCATGCCATGCCACGCCACCCCACCACACCACACCACA","label":"non-promoter"},{"seq":"CCCTGCCCTGCACTGCATTGCATGGCATGCCATGCCATGCCATGCCACGCCACACCACATCACATAACATAGCATAGCATAGCATAGCAAAGCAAGGCAAGGCAAGGTAAGGTGAGGTGCGGTGCTGTGCTGTGCTGGGCTGGGCTGGGTTGGGTCGGGTCAGGTCACGTCACTTCACTGCACTGAACTGATCTGATGTGATGCGATGCTATGCTATGCTAAGCTAACCTAACATAACATAACATCACATCTCATCTAATCTAATCTAAACTAAACTAAACAAAACAGAACAGGACAGGGCAGGGGAGGGGCGGGGCCGGGCCAGGCCAGGCCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGTGCGGTGCGGTGGGGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACTTCACTCCACTCTACTCTTCTCTTGTCTTGACTTGAATTGAAATGAAATGAAATCAAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGC","label":"non-promoter"},{"seq":"TTGGAGTGGAGCGGAGCAGAGCAAAGCAAGGCAAGGCAAGGCAAGGCTAGGCTAGGCTATGCTATGCTATGCTATGCAATGCACTGCACCGCACCACACCATACCATACCATACCATACAATACATTACATGACATGCCATGCTATGCTCTGCTCTGCTCTGCTCTGATCTGAGCTGAGTTGAGTGGAGTGGAGTGGGGTGGGCTGGGCTGGGCTTGGCTTGGCTTGACTTGATTTGATTTGATTCGATTCCATTCCTTTCCTCTCCTCCCCTCCACTCCAGTCCAGGCCAGGGCAGGGAAGGGAAGGGAAGGGAAGAGAAGAGAAGAGGAGAGGCGAGGCCAGGCCAGGCCAGGCCAGGCCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCAAAGCATAGCATTGCATTGCATTGAATTGATTTGATGTGATGTGATGTGATGTGATGTGAAGTGAAATGAAAAGAAAACAAAACAAAACAGAACAGCACAGCCCAGCCTAGCCTTGCCTTTCCTTTCCTTTCCTTTCCCTTCCCTTCCCTTCCCTTGCCTTGCCTTGCCTTGCCATGCCAT","label":"non-promoter"},{"seq":"AGCACAGCACAGCACAGGACAGGGCAGGGCAGGGCAGGGCACGGCACTGCACTGCACTGGACTGGTCTGGTGTGGTGGGGTGGAGTGGAGTGGAGGGGAGGGGAGGGAAGGGAGGGGAGCGGAGCCGAGCCCAGCCCTGCCCTGCCCTGCCCTGCGCTGCGGTGCGGGGCGGGGCGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAACTCAAGTCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGATCGGATGGGATGGGATGGGATGGGTTGGGTGGGGTGTGGTGTGGTGTGATGTGAGGTGAGATGAGAGGAGAGGAGAGGCGAGGCAAGGCACGGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCTGGGCTGGGCTGAGCTGAACTGAAGTGAAGCGAAGCAAAGCAGAGCAGCGCAGCACAGCATAGCATCGCATCTCATCTGATCTGGTCTGGGCTGGGTTGGGTTGGGTTTGGTTTGGTTTGATTTGAGTTGAGGTGAGGAGAGGAA","label":"non-promoter"},{"seq":"AGGCCAGGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGGTGGGGTGGGGTCGGGTCAGGTCAAGTCAAGTCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAGCCCAGCCCAGCACAGCACAGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGTGTGGTTTGGTTCGGTTCTGTTCTGTTCTGCTCTGCTCTGCTCTGCTCCGCTCCACTCCAGTCCAGACCAGAGCAGAGGAGAGGTGAGGTGAGGTGCGGTGCAGTGCAGTGCAGTGCAGTCCAGTCAAGTCAGGTCAGATCAGACCAGACTAGACTGGACTGCACTGCCCTGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTTGGGTTGGGTTGGGTTGGCTTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTCGGGTCTGGTCTGGTCTGTTCTGTCCTGTCATGTCATGTCATT","label":"non-promoter"},{"seq":"GTGCGATGCGAGGCGAGACGAGATGAGATGAGATGAGATGACATGACGTGACGCGACGCAACGCACCGCACTGCACTTCACTTCACTTCCCTTCCTTTCCTGTCCTGCCCTGCCCTGCCTTGCCTGGCCTGACCTGAGCTGAGGTGAGGCGAGGCGAGGCGGGGCGGCGCGGCCCGGCCGGGCCGCGCCGCTCCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTCTTCTCGTCTCGCCTCGCGTCGCGGCGCGGCGCGGCTCGGCTTGGCTTCGCTTCCCTTCCGTTCCGGTCCGGCCCGGCACGGCAGGGCAGGGCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCTGCGCTGCGCTGAGCTGAGCTGAGATGAGACGAGACCAGACCAGACCACACCACGCCACGGCACGGGACGGGACGGGAAGGGAAGGGAAGCGAAGCCAAGCCAAGCCAGGCCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCTTGGCTGGGCTGTGCTGTCCTGTCGTGTCGGGTCGGTTCGGTTCGGTTAGGTTAGGTTAGCTTAGCC","label":"promoter"},{"seq":"GTTCTTTTCTTGTCTTGGCTTGGATTGGATTGGATCGGATCAGATCACATCACATCACACCACACTACACTCCACTCGACTCGACTCGAGTCGAGGCGAGGAGAGGAAAGGAAAGGAAAGGAAAGCAAAGCTAAGCTCAGCTCCGCTCCACTCCAGTCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGCTGGGCCGGGCCCGGCCCAGCCCAGCCCAGACCAGATCAGATTAGATTTGATTTGATTTGGTTTGGGTTGGGGTGGGGCGGGGCTGGGCTTGGCTTCGCTTCTCTTCTGTTCTGTTCTGTCCTGTCCTGTCCTGTCCTGTCCTGACCTGAACTGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGCGGCGCTGCGCTGCGCTGGGCTGGCCTGGCTTGGCTCGGCTCCGCTCCTCTCCTGTCCTGGCCTGGTCTGGTGTGGTGTGGTGTGGTGTGATGTGAAGTGAATTGAATGGAATGGAATGGGATGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCAGCCCAGCCCAGGCCAGGGCAGGGCAGGGCTGGGCTG","label":"non-promoter"},{"seq":"GGCCAGGCCAGGCCAGGGCAGGGGAGGGGAGGGGACGGGACCGGACCAGACCAGACCAGGCCAGGCCAGGCTAGGCTGGGCTGGGCTGGGCTGGGATGGGAGGGGAGAGGAGAGGAGAGCAGAGCTGAGCTGAGCTGCGCTGCCCTGCCATGCCAAGCCAACCCAACCCAACCGAACCGCACCGCACCGCACCGCACCGCACCTCACCTGACCTGTCCTGTGCTGTGATGTGAAGTGAAGTGAAGGGAAGGAAAGGAAAGGAATGGAATGGAATGGAATGGTATGGTCTGGTCAGGTCAGGTCAGGTCAGGACAGGAAAGGAACGGAACCGAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCATCCCACCCCACCCCACCCCACCCTACCCTGCCCTGGCCTGGGCTGGGATGGGATGGGATGGGATGCGATGCAATGCATTGCATTGCATTCCATTCCATTCCTTTCCTGTCCTGGCCTGGCCTGGCTTGGCTTGGCTTTGCTTTTCTTTTATTTTACTTTACCTTACCATACCAGACCAGTCCAGTTCAGTTAAGTTATGTTATTTTATTC","label":"non-promoter"},{"seq":"ATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGAGCCGAGCCGAGGCGAGGTGAGGTGAGGTGGGGTGGGGTGGGTTGGGTGGGGTGGGGTGGAGTGGATTGGATCGGATCAGATCATATCATCTCATCTCATCTGATCTGATCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTGGTTTGATTTGAATTGAAATGAAACGAAACCAAACCAAACCAGACCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCAGGCCAAGCCAACCCAACACAACATAACATGACATGGCATGGCATGGCATGGCAAGGCAAAGCAAAACAAAACAAAACCAAACCCAACCCCACCCCGCCCCGTCCCGTCCCGTCTCGTCTCGTCTCTTCTCTACTCTACTCTACTCTACTATACTAAACTAAACTAAAATAAAAAAAAAATAAAATAAAATAC","label":"non-promoter"},{"seq":"CCGCCACGCCAGGCCAGGCCAGGCCAGGCTAGGCTCGGCTCCGCTCCTCTCCTCTCCTCTCCTCTGCTCTGCTCTGCACTGCAGTGCAGCGCAGCGCAGCGCAGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGCCGCGCAGCGCAGCGCAGCGCAGCGCAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCGCCTCGCCTCGCGTCGCGGCGCGGTGCGGTACGGTAGGGTAGGGTAGGCTAGGCGAGGCGCGGCGCGGCGCGGCGCGGAGCGGAGCGGAGGGGAGGAGAGGAAAGGAAGGGAAGCGAAGCGAAGCGGAGCGGCGCGGCCCGGCCAGGCCACGCCACACCACAGCACAGGACAGGGCAGGGCAGGGCTGGGCTGGGCTGCGCTGCCCTGCCGTGCCGCGCCGCCCCGCCCCGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAACAGAATAGAATCGAATCGAATCGCATCGCATCGCAGCGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACGGCACGACACGATACGATC","label":"promoter"},{"seq":"CGCAGTGCAGTGCAGTGGAGTGGTGTGGTCTGGTCTGGTCTTGTCTTGTCTTGGCTTGGCTTGGCATGGCAGGGCAGCGCAGCTCAGCTGAGCTGCGCTGCCCTGCCATGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTAGAGTAGAGTAGGGTAGGTTAGGTAAGGTAGGGTAGTGTAGTGTAGTGCAGTGCCGTGCCTTGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGACCGGACCGGACCCGACCCTACCCTCCCCTCGCCTCGCCTCGCTTCGCTACGCTAGGCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCATGGCACGGCACTGCACTACACTATACTATCCTATCATATCACATCACATCACAACACAAAACAAAGCAAAGGAAAGGAAAGGACAGGACAGGACAAGACAAAACAAAGCAAAGCAAAGCTAAGCTGAGCTGGGCTGGTCTGGTTTGGTTGGGTTGAGTTGAATTGAAGTGAAGCGAAGCTAAGCTAAGCTACGCTACACTACAGTACAGAACAGAACAGAAAAGAAATGAAATCAAATCCAATCCC","label":"promoter"},{"seq":"TACCGGACCGGACCGGAGCGGAGAGGAGACGAGACCAGACCGGACCGCACCGCACCGCACCGCACTGCACTGCACTGAACTGAACTGAAGTGAAGAGAAGACAAGACTAGACTGGACTGTACTGTTCTGTTTTGTTTTGTTTTATTTTAGTTTAGATTAGAGTAGAGTAGAGTTGAGTTGAGTTGAGTTGACTTGACTTGACTGGACTGAACTGACCTGACATGACAGGACAGTACAGTGCAGTGGAGTGGCGTGGCATGGCAGGGCAGCGCAGCGCAGCGAAGCGATGCGATTCGATTCGATTCTATTCTCTTCTCCTCTCCTCTCCTGTCCTGTCCTGTCCTGTCTTGTCTCGTCTCCTCTCCACTCCAGTCCAGCCCAGCCCAGCCCAGCCCTGCCCTCCCCTCACCTCAGCTCAGCTCAGCACAGCAGAGCAGTGCAGTGCAGTGTAGTGTCGTGTCCTGTCCCGTCCCTTCCCTTCCCTTTCCTTTGCTTTGGTTTGGGTTGGGCTGGGCAGGGCACGGCACCGCACCCCACCCAACCCAGCCCAGCCCAGCCCAGCCCAGCCCCGCCCCA","label":"non-promoter"},{"seq":"CAGAATAGAATCGAATCGAATCGCATCGCATCGCAACGCAAGGCAAGACAAGAAAAGAATAGAATCGAATCAAATCATATCATGTCATGCCATGCAATGCAGTGCAGAGCAGAGCAGAGCAGAGCGGAGCGAAGCGACGCGACCCGACCTGACCTGACCTGACCTGATCTGATTTGATTTGATTTAATTTACTTTACGTTACGCTACGCTACGCTTCGCTTCGCTTCACTTCACTTCACCTCACCTCACCTAACCTAGCCTAGACTAGATTAGATTAGATTGGATTGAATTGACTTGACTTGACTTGACTTTACTTTTCTTTTTTTTTTATTTTATTTTATTTTATTCTATTCTATTCTGTTCTGCTCTGCACTGCATTGCATCGCATCGCATCGTATCGTTTCGTTGCGTTGTGTTGTGTTGTGTTGTGTTGTGTTCTGTTCTGTTCTTTTCTTCTCTTCCCTTCCCTTCCCCTCCCCCCCCCCACCCCACCCCACTCCACTTCACTTCACTTCCCTTCCTTTCCTCTCCTCTCCTCTTCTCTTCTCTTCTCTTCTTTTCTTGTCTTGCCTTGCT","label":"non-promoter"}],
    "epochs": 1
}
}'
import requests
import json

url = "https://biolm.ai/api/v1/finetune_run/"

payload = json.dumps({
"pipeline": "finetune_DNABERT_classifier",
"hyperopt": False,
"input_json": {
    "max_train": 40000,
    "max_validate": 20000,
    "train": [
    {
        "seq": "CACAGCACAGCCCAGCCAAGCCAGGCCAGCCCAGCCCAGCCAAGCCACGCCACTCCACTACACTAGACTAGGCTAGGCTAGGCCAGGCCCGGCCCTGCCCTGCCCTGTCCTGTCCTGTCCTGTCCTGTCCTGTCCTGCCCTGCACTGCAGTGCAGCGCAGCCCAGCCCAGCCCCGCCCCCCCCCCTCCCCTGCCCTGTCCTGTACTGTAGTGTAGGGTAGGGTAGGGGAGGGGTGGGGTCGGGTCTGGTCTGGTCTGGTCTGGACTGGAATGGAACGGAACAGAACAGAACAGCACAGCCCAGCCAAGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTGGAGTGGAGTGGTGTGGTTTGGTTTGGTTTAGTTTAATTTAAGTTAAGATAAGAGAAGAGGAGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGGAGGGGAGGGGAGGGGAGTGGAGTCGAGTCGAGTCGCGTCGCCTCGCCTCGCCTTGCCTTGCCTTGCCTTGCCTTGCCCTGCCCTGCCCTGCCCTGTCCTGTGCTGTGCTGTGCCGTGCCATGCCACGCCACACCACAC",
        "label": "non-promoter"
    },
    {
        "seq": "CTAATCTAATCTAATCTAATCTAGTCTAGTCTAGTATAGTAAAGTAATGTAATGTAATGCAATGCCATGCCGTGCCGCGCCGCGCCGCGTCGCGTTGCGTTGCGTTGGGTTGGTTTGGTGTGGTGGGGTGGAGTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACATGACATGACATGACATGACATGACATGACATGACATAACATACCATACCATACCTTACCTCACCTCACCTCAACTCAAATCAAACCAAACAAAACAGAACAGCACAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCCGCGCCACGCCATGCCATGCCATGCCATGCGATGCGCTGCGCCGCGCCACGCCAAGCCAAGCCAAGCCAAGCCAAGCCCAGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAGCAGAGGAGAGGGGAGGGTAGGGTTGGGTTGGGTTGTGTTGTCTTGTCCTGTCCAGTCCAATCCAACCCAACTCAACTCAACTCCACTCCTCTCCTATCCTATCCTATTCTATTCTATTCCATTCCT",
        "label": "promoter"
    },
    {
        "seq": "GGAAGAGAAGAGAAGAGGAGAGGGGAGGGAAGGGAAGGGAAGGGAAGGGAAGGAAAGGAAAGGAAAGGAAATGAAATGAAATGCAATGCCATGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTCGGGTCCGGTCCCGTCCCATCCCAGCCCAGGCCAGGCCAGGCGAGGCGGGGCGGGGCGGGGCGGGGCGGGGCCGGGCCTGGCCTCGCCTCGCCTCGACTCGAGTCGAGCCGAGCGGAGCGTAGCGTGGCGTGCCGTGCCGTGCCCTGCCCAGCCCACCCCACGCCACGCCACGCCACGCCGCGCCGCGCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGCCTGGCCTGGCCAGGCCACGCCACGCCACGCCACGCCACGCCTCGCCTGGCCTGGCCTGGACTGGAGTGGAGTGGAGTTGAGTTGAGTTGCGTTGCATTGCAGTGCAGGGCAGGACAGGAAAGGAACGGAACCGAACCGAACCGGACCGGGCCGGGCCGGGCGGGGCGCGGCGCCGCGCCGCGCCGGGCCGGG",
        "label": "promoter"
    },
    {
        "seq": "CGAAAGGAAAGCAAAGCAAAGCAAAGCAATGCAATCCAATCAAATCAGATCAGTTCAGTGCAGTGGAGTGGCGTGGCCTGGCCTGGCCTGGCCTGGCCTGGACTGGACTGGACCGGACCAGACCATACCATGCCATGTCATGTGATGTGTTGTGTAGTGTAGTGTAGTGTAGTATAGTATAGTATAGTATAGTATAGAATAGAGTAGAGAAGAGAGGAGAGCAGAGCAGAGCAAAGCAACGCAACACAACAGAACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCATGCCATCCCATCTCATCTAATCTATTCTATGCTATGCTATGCTATGCTTTGCTTAGCTTAACTTAATTTAATTTAATTTAATTTGATTTGGTTTGGCTTGGCATGGCAAGGCAACGCAACACAACATAACATTACATTACATTACATTACATTACATTACATGACATGTCATGTAATGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCGTCCCGGCCCGGACCGGAACGGAAAGGAAAAGAAAATAAAATCAAATCTAATCTTATCTTTTCTTTTCTTTTATTTTAA",
        "label": "promoter"
    },
    {
        "seq": "TGACTCGACTCCACTCCCCTCCCATCCCAACCCAAACCAAACCAAACCAAACCAAACCAAACCAACCCAACACAACAAAACAAAACAAAACAAAAGAAAAGGAAAGGGAAGGGGAGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGTGGAGTTGAGTTCAGTTCAGTTCATTTCATCTCATCACATCACATCACCTCACCACACCACACCACTCCACTACACTAGACTAGACTAGACTAGACTAGACTTGACTTTACTTTCCTTTCCTTTCCTTTCCTTTCCTTACCTTATCTTATATTATAATATAAAATAAAATAAAAAAAAAAAAAAAACAAAACAAAACACAACACTACACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGAAGGGAGGGGAGTGGAGTGGAGTGCAGTGCTGTGCTTTGCTTAGCTTAACTTAAGTTAAGCTAAGCAAAGCAGAGCAGAGCAGAACAGAAAAGAAAGGAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAAAAAAATAAAATAAAATACAATACTATACTATACTAA",
        "label": "promoter"
    },
    {
        "seq": "AAGCATAGCATGGCATGACATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAAGGAAAGAAAAGACAAGACTAGACTGGACTGGACTGGGCTGGGCTGGGCTGGGCTAGGCTAGGCTAGGCTAGGCTAGGCAAGGCACGGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACATCACACCACACCACACCTCACCTGACCTGTCCTGTACTGTAATGTAATGTAATCTAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACAAGACAGGACAGGACAGGCCAGGCAAGGCAGGGCAGAGCAGATCAGATGAGATGAGATGACATGACCTGACCTGACCTGACCTGACCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTG",
        "label": "non-promoter"
    },
    {
        "seq": "GGCTTTGCTTTGCTTTGTTTTGTTTTGTTTTGTTTTGTTTTCTTTTCTTTTCTGTTCTGTTCTGTGCTGTGATGTGAGGTGAGTTGAGTTGAGTTAAGTTACGTTACGTTACGGTACGGGACGGGGCGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCATGCCACGCCACCCCACCTCACCTGACCTGCCCTGCACTGCAGTGCAGGGCAGGTCAGGTAAGGTAAGGTAAAGTAAAATAAAATAAAATCAAATCTAATCTGATCTGGTCTGGACTGGACTGGACAGGACATGACATTACATTGCATTGCATTGCCTTGCCCTGCCCTGCCCTGCCCTGACCTGAACTGAAATGAAATGAAATTAAATTGAATTGAATTGACTTGACCTGACCGGACCGAACCGAACCGAACCGAACCGAACCTAACCTTACCTTGCCTTGGCTTGGATTGGATTGGATAGGATACGATACAATACAATACAAAACAAACCAAACCAAACCCAACCCGACCCGGCCCGGCCCGGCCCGGCCTGGCCTGGCCTGACCTGACCTGACATGACAGGACAGTACAGTG",
        "label": "promoter"
    },
    {
        "seq": "TCACCGCACCGTACCGTTCCGTTACGTTACGTTACTTTACTGTACTGCACTGCCCTGCCTTGCCTCGCCTCCCCTCCTCTCCTATCCTAGCCTAGTCTAGTGTAGTGGAGTGGCGTGGCGTGGCGGGGCGGAGCGGATCGGATAGGATACGATACGATACGGTACGGCACGGCGCGGCGGGGCGGCGCGGCACGGCAAGGCAATGCAATACAATAGAATAGTATAGTGTAGTGGAGTGGCGTGGCGTGGCGCGGCGCAGCGCACCGCACAGCACATCACATTACATTCCATTCAATTCAATTCAAGTCAAGGCAAGGCAAGGCAAGGCAGGGCAGGGCAGGACAGGAAAGGAAGGGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGAGAAGAGGAGAGGAGAGGAAAGGAACGGAACAGAACAGAACAGAACAGAGCAGAGCAGAGCCGAGCCAAGCCACGCCACCCCACCACACCAGACCAGCCCAGCACAGCAGAGCAGGGCAGGTCAGGTTAGGTTTGGTTTGGTTTGGTTTGGCTTGGCCTGGCCCGGCCCAGCCCAGCCCAGTCCAGTG",
        "label": "promoter"
    },
    {
        "seq": "AGAAAAGAAAACAAAACAAAACAAAACAAAACAAAACAAAAGAAAAGCAAAGCTAAGCTCAGCTCCGCTCCGCTCCGGTCCGGACCGGAGCGGAGTGGAGTAGAGTAGAGTAGGGTAGGATAGGAAAGGAAAGGAAAGGAAAGTAAAGTGAAGTGAAGTGACGTGACATGACACGACACAACACAGCACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCACGCCACCCCACCTCACCTCACCTCCCCTCCCCTCCCGTCCCGGCCCGGTCCGGTACGGTAGGGTAGCGTAGCCTAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCGGGCCGGGCCGGCCCGGCCCGGCCAGGCCAAGCCAAGCCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCTTGGCTTGGCTTAGCTTAACTTAAGTTAAGCTAAGCGAAGCGGAGCGGGGCGGGCCGGGCCGGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCCTGGCCTGGCCTGGCCTGCCCTGCCCTGCCATGCCAAGCCAAACCAAAA",
        "label": "promoter"
    },
    {
        "seq": "AAGTAGAGTAGAGTAGAGTAGAGGAGAGGCGAGGCCAGGCCTGGCCTCGCCTCCCCTCCTCTCCTGTCCTGCCCTGCTCTGCTTTGCTTCGCTTCACTTCAGTTCAGGTCAGGGCAGGGAAGGGAAGGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGAGTAGAGCAGAGCCGAGCCGAGCCGGGCCGGTCCGGTGCGGTGTGGTGTCGTGTCTTGTCTCGTCTCGTCTCGCCTCGCATCGCACCGCACCGCACCACACCAGACCAGACCAGAGCAGAGCAGAGCCGAGCCCAGCCCCGCCCCACCCCAGCCCAGACCAGATCAGATGAGATGGGATGGAATGGAATGGAACGGAACTGAACTCAACTCTACTCTGCTCTGTTCTGTCCTGTCCTGTCCCGTCCCATCCCATCCCATTCCATTCCATTCAATTCACTTCACATCACATCACATTACATTACATTAAATTAATTTAATTTAATTGAATTGAATTGAATTGAATTGAATCGAATCCAATCCAATCCAGTCCAGTCCAGTACAGTACAGTACTGTACTTTACTTTACTTTGCTTTGA",
        "label": "non-promoter"
    },
    {
        "seq": "GGTGCAGTGCAATGCAAGGCAAGGCAAGGAAAGGAAAGGAATGGAATGGAATGAAATGAAATGAAGTGAAGCGAAGCCAAGCCAAGCCAAGCCAATCCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAACTCAATTCAATCCAATCAAATCATATCATCTCATCGCATCGAATCGAGTCGAGGCGAGGCGAGGCTAGGCTAGGCTACGCTACCCTACCCTACCCTACCCTGCCCTGCCCTGCCCTGCCATGCCATGCCATCCCATCTCATCTTATCTTGTCTTGTCTTGTGTTGTGGTGTGGCGTGGCCTGGCCAGGCCATGCCATGCCATGTCATGTGATGTGATGTGAGGTGAGGTGAGGGGAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACGACACGTCACGTGACGTGTCGTGTAGTGTAGTGTAGAGTAGATTAGATCAGATCAGATCAAATCAATTCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAGCTCAGCTCAGCACAGCATAGCATCGCATCACATCACATCACA",
        "label": "promoter"
    },
    {
        "seq": "AGAGACGAGACTAGACTGGACTGGACTGGCCTGGCATGGCAAGGCAAGGCAAGGCAAGGAAAGGACAGGACAGGACAGGACAGGACAGGCCAGGCTAGGCTCGGCTCGGCTCGCCTCGCCTCGCCCCGCCCTGCCCTTCCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTAGTGTAGAGTAGAGTAGAGCAGAGCCGAGCCTAGCCTCGCCTCGCCTCGCCTCGCATCGCATCGCATTGCATTGCATTGGATTGGCTTGGCCTGGCCAGGCCACGCCACCCCACCACACCAGACCAGGCCAGGACAGGAGAGGAGGGGAGGCGAGGCAAGGCAGGGCAGTGCAGTGCAGTGTAGTGTTGTGTTGTGTTGTGTTGTCTTGTCTTGTCTGGTCTGCTCTGCCCTGCCTTGCCTCGCCTCTCCTCTCCTCTCGTCTCGACTCGAATCGAACCGAACTGAACTTAACTTGACTTGGCTTGGCTTGGCTTGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAACCCAAGCCAAGGCAAGGTAAGGTGAGGTGAGGTGAGGTGAGATGAGAAGAGAAG",
        "label": "promoter"
    },
    {
        "seq": "TGGCGAGGCGACGCGACCCGACCCGACCCCACCCCACCCCAACCCAACCCAACCCAACCTAACCTGACCTGCCCTGCCCTGCCCTGCCCTGCCCTTCCCTTGCCTTGCCTTGCTTTGCTTTGCTTCGCTTCGCTTCGGTTCGGATCGGACCGGACAGGACACGACACTACACTGCACTGCACTGCACTGCAGTGCAGCGCAGCACAGCACAGCACCGCACCCCACCCAACCCAACCCAATCCAATGCAATGGAATGGCATGGCGTGGCGCGGCGCCGCGCCCCGCCCAGCCCAGCCCAGACCAGAACAGAACAGAACCGAACCCAACCCGACCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCCCTGCCGTGCCGCGCCGCGCCGCGGCGCGGGGCGGGCCGGGCAGGGCAGGGCAGTGCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCCAGCCCGGCCCGGCCCGGGCCGGGACGGGATGGGATAGGATAGGATAGCATAGCGTAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCACCCCAA",
        "label": "non-promoter"
    },
    {
        "seq": "CTGTGTTGTGTAGTGTATTGTATAGTATATTATATCATATCTTATCTGATCTGTTCTGTACTGTAATGTAAAGTAAAGTAAAGTAAAGTTAAGTTAAGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGTCCAGTCCAGTCAAGTCAAGTCAACTCAACGCAACGCAACGCTACGCTACGCTAGGCTAGGCTAGGGTAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACTACACTCCACTCTACTCTGCTCTGCTCTGCACTGCAATGCAACGCAACACAACACAACACTACACTCCACTCTACTCTACTCTAGTCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCCTGGCCTGGCCTTGCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTATTGTATAGTATATTATATAATATATTATATGATATGGTATGGCATGGCATGGCAGGGCAGAGCAGAACAGAAAAGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAAAAGAAAGGAAAGTAAAGTAAAGTAAAGTAAAGTAAAT",
        "label": "non-promoter"
    },
    {
        "seq": "CTATATTATATTATATTTTATTTGATTTGGTTTGGATTGGACTGGACAGGACAAGACAATACAATCCAATCGAATCGCATCGCCTCGCCGCGCCGTGCCGTGCCGTGACGTGATGTGATTTGATTAGATTAAATTAAATTAAACTAAACGAAACGAAACGAGACGAGTCGAGTGGAGTGTAGTGTAGTGTATTGTATGGTATGATATGAAATGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGTGGCGTCGCGTCTCGTCTGGTCTGATCTGAACTGAAGTGAAGCGAAGCTAAGCTAAGCTAGGCTAGGCTAGGGTAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTACGCTAGGCTAGACTAGATTAGATAAGATAAGATAAAATAAACTAAACAAAACACAACACTACACTGCACTGAACTGATCTGATTTGATTTGATTTCATTTCCTTTCCCTTCCCCTCCCCTCCCCTTCCCTTTCCTTTACTTTAGTTTAGGTTAGGGTAGGGAAGGGAAGGGAAAGGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAATAGAATG",
        "label": "promoter"
    },
    {
        "seq": "AACTGCACTGCACTGCAGTGCAGGGCAGGACAGGATAGGATGGGATGCGATGCTATGCTCTGCTCTGCTCTTCTCTTGTCTTGGCTTGGATTGGAGTGGAGTGGAGTTGAGTTCAGTTCTGTTCTGTTCTGGTCTGGTCTGGTCTGGTCTGGTCTAGTCTACTCTACTCTACTCTACTCTACTCTGCTCTGCTCTGCGCTGCGATGCGATGCGATGCGATGCGATGCTATGCTTTGCTTGGCTTGTCTTGTTTTGTTTTGTTTGGTTTGCTTTGCATTGCAATGCAAAGCAAAACAAAACAAAACCAAACCCAACCCTACCCTGCCCTGTCCTGTCCTGTCATGTCATGTCATGTCATGACATGAGATGAGATGAGAAGAGAAGAGAAGGGAAGGTAAGGTCAGGTCCGGTCCAGTCCACTCCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGCATGGCATGGCAGGGCAGGGCAGGTCAGGTTAGGTTTGGTTTCGTTTCATTTCAGTTCAGCTCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCTGTGCTATGCTAC",
        "label": "promoter"
    },
    {
        "seq": "GCTTTCCTTTCCTTTCCTTTCCTGTCCTGGCCTGGCCTGGCCTGGCCCGGCCCCGCCCCCCCCCCACCCCAACCCAAGCCAAGACAAGAGAAGAGTAGAGTGGAGTGCAGTGCAGTGCAGTGCAGGGCAGGGCAGGGAAGGGATGGGATGGGATGCGATGCCATGCCCTGCCCAGCCCAGCCCAGGCCAGGTCAGGTCAGGTCTGGTCTGGTCTGCTCTGCACTGCAATGCAACGCAACCCAACCAAACCACACCACCCCACCACACCACACCACTCCACTGCACTGGACTGGGCTGGGTTGGGTGGGGTGGGGTGGCGTGGCTTGGCTGGGCTGCGCTGCACTGCAGTGCAGCGCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCCGTGCCCTGCCCAGCCCAGCCCAGGCCAGGACAGGAGAGGAGGGGAGGGGAGGGTAGGGTGGGGTGGGGTGGGGTGGGATGGGACGGGACTGGACTCGACTCCACTCCTCTCCTGTCCTGCCCTGCCCTGCCCTGCCCCGCCCCACCCCACCCCACCCCACCACACCAAACCAACCCAACTCAACTC",
        "label": "non-promoter"
    },
    {
        "seq": "TGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCCGTGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTG",
        "label": "non-promoter"
    },
    {
        "seq": "GGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGAAGGGAAGGGAAGTGAAGTGAAGTGCAGTGCAGTGCATTGCATCGCATCCCATCCCATCCCTTCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAAAGAAACGAAACAAAACATAACATGACATGGCATGGGATGGGTTGGGTAGGGTAAGGTAAGGTAAGCTAAGCGAAGCGTAGCGTGGCGTGTCGTGTGGTGTGCTGTGCAGTGCAGTGCAGAGCAGACCAGACGAGACGTGACGTGACGTGGCGTGGAGTGGAGTGGAGAGGAGAGGAGAGGAGAGGGGAGGGCAGGGCGGGGCGTGGCGTGGCGTGGCGTGGGGTGGGGTGGGGTGGGGTGGGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGACGGGACGGGACGCGACGCCACGCCCCGCCCAGCCCACCCCACCCCACCCCACCCCACCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAA",
        "label": "non-promoter"
    },
    {
        "seq": "AAGTTTAGTTTTGTTTTTTTTTTCTTTTCCTTTCCATTCCACTCCACCCCACCTCACCTGACCTGCCCTGCCCTGCCATGCCACGCCACTCCACTTCACTTCACTTCACTTCACTTCACATCACAACACAATACAATGCAATGAAATGACATGACCTGACCCGACCCTACCCTCCCCTCCCCTCCACTCCAGTCCAGCCCAGCGCAGCGCAGCGCCGCGCCCCGCCCTGCCCTCCCCTCTCCTCTACTCTACTCTACTCTACTGTACTGGACTGGCCTGGCATGGCAGGGCAGAGCAGAGCAGAGAAGAGACGAGACTAGACTAGACTAGACTAGCCTAGCATAGCATAGCATCGCATCACATCAAATCAAGTCAAGCCAAGCCAAGCCAAGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGCCTGGCATGGCAAGGCAAAGCAAACCAAACCAAACCAAACCAGACCAGACCAGAGCAGAGGAGAGGCGAGGCGAGGCGTGGCGTCGCGTCCCGTCCTGTCCTTTCCTTTCCTTTACTTTAATTTAAGTTAAGGTAAGGTAAGGTCAGGTCC",
        "label": "promoter"
    },
    {
        "seq": "TTTTTTTTTTTTTTTTTGTTTTGCTTTGCGTTGCGGTGCGGGGCGGGGCGGGGCGGGGCGGGGCGCGGCGCAGCGCAGCGCAGTGCAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGTCCTGTACTGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACGAGACGGGACGGGACGGGCCGGGCAGGGCAGGGCAGAGCAGATCAGATCAGATCAGATCACATCACGTCACGACACGAGACGAGGCGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGAGGAGATGAGATCAGATCGGATCGAATCGAGTCGAGACGAGACGAGACTAGACTAGACTATACTATCCTATCCTATCCTATCCTGTCCTGGCCTGGCCTGGCTTGGCTAGGCTAA",
        "label": "non-promoter"
    },
    {
        "seq": "CCGCCTCGCCTTGCCTTCCCTTCCCTTCCCTTCCCTTCCCTCCCCTCTCCTCTGCTCTGTTCTGTTCTGTTTTGTTTTGTTTTTTTTTTGTTTTGGTTTGGCTTGGCATGGCATGGCATAGCATAACATAAGATAAGATAAGAAAAGAAAAGAAACGAAACAAAACAAAACAATACAATTCAATTCAATTCAATTCAGTTCAGGTCAGGTCAGGTTAGGTTTGGTTTAGTTTATTTTATCTTATCATATCAAATCAAGTCAAGGCAAGGAAAGGAGAGGAGAGGAGAGGAGAGTAGAGTCGAGTCCAGTCCAGTCCAGTCCAGGCCAGGGCAGGGTAGGGTCGGGTCAGGTCAGGTCAGATCAGAACAGAATAGAATTGAATTTAATTTTATTTTTTTTTTCTTTTCTTTTCTATTCTAATCTAACCTAACCTAACCAAACCACACCACCCCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCAGCGCAGCGCAGAGCAGAGCAGAGCAGAGCAGAGCAAAGCAAGGCAAGCCAAGCTAAGCTTAGCTTA",
        "label": "promoter"
    },
    {
        "seq": "ACAAAACAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAGAAAAGCAAAGCGAAGCGGAGCGGGGCGGGACGGGAAGGGAAGGGAAGCGAAGCAAAGCAGAGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGGCTTGGGTTGGGTTGGGTTGGGTTAGGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGCCCAGCACAGCACAGCACAGCACACCACACCACACCCCACCCAACCCACCCCACCCCACCACACCAGACCAGACCAGAGCAGAGGAGAGGGGAGGGCAGGGCAGGGCAGGGCAGCGCAGCACAGCAGAGCAGAGCAGACCAGACAAGACACGACACTACACTGCACTGGACTGGCCTGGCTTGGCTAGGCTAAGCTAAACTAAAGTAAAGCAAAGCTAAGCTCAGCTCTGCTCTTCTCTTATCTTAGCTTAGTTTAGTCTAGTCAAGTCATGTCATATCATAACATAAGATAAGTTAAGTCAAGTCCAGTCCTGTCCTGTCCTGACCTGAGCTGAGTTGAGTGGAGTGCAGTGCT",
        "label": "promoter"
    },
    {
        "seq": "ACTGGACTGGAATGGAAAGGAAAAGAAAATAAAATTAAATTTAATTTTATTTTATTTTAATTTAAATTAAATTAAATGAAATGAAATGAAATGAATTGAATGGAATGAAATGATATGATGTGATGTGATGTGATGTGATGTGATGTGATTTGATTCGATTCTATTCTGTTCTGTTCTGTGCTGTGGTGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCTCTCCTGTCCTGTCCTGTGCTGTGGTGTGGGGTGGGCTGGGCAGGGCAGGGCAGCGCAGCACAGCACAGCACTGCACTGCACTGGACTGGCCTGGCCTGGCCTGGCCTGGCCTGACCTGAACTGAAGTGAAGCGAAGCAAAGCACAGCACAGCACAACACAAAACAAACCAAACCAAACCTAACCTGACCTGGCCTGGACTGGAGTGGAGCGGAGCCGAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGCCCAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAG",
        "label": "promoter"
    },
    {
        "seq": "AGAGCTGAGCTGAGCTGTGCTGTCCTGTCTTGTCTGGTCTGCTCTGCTCTGCTGTGCTGGGCTGGGCTGGGGTGGGGGGGGGGCGGGGCAGGGCAGGGCAGGGCAGGGCAGGGCAGGGCGGGGCGCGGCGCTGCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTGTTCTGGTCTGGGCTGGGCTGGGCAGGGCACGGCACTGCACTGCACTGTACTGTACTGTAGTGTAGGGTAGGATAGGATAGGATGGGATGTGATGTTATGTTATGTTAGGTTAGCTTAGCATAGCAGAGCAGCGCAGCGCAGCGAAGCGACGCGACCCGACCCGACCCTACCCTGCCCTGGCCTGGCCTGGCCTGGCCTGGCCTCGCCTCTCCTCTACTCTACTCTACCCTACCATACCACACCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGCGATGCCATGCCATGCCAGGCCAGTCCAGTACAGTAGAGTAGCGTAGCATAGCACAGCACCGCACCCCACCCTACCCTCCCCTCCCCTCCTCTCCTCTCCTCTCCTCTCCTCTCCTCTCCACTCCAG",
        "label": "promoter"
    },
    {
        "seq": "GCTTTGCTTTGTTTTGTTTTGTTATGTTACGTTACATTACAGTACAGGACAGGTCAGGTGAGGTGTGGTGTCGTGTCTTGTCTGGTCTGTTCTGTTCTGTTATGTTAAGTTAACTTAACATAACATAACATTACATTCCATTCCATTCCATTCCATTCCATGCCATGGCATGGAATGGACTGGACCGGACCAGACCAAACCAAACCAAAACAAAACAAAACAAAACAAAACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAAGCCAAACCAAACCAAACCAAACCCAACCCAACCCAACCCAAACCAAAACAAAATAAAATCAAATCAAATCAAATCAAGTCAAGGCAAGGGAAGGGAAGGGACGGGACAGGACAGGACAGGACAGGACAGGAAAGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGACTAGACTAGACTCGACTCCACTCCACTCCACTCCACCCCACCCCACCCAACCCATCCCATGCCATGCCATGCAATGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGAGTGGAATGGAAGGGAAGGGAAGGGAAGGGGAGGGGA",
        "label": "non-promoter"
    },
    {
        "seq": "GGTGATGTGATGTGATGCGATGCTATGCTATGCTACGCTACACTACAGTACAGGACAGGGCAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCTGCCCTCCCCTCACCTCATCTCATATCATAGCATAGGATAGGATAGGACAGGACAGGACAGGACAGGACAGGTCAGGTGAGGTGCGGTGCTGTGCTCTGCTCAGCTCACCTCACCTCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGAGCGGAGCGGAGGGGAGGAGAGGACAGGACAGGACAAGACAACACAACCCAACCCAACCCGACCCGTCCCGTCCCGTCCCGTCCGGTCCGGTCCGGGCCGGGCCGGGCTGGGCTGGGCTGGGCTGGACTGGAGTGGAGCGGAGCAGAGCAGAGCAGGGCAGGTCAGGTCAGGTCAGGTCAAGTCAAGTCAAGACAAGAGAAGAGGAGAGGCGAGGCTAGGCTCGGCTCTGCTCTGCTCTGGTCTGGGCTGGGATGGGAGGGGAGAGGAGACGAGACAAGACACGACACTACACTTCACTTCACTTCC",
        "label": "non-promoter"
    },
    {
        "seq": "GGGGCAGGGCAGGGCAGGGCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCCAAGCCCAGCCCAGCCCATCCCATCCCATCTCATCTAATCTACTCTACACTACAATACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAGGCCAGTCCAGTGCAGTGGAGTGGCGTGGCTTGGCTTGGCTTTGCTTTTCTTTTCTTTTCCTTTCCCTTCCCCTCCCCCCCCCCACCCCAACCCAACCCAACCCAACCCAACCCAACCCAGCCCAGTCCAGTCCAGTCCAGTCCTGTCCTTTCCTTCCCTTCCCTTCCCTTCCCATCCCAACCCAAACCAAATCAAATTAAATTCAATTCCATTCCCTTCCCATCCCACCCCACACCACAGCACAGCACAGCCCAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGACCAGATCAGATCAGATCCGATCCCATCCCTTCCCTGCCCTGCCCTGCACTGCAATGCAACGCAACCCAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGCCCCGCTCCGCTT",
        "label": "non-promoter"
    },
    {
        "seq": "ATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTCGTTTCCTTTCCGTTCCGTTCCGTGCCGTGGCGTGGGGTGGGATGGGAGGGGAGAGGAGAGGAGAGGAGAGGTGAGGTAAGGTAAGGTAACGTAACATAACACAACACAACACAACACAATACAATACAATAGAATAGCATAGCTTAGCTTAGCTTGGCTTGTCTTGTATTGTATTGTATCGTATCATATCAGATCAGTTCAGTCCAGTCAAGTCATGTCATTTCATTACATTACATTACCTTACCATACCACACCACTCCACTTCACTTGACTTGACTTGAGTTGAGTTGAGTGGAGTGTAGTGTGGTGTGATGTGAAGTGAAGTGAAGCGAAGCAAAGCAGAGCAGTGCAGTTCAGTTAAGTTAGGTTAGTTTAGTCTAGTCAAGTCAAGTCAAATCAAAGCAAAGTAAAGTCAAGTCTAGTCTGGTCTGGTCTGGGCTGGGATGGGAGGGGAGTGGAGTGGAGTGAAGTGAAGTGAATTGAATGGAATGAAATGAGATGAGATGAGAGGAGAGTAGAGTAGAGTAGAGTAGAGTAGAA",
        "label": "non-promoter"
    },
    {
        "seq": "AACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAAAAAAAAAAAAAACAAAACAAAACACAACACAACACAGCACAGCACAGCACAGCAAAGCAAAGCAAACCAAACCAAACCTAACCTGACCTGTCCTGTACTGTATTGTATGGTATGTTATGTTATGTTGTGTTGTGTTGTCTTGTCCTGTCCCGTCCCTTCCCTTCCCTTCCCTTCCCTTCCATTCCAGTCCAGGCCAGGTCAGGTCAGGTCCGGTCCCGTCCCCTCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGGCTGGGCTGGGCAGGGCATGGCATTGCATTTCATTTGATTTGCTTTGCATTGCAGTGCAGAGCAGAACAGAACAGAACCGAACCGAACCGCACCGCACCGCAGCGCAGCGCAGCACAGCATAGCATCGCATCCCATCCCATCCCATCCCAGCCCAGACCAGATCAGATCAGATCAGATCACATCACTTCACTCCACTCGACTCGTCTCGTTTCGTTACGTTAAGTTAAATTAAAATAAAAAAAAAAAAAAAATAAAATT",
        "label": "promoter"
    },
    {
        "seq": "TCCTGACCTGATCTGATATGATAAGATAAAATAAACTAAACCAAACCCAACCCAACCCATCCCATGCCATGGCATGGGATGGGATGGGATGGGATCGGATCTGATCTCATCTCATCTCATCTCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATTGAATTAAATTATATTATTTTATTCTATTCAATTCATTTCATTTCATTACATTATATTATCTTATCATATCATATCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATAGAATAGAATAGTATAGTATAGTATAGTATGGTATGGTATGGGATGGGATGGGAAGGGAAAGGAAAGGAAAGAAAAGACAAGACCAGACCAGACCAGACCAGTCCAGTCCAGTCCAGTCCCGTCCCCTCCCCACCCCATCCCATGCCATGACATGATATGATTTGATTCGATTCAATTCAATTCAATTCAATTCAATTAAATTACATTACCTTACCTTACCTCACCTCCCCTCCCCTCCCCTCCCCCCCCCCTCCCCTGCCCTGGCCTGGGCTGGGTTGGGTCGGGTCCGGTCCCGTCCCT",
        "label": "non-promoter"
    },
    {
        "seq": "GTCATCTCATCGCATCGTATCGTATCGTAGCGTAGTGTAGTATAGTACAGTACTGTACTATACTACACTACACTACATTACATTACATTTCATTTTATTTTATTTTAATTTAAATTAAACTAAACAAAACATAACATGACATGTCATGTAATGTAATGTAAAGTAAAGTAAAGAAAAGAGAAGAGCAGAGCTGAGCTCAGCTCAGCTCAGCTCAGTTCAGTGCAGTGGAGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCACTCCAATCCAAGCCAAGACAAGAAAAGAAGAGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGACAAGACTAGACTTGACTTTACTTTGCTTTGGTTTGGTTTGGTATGGTAGGGTAGAGTAGAGTAGAGAAGAGACGAGACGAGACGGGACGGCACGGCCCGGCCGGGCCGCGCCGCTCCGCTTCGCTTGGCTTGCCTTGCTTTGCTCTGCTCCGCTCCCCTCCCATCCCAACCCAAACCAAATCAAATAAAATATAATATCATATCATATCATATCATGTCATGCCATGCTATGCTGTGCTGA",
        "label": "non-promoter"
    },
    {
        "seq": "ACTGCGCTGCGCTGCGCGGCGCGCCGCGCCGCGCCGCGCCGAGCCGACCCGACGCGACGGGACGGTACGGTGCGGTGGGGTGGGGTGGGCTGGGCTGGGCTGGGCTGGGCTGGCCTGGCGTGGCGGGGCGGGGCGGGACGGGACGGGACCGGACCAGACCAGACCAGGCCAGGACAGGACAGGACAGGACAGGACAGGACAGGACAGGAAAGGAACGGAACAGAACAAAACAATACAATGCAATGGAATGGGATGGGATGGGATGGGATTGGATTCGATTCCATTCCGTTCCGATCCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGCGCCGCACCGCAACGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTCCGCTCCGCTCCTCTCCTTTCCTTCCCTTCTCTTCTGTTCTGCTCTGCGCTGCGGTGCGGGGCGGGTCGGGTTGGGTTGGGTTGGGTTGGGTTGGGGTGGGGTGGGGTGGGGTGCGGTGCGGTGCGATGCGAGGCGAGGCGAGGCGAGGCCAGGCCGGGCCGGGCCGGA",
        "label": "promoter"
    },
    {
        "seq": "TGTGCTGTGCTGTGCTGAGCTGATCTGATGTGATGCGATGCCATGCCTTGCCTGGCCTGTCCTGTGCTGTGGTGTGGTGTGGTTTGGTTTGGTTTGGTTTGGTTTGGTTTGGTGTGGTGGGGTGGGGTGGGGTGGGGCGGGGCTGGGCTAGGCTACGCTACACTACAATACAACACAACACAACAGAACAGGACAGGACAGGAAAGGAAAGGAAATGAAATTAAATTCAATTCCATTCCTTTCCTGTCCTGCCCTGCTCTGCTTTGCTTTGCTTTGCTTTGGTTTGGATTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACAGGACAGAACAGAACAGAAAAGAAAGGAAAGCAAAGCAAAGCAGAGCAGAGCAGATCAGATAAGATAGGATAGCATAGCCTAGCCAAGCCAAGCCAAACCAAATCAAATTAAATTCAATTCTATTCTCTTCTCTTCTCTCCTCTCTTCTCTACTCTACTCTACCCTACCATACCACACCACACCACATCACATTACATTTCATTTTATTTTGTTTTGGTTTGGATTGGAATGGAAAGGAAACGAAACT",
        "label": "non-promoter"
    },
    {
        "seq": "GGTTCCGTTCCCTTCCCGTCCCGCCCCGCTCCGCTTCGCTTCGCTTCCCTTCCATTCCACTCCACCCCACCGCACCGAACCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGAGCCGACCCGACTCGACTGGACTGCACTGCGCTGCGATGCGAGGCGAGGCGAGGTGAGGTGAGGTGCGGTGCAGTGCATTGCATGGCATGCCATGCTATGCTGTGCTGGGCTGGGCTGGGATGGGAGGGGAGTGGAGTCGAGTCGAGTCGTGTCGTATCGTAGCGTAGTGTAGTATAGTACAGTACCGTACCGTACCGCACCGCACCGCACCGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCGGGGCGGGGCGGAGCGGAACGGAACGGAACAGAACAGAACAGCACAGCTCAGCTCAGCTCCGCTCCGCTCCGCTCCGCCCCGCCCCGCCCCGCCCCGCCCCGGCCCGGCCCGGCGCGGCGGGGCGGAGCGGATCGGATGGGATGGGATGGTATGGTGTGGTGTGGTGTTGTGTTTTGTTTCGTTTCCTTTCCATTCCAGTCCAGA",
        "label": "non-promoter"
    },
    {
        "seq": "GCCCGGCCCGGGCCGGGACGGGAGGGGAGCGGAGCGGAGCGTAGCGTCGCGTCGCGTCGCGTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCCCTCCCCTCCCCCCCCCCGCCCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCCCCGCCCCGCCCCGCCCCGCCTCGCCTCGCCTCGCCTCGGCTCGGGTCGGGGCGGGGAGGGGACGGGACTGGACTCGACTCGACTCGTCTCGTCTCGTCCCGTCCCGTCCCTTCCCTCCCCTCCCCTCCACTCCACTCCACACCACAGCACAGCACAGCCCAGCCCAGCCCCGCCCCTCCCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGTCCCGTCCCGTCGCGTCGGGTCGGATCGGAACGGAATGGAATTGAATTCAATTCGATTCGCTTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCGCTCCGCTCCGCCCCGCCGCGCCGTGCCGTTCCGTTCCGTTCTGTTCTT",
        "label": "non-promoter"
    },
    {
        "seq": "CTGGCTTGGCTGGGCTGCGCTGCTCTGCTCTGCTCCGCTCCTCTCCTTTCCTTACCTTACCTTACATTACAATACAAAACAAACCAAACCAAACCTAACCTGACCTGTCCTGTGCTGTGGTGTGGAGTGGAGTGGAGTGGAGTTGAGTTGAGTTGGGTTGGATTGGACTGGACTGGACTTGACTTGACTTGCCTTGCTTTGCTGTGCTGTGCTGTTCTGTTTTGTTTTGTTTTTTTTTTCTTTTCCTTTCCTTTCCTCTCCTCTCCTCTTCTCTTGTCTTGCCTTGCCTTGCCATGCCACGCCACTCCACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGTAGGGTAGGGTAGGGTAGAGTAGAATAGAATAGAATAGAATATAATATGATATGATATGAAATGAAATGAAAAGAAAAGAAAAGAAAAGAAAAGAAGAGAAGAGAAGATAAGATTAGATTAGATTAGATTAGCTTAGCATAGCATAGCATGGCATGTCATGTTATGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATCTCATCCCATCCTATCCTTTCCTTGCCTTGC",
        "label": "promoter"
    },
    {
        "seq": "CCCTGCCCTGCTCTGCTATGCTACGCTACACTACAGTACAGTACAGTTCAGTTTAGTTTTGTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTGTTTTGATTTGATTTGATATGATATGATATAATATACTATACAATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTTGTTTTGTTTTGCTTTGCCTTGCCATGCCAGGCCAGCCCAGCTCAGCTTAGCTTTGCTTTTCTTTTTTTTTTTTTTTTCTTTTCATTTCACTTCACTTCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGTATGGTGTGGTGCGGTGCTGTGCTATGCTAAGCTAATCTAATTTAATTCAATTCCATTCCCTTCCCTTCCCTTCCCTTTCCTTTACTTTAGTTTAGTTTAGTGTAGTGAAGTGACGTGACCTGACCTGACCTAACCTAGCCTAGGCTAGGCTAGGCTAGGCTGGGCTGTGCTGTTCTGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATATCATACCATACGATACGATACGATACGATGCGATGTGATGTTATGTTTTGTTTA",
        "label": "promoter"
    }
    ],
    "validation": [
    {
        "seq": "GTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGGGTGGGCTGGGCCGGGCCTGGCCTGGCCTGTCCTGTACTGTAGTGTAGCGTAGCATAGCAGAGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACCGCACCTCACCTGACCTGACCTGAGCTGAGGTGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGCAGGGCTGGGCTGGGCTGGGCTGGCCTGGCATGGCAGGGCAGCGCAGCCCAGCCCAGCCCCGCCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTG",
        "label": "non-promoter"
    },
    {
        "seq": "GTGTGGTGTGGGGTGGGATGGGATGGGATCGGATCAGATCATATCATGTCATGTCATGTAATGTATTGTATCGTATCATATCAGATCAGTTCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGACTTGACTTGACCTGACCTGACCTCACCTCCCCTCCTCTCCTGTCCTGGCCTGGGCTGGGCTGGGCTGGGCTCGGCTCAGCTCAACTCAAGTCAAGCCAAGCAAAGCATAGCATTGCATTCCATTCTATTCTTTTCTTCTCTTCCCTTCCCTTCCCATCCCACCCCACCCCACCTCACCTCACCTCACCTCAACTCAACTCAACCCAACCTAACCTCACCTCTCCTCTTCTCTTGTCTTGACTTGAGTTGAGTTGAGTAGAGTAGAGTAGCGTAGCTTAGCTGAGCTGAGCTGAACTGAAATGAAATGAAATTAAATTAAATTACATTACATTACAGTACAGGACAGGACAGGAAAGGAACGGAACAGAACATAACATGACATGCCATGCCATGCCATGCCACGCCACCCCACCACACCACACCACA",
        "label": "non-promoter"
    },
    {
        "seq": "CCCTGCCCTGCACTGCATTGCATGGCATGCCATGCCATGCCATGCCACGCCACACCACATCACATAACATAGCATAGCATAGCATAGCAAAGCAAGGCAAGGCAAGGTAAGGTGAGGTGCGGTGCTGTGCTGTGCTGGGCTGGGCTGGGTTGGGTCGGGTCAGGTCACGTCACTTCACTGCACTGAACTGATCTGATGTGATGCGATGCTATGCTATGCTAAGCTAACCTAACATAACATAACATCACATCTCATCTAATCTAATCTAAACTAAACTAAACAAAACAGAACAGGACAGGGCAGGGGAGGGGCGGGGCCGGGCCAGGCCAGGCCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGTGCGGTGCGGTGGGGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACTTCACTCCACTCTACTCTTCTCTTGTCTTGACTTGAATTGAAATGAAATGAAATCAAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGC",
        "label": "non-promoter"
    },
    {
        "seq": "TTGGAGTGGAGCGGAGCAGAGCAAAGCAAGGCAAGGCAAGGCAAGGCTAGGCTAGGCTATGCTATGCTATGCTATGCAATGCACTGCACCGCACCACACCATACCATACCATACCATACAATACATTACATGACATGCCATGCTATGCTCTGCTCTGCTCTGCTCTGATCTGAGCTGAGTTGAGTGGAGTGGAGTGGGGTGGGCTGGGCTGGGCTTGGCTTGGCTTGACTTGATTTGATTTGATTCGATTCCATTCCTTTCCTCTCCTCCCCTCCACTCCAGTCCAGGCCAGGGCAGGGAAGGGAAGGGAAGGGAAGAGAAGAGAAGAGGAGAGGCGAGGCCAGGCCAGGCCAGGCCAGGCCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCAAAGCATAGCATTGCATTGCATTGAATTGATTTGATGTGATGTGATGTGATGTGATGTGAAGTGAAATGAAAAGAAAACAAAACAAAACAGAACAGCACAGCCCAGCCTAGCCTTGCCTTTCCTTTCCTTTCCTTTCCCTTCCCTTCCCTTCCCTTGCCTTGCCTTGCCTTGCCATGCCAT",
        "label": "non-promoter"
    },
    {
        "seq": "AGCACAGCACAGCACAGGACAGGGCAGGGCAGGGCAGGGCACGGCACTGCACTGCACTGGACTGGTCTGGTGTGGTGGGGTGGAGTGGAGTGGAGGGGAGGGGAGGGAAGGGAGGGGAGCGGAGCCGAGCCCAGCCCTGCCCTGCCCTGCCCTGCGCTGCGGTGCGGGGCGGGGCGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAACTCAAGTCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGATCGGATGGGATGGGATGGGATGGGTTGGGTGGGGTGTGGTGTGGTGTGATGTGAGGTGAGATGAGAGGAGAGGAGAGGCGAGGCAAGGCACGGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCTGGGCTGGGCTGAGCTGAACTGAAGTGAAGCGAAGCAAAGCAGAGCAGCGCAGCACAGCATAGCATCGCATCTCATCTGATCTGGTCTGGGCTGGGTTGGGTTGGGTTTGGTTTGGTTTGATTTGAGTTGAGGTGAGGAGAGGAA",
        "label": "non-promoter"
    },
    {
        "seq": "AGGCCAGGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGGTGGGGTGGGGTCGGGTCAGGTCAAGTCAAGTCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAGCCCAGCCCAGCACAGCACAGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGTGTGGTTTGGTTCGGTTCTGTTCTGTTCTGCTCTGCTCTGCTCTGCTCCGCTCCACTCCAGTCCAGACCAGAGCAGAGGAGAGGTGAGGTGAGGTGCGGTGCAGTGCAGTGCAGTGCAGTCCAGTCAAGTCAGGTCAGATCAGACCAGACTAGACTGGACTGCACTGCCCTGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTTGGGTTGGGTTGGGTTGGCTTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTCGGGTCTGGTCTGGTCTGTTCTGTCCTGTCATGTCATGTCATT",
        "label": "non-promoter"
    },
    {
        "seq": "GTGCGATGCGAGGCGAGACGAGATGAGATGAGATGAGATGACATGACGTGACGCGACGCAACGCACCGCACTGCACTTCACTTCACTTCCCTTCCTTTCCTGTCCTGCCCTGCCCTGCCTTGCCTGGCCTGACCTGAGCTGAGGTGAGGCGAGGCGAGGCGGGGCGGCGCGGCCCGGCCGGGCCGCGCCGCTCCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTCTTCTCGTCTCGCCTCGCGTCGCGGCGCGGCGCGGCTCGGCTTGGCTTCGCTTCCCTTCCGTTCCGGTCCGGCCCGGCACGGCAGGGCAGGGCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCTGCGCTGCGCTGAGCTGAGCTGAGATGAGACGAGACCAGACCAGACCACACCACGCCACGGCACGGGACGGGACGGGAAGGGAAGGGAAGCGAAGCCAAGCCAAGCCAGGCCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCTTGGCTGGGCTGTGCTGTCCTGTCGTGTCGGGTCGGTTCGGTTCGGTTAGGTTAGGTTAGCTTAGCC",
        "label": "promoter"
    },
    {
        "seq": "GTTCTTTTCTTGTCTTGGCTTGGATTGGATTGGATCGGATCAGATCACATCACATCACACCACACTACACTCCACTCGACTCGACTCGAGTCGAGGCGAGGAGAGGAAAGGAAAGGAAAGGAAAGCAAAGCTAAGCTCAGCTCCGCTCCACTCCAGTCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGCTGGGCCGGGCCCGGCCCAGCCCAGCCCAGACCAGATCAGATTAGATTTGATTTGATTTGGTTTGGGTTGGGGTGGGGCGGGGCTGGGCTTGGCTTCGCTTCTCTTCTGTTCTGTTCTGTCCTGTCCTGTCCTGTCCTGTCCTGACCTGAACTGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGCGGCGCTGCGCTGCGCTGGGCTGGCCTGGCTTGGCTCGGCTCCGCTCCTCTCCTGTCCTGGCCTGGTCTGGTGTGGTGTGGTGTGGTGTGATGTGAAGTGAATTGAATGGAATGGAATGGGATGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCAGCCCAGCCCAGGCCAGGGCAGGGCAGGGCTGGGCTG",
        "label": "non-promoter"
    },
    {
        "seq": "GGCCAGGCCAGGCCAGGGCAGGGGAGGGGAGGGGACGGGACCGGACCAGACCAGACCAGGCCAGGCCAGGCTAGGCTGGGCTGGGCTGGGCTGGGATGGGAGGGGAGAGGAGAGGAGAGCAGAGCTGAGCTGAGCTGCGCTGCCCTGCCATGCCAAGCCAACCCAACCCAACCGAACCGCACCGCACCGCACCGCACCGCACCTCACCTGACCTGTCCTGTGCTGTGATGTGAAGTGAAGTGAAGGGAAGGAAAGGAAAGGAATGGAATGGAATGGAATGGTATGGTCTGGTCAGGTCAGGTCAGGTCAGGACAGGAAAGGAACGGAACCGAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCATCCCACCCCACCCCACCCCACCCTACCCTGCCCTGGCCTGGGCTGGGATGGGATGGGATGGGATGCGATGCAATGCATTGCATTGCATTCCATTCCATTCCTTTCCTGTCCTGGCCTGGCCTGGCTTGGCTTGGCTTTGCTTTTCTTTTATTTTACTTTACCTTACCATACCAGACCAGTCCAGTTCAGTTAAGTTATGTTATTTTATTC",
        "label": "non-promoter"
    },
    {
        "seq": "ATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGAGCCGAGCCGAGGCGAGGTGAGGTGAGGTGGGGTGGGGTGGGTTGGGTGGGGTGGGGTGGAGTGGATTGGATCGGATCAGATCATATCATCTCATCTCATCTGATCTGATCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTGGTTTGATTTGAATTGAAATGAAACGAAACCAAACCAAACCAGACCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCAGGCCAAGCCAACCCAACACAACATAACATGACATGGCATGGCATGGCATGGCAAGGCAAAGCAAAACAAAACAAAACCAAACCCAACCCCACCCCGCCCCGTCCCGTCCCGTCTCGTCTCGTCTCTTCTCTACTCTACTCTACTCTACTATACTAAACTAAACTAAAATAAAAAAAAAATAAAATAAAATAC",
        "label": "non-promoter"
    },
    {
        "seq": "CCGCCACGCCAGGCCAGGCCAGGCCAGGCTAGGCTCGGCTCCGCTCCTCTCCTCTCCTCTCCTCTGCTCTGCTCTGCACTGCAGTGCAGCGCAGCGCAGCGCAGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGCCGCGCAGCGCAGCGCAGCGCAGCGCAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCGCCTCGCCTCGCGTCGCGGCGCGGTGCGGTACGGTAGGGTAGGGTAGGCTAGGCGAGGCGCGGCGCGGCGCGGCGCGGAGCGGAGCGGAGGGGAGGAGAGGAAAGGAAGGGAAGCGAAGCGAAGCGGAGCGGCGCGGCCCGGCCAGGCCACGCCACACCACAGCACAGGACAGGGCAGGGCAGGGCTGGGCTGGGCTGCGCTGCCCTGCCGTGCCGCGCCGCCCCGCCCCGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAACAGAATAGAATCGAATCGAATCGCATCGCATCGCAGCGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACGGCACGACACGATACGATC",
        "label": "promoter"
    },
    {
        "seq": "CGCAGTGCAGTGCAGTGGAGTGGTGTGGTCTGGTCTGGTCTTGTCTTGTCTTGGCTTGGCTTGGCATGGCAGGGCAGCGCAGCTCAGCTGAGCTGCGCTGCCCTGCCATGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTAGAGTAGAGTAGGGTAGGTTAGGTAAGGTAGGGTAGTGTAGTGTAGTGCAGTGCCGTGCCTTGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGACCGGACCGGACCCGACCCTACCCTCCCCTCGCCTCGCCTCGCTTCGCTACGCTAGGCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCATGGCACGGCACTGCACTACACTATACTATCCTATCATATCACATCACATCACAACACAAAACAAAGCAAAGGAAAGGAAAGGACAGGACAGGACAAGACAAAACAAAGCAAAGCAAAGCTAAGCTGAGCTGGGCTGGTCTGGTTTGGTTGGGTTGAGTTGAATTGAAGTGAAGCGAAGCTAAGCTAAGCTACGCTACACTACAGTACAGAACAGAACAGAAAAGAAATGAAATCAAATCCAATCCC",
        "label": "promoter"
    },
    {
        "seq": "TACCGGACCGGACCGGAGCGGAGAGGAGACGAGACCAGACCGGACCGCACCGCACCGCACCGCACTGCACTGCACTGAACTGAACTGAAGTGAAGAGAAGACAAGACTAGACTGGACTGTACTGTTCTGTTTTGTTTTGTTTTATTTTAGTTTAGATTAGAGTAGAGTAGAGTTGAGTTGAGTTGAGTTGACTTGACTTGACTGGACTGAACTGACCTGACATGACAGGACAGTACAGTGCAGTGGAGTGGCGTGGCATGGCAGGGCAGCGCAGCGCAGCGAAGCGATGCGATTCGATTCGATTCTATTCTCTTCTCCTCTCCTCTCCTGTCCTGTCCTGTCCTGTCTTGTCTCGTCTCCTCTCCACTCCAGTCCAGCCCAGCCCAGCCCAGCCCTGCCCTCCCCTCACCTCAGCTCAGCTCAGCACAGCAGAGCAGTGCAGTGCAGTGTAGTGTCGTGTCCTGTCCCGTCCCTTCCCTTCCCTTTCCTTTGCTTTGGTTTGGGTTGGGCTGGGCAGGGCACGGCACCGCACCCCACCCAACCCAGCCCAGCCCAGCCCAGCCCAGCCCCGCCCCA",
        "label": "non-promoter"
    },
    {
        "seq": "CAGAATAGAATCGAATCGAATCGCATCGCATCGCAACGCAAGGCAAGACAAGAAAAGAATAGAATCGAATCAAATCATATCATGTCATGCCATGCAATGCAGTGCAGAGCAGAGCAGAGCAGAGCGGAGCGAAGCGACGCGACCCGACCTGACCTGACCTGACCTGATCTGATTTGATTTGATTTAATTTACTTTACGTTACGCTACGCTACGCTTCGCTTCGCTTCACTTCACTTCACCTCACCTCACCTAACCTAGCCTAGACTAGATTAGATTAGATTGGATTGAATTGACTTGACTTGACTTGACTTTACTTTTCTTTTTTTTTTATTTTATTTTATTTTATTCTATTCTATTCTGTTCTGCTCTGCACTGCATTGCATCGCATCGCATCGTATCGTTTCGTTGCGTTGTGTTGTGTTGTGTTGTGTTGTGTTCTGTTCTGTTCTTTTCTTCTCTTCCCTTCCCTTCCCCTCCCCCCCCCCACCCCACCCCACTCCACTTCACTTCACTTCCCTTCCTTTCCTCTCCTCTCCTCTTCTCTTCTCTTCTCTTCTTTTCTTGTCTTGCCTTGCT",
        "label": "non-promoter"
    }
    ],
    "epochs": 1
}
})
headers = {
'Content-Type': 'application/json',
'Authorization': 'Token {}'.format(os.environ['BIOLMAI_TOKEN'])
}

response = requests.request("POST", url, headers=headers, data=payload)

print(response.text)
library(RCurl)
headers = c(
"Content-Type" = "application/json",
'Authorization' = paste('Token', Sys.getenv('BIOLMAI_TOKEN'))
)
params = "{
\"pipeline\": \"finetune_DNABERT_classifier\",
\"hyperopt\": false,
\"input_json\": {
    \"max_train\": 40000,
    \"max_validate\": 20000,
    \"train\": [
    {
        \"seq\": \"CACAGCACAGCCCAGCCAAGCCAGGCCAGCCCAGCCCAGCCAAGCCACGCCACTCCACTACACTAGACTAGGCTAGGCTAGGCCAGGCCCGGCCCTGCCCTGCCCTGTCCTGTCCTGTCCTGTCCTGTCCTGTCCTGCCCTGCACTGCAGTGCAGCGCAGCCCAGCCCAGCCCCGCCCCCCCCCCTCCCCTGCCCTGTCCTGTACTGTAGTGTAGGGTAGGGTAGGGGAGGGGTGGGGTCGGGTCTGGTCTGGTCTGGTCTGGACTGGAATGGAACGGAACAGAACAGAACAGCACAGCCCAGCCAAGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTGGAGTGGAGTGGTGTGGTTTGGTTTGGTTTAGTTTAATTTAAGTTAAGATAAGAGAAGAGGAGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGGAGGGGAGGGGAGGGGAGTGGAGTCGAGTCGAGTCGCGTCGCCTCGCCTCGCCTTGCCTTGCCTTGCCTTGCCTTGCCCTGCCCTGCCCTGCCCTGTCCTGTGCTGTGCTGTGCCGTGCCATGCCACGCCACACCACAC\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"CTAATCTAATCTAATCTAATCTAGTCTAGTCTAGTATAGTAAAGTAATGTAATGTAATGCAATGCCATGCCGTGCCGCGCCGCGCCGCGTCGCGTTGCGTTGCGTTGGGTTGGTTTGGTGTGGTGGGGTGGAGTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACATGACATGACATGACATGACATGACATGACATGACATAACATACCATACCATACCTTACCTCACCTCACCTCAACTCAAATCAAACCAAACAAAACAGAACAGCACAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCCGCGCCACGCCATGCCATGCCATGCCATGCGATGCGCTGCGCCGCGCCACGCCAAGCCAAGCCAAGCCAAGCCAAGCCCAGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAGCAGAGGAGAGGGGAGGGTAGGGTTGGGTTGGGTTGTGTTGTCTTGTCCTGTCCAGTCCAATCCAACCCAACTCAACTCAACTCCACTCCTCTCCTATCCTATCCTATTCTATTCTATTCCATTCCT\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"GGAAGAGAAGAGAAGAGGAGAGGGGAGGGAAGGGAAGGGAAGGGAAGGGAAGGAAAGGAAAGGAAAGGAAATGAAATGAAATGCAATGCCATGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTCGGGTCCGGTCCCGTCCCATCCCAGCCCAGGCCAGGCCAGGCGAGGCGGGGCGGGGCGGGGCGGGGCGGGGCCGGGCCTGGCCTCGCCTCGCCTCGACTCGAGTCGAGCCGAGCGGAGCGTAGCGTGGCGTGCCGTGCCGTGCCCTGCCCAGCCCACCCCACGCCACGCCACGCCACGCCGCGCCGCGCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGCCTGGCCTGGCCAGGCCACGCCACGCCACGCCACGCCACGCCTCGCCTGGCCTGGCCTGGACTGGAGTGGAGTGGAGTTGAGTTGAGTTGCGTTGCATTGCAGTGCAGGGCAGGACAGGAAAGGAACGGAACCGAACCGAACCGGACCGGGCCGGGCCGGGCGGGGCGCGGCGCCGCGCCGCGCCGGGCCGGG\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"CGAAAGGAAAGCAAAGCAAAGCAAAGCAATGCAATCCAATCAAATCAGATCAGTTCAGTGCAGTGGAGTGGCGTGGCCTGGCCTGGCCTGGCCTGGCCTGGACTGGACTGGACCGGACCAGACCATACCATGCCATGTCATGTGATGTGTTGTGTAGTGTAGTGTAGTGTAGTATAGTATAGTATAGTATAGTATAGAATAGAGTAGAGAAGAGAGGAGAGCAGAGCAGAGCAAAGCAACGCAACACAACAGAACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCATGCCATCCCATCTCATCTAATCTATTCTATGCTATGCTATGCTATGCTTTGCTTAGCTTAACTTAATTTAATTTAATTTAATTTGATTTGGTTTGGCTTGGCATGGCAAGGCAACGCAACACAACATAACATTACATTACATTACATTACATTACATTACATGACATGTCATGTAATGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCGTCCCGGCCCGGACCGGAACGGAAAGGAAAAGAAAATAAAATCAAATCTAATCTTATCTTTTCTTTTCTTTTATTTTAA\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"TGACTCGACTCCACTCCCCTCCCATCCCAACCCAAACCAAACCAAACCAAACCAAACCAAACCAACCCAACACAACAAAACAAAACAAAACAAAAGAAAAGGAAAGGGAAGGGGAGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGTGGAGTTGAGTTCAGTTCAGTTCATTTCATCTCATCACATCACATCACCTCACCACACCACACCACTCCACTACACTAGACTAGACTAGACTAGACTAGACTTGACTTTACTTTCCTTTCCTTTCCTTTCCTTTCCTTACCTTATCTTATATTATAATATAAAATAAAATAAAAAAAAAAAAAAAACAAAACAAAACACAACACTACACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGAAGGGAGGGGAGTGGAGTGGAGTGCAGTGCTGTGCTTTGCTTAGCTTAACTTAAGTTAAGCTAAGCAAAGCAGAGCAGAGCAGAACAGAAAAGAAAGGAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAAAAAAATAAAATAAAATACAATACTATACTATACTAA\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"AAGCATAGCATGGCATGACATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAAGGAAAGAAAAGACAAGACTAGACTGGACTGGACTGGGCTGGGCTGGGCTGGGCTAGGCTAGGCTAGGCTAGGCTAGGCAAGGCACGGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACATCACACCACACCACACCTCACCTGACCTGTCCTGTACTGTAATGTAATGTAATCTAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACAAGACAGGACAGGACAGGCCAGGCAAGGCAGGGCAGAGCAGATCAGATGAGATGAGATGACATGACCTGACCTGACCTGACCTGACCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTG\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GGCTTTGCTTTGCTTTGTTTTGTTTTGTTTTGTTTTGTTTTCTTTTCTTTTCTGTTCTGTTCTGTGCTGTGATGTGAGGTGAGTTGAGTTGAGTTAAGTTACGTTACGTTACGGTACGGGACGGGGCGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCATGCCACGCCACCCCACCTCACCTGACCTGCCCTGCACTGCAGTGCAGGGCAGGTCAGGTAAGGTAAGGTAAAGTAAAATAAAATAAAATCAAATCTAATCTGATCTGGTCTGGACTGGACTGGACAGGACATGACATTACATTGCATTGCATTGCCTTGCCCTGCCCTGCCCTGCCCTGACCTGAACTGAAATGAAATGAAATTAAATTGAATTGAATTGACTTGACCTGACCGGACCGAACCGAACCGAACCGAACCGAACCTAACCTTACCTTGCCTTGGCTTGGATTGGATTGGATAGGATACGATACAATACAATACAAAACAAACCAAACCAAACCCAACCCGACCCGGCCCGGCCCGGCCCGGCCTGGCCTGGCCTGACCTGACCTGACATGACAGGACAGTACAGTG\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"TCACCGCACCGTACCGTTCCGTTACGTTACGTTACTTTACTGTACTGCACTGCCCTGCCTTGCCTCGCCTCCCCTCCTCTCCTATCCTAGCCTAGTCTAGTGTAGTGGAGTGGCGTGGCGTGGCGGGGCGGAGCGGATCGGATAGGATACGATACGATACGGTACGGCACGGCGCGGCGGGGCGGCGCGGCACGGCAAGGCAATGCAATACAATAGAATAGTATAGTGTAGTGGAGTGGCGTGGCGTGGCGCGGCGCAGCGCACCGCACAGCACATCACATTACATTCCATTCAATTCAATTCAAGTCAAGGCAAGGCAAGGCAAGGCAGGGCAGGGCAGGACAGGAAAGGAAGGGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGAGAAGAGGAGAGGAGAGGAAAGGAACGGAACAGAACAGAACAGAACAGAGCAGAGCAGAGCCGAGCCAAGCCACGCCACCCCACCACACCAGACCAGCCCAGCACAGCAGAGCAGGGCAGGTCAGGTTAGGTTTGGTTTGGTTTGGTTTGGCTTGGCCTGGCCCGGCCCAGCCCAGCCCAGTCCAGTG\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"AGAAAAGAAAACAAAACAAAACAAAACAAAACAAAACAAAAGAAAAGCAAAGCTAAGCTCAGCTCCGCTCCGCTCCGGTCCGGACCGGAGCGGAGTGGAGTAGAGTAGAGTAGGGTAGGATAGGAAAGGAAAGGAAAGGAAAGTAAAGTGAAGTGAAGTGACGTGACATGACACGACACAACACAGCACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCACGCCACCCCACCTCACCTCACCTCCCCTCCCCTCCCGTCCCGGCCCGGTCCGGTACGGTAGGGTAGCGTAGCCTAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCGGGCCGGGCCGGCCCGGCCCGGCCAGGCCAAGCCAAGCCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCTTGGCTTGGCTTAGCTTAACTTAAGTTAAGCTAAGCGAAGCGGAGCGGGGCGGGCCGGGCCGGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCCTGGCCTGGCCTGGCCTGCCCTGCCCTGCCATGCCAAGCCAAACCAAAA\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"AAGTAGAGTAGAGTAGAGTAGAGGAGAGGCGAGGCCAGGCCTGGCCTCGCCTCCCCTCCTCTCCTGTCCTGCCCTGCTCTGCTTTGCTTCGCTTCACTTCAGTTCAGGTCAGGGCAGGGAAGGGAAGGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGAGTAGAGCAGAGCCGAGCCGAGCCGGGCCGGTCCGGTGCGGTGTGGTGTCGTGTCTTGTCTCGTCTCGTCTCGCCTCGCATCGCACCGCACCGCACCACACCAGACCAGACCAGAGCAGAGCAGAGCCGAGCCCAGCCCCGCCCCACCCCAGCCCAGACCAGATCAGATGAGATGGGATGGAATGGAATGGAACGGAACTGAACTCAACTCTACTCTGCTCTGTTCTGTCCTGTCCTGTCCCGTCCCATCCCATCCCATTCCATTCCATTCAATTCACTTCACATCACATCACATTACATTACATTAAATTAATTTAATTTAATTGAATTGAATTGAATTGAATTGAATCGAATCCAATCCAATCCAGTCCAGTCCAGTACAGTACAGTACTGTACTTTACTTTACTTTGCTTTGA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GGTGCAGTGCAATGCAAGGCAAGGCAAGGAAAGGAAAGGAATGGAATGGAATGAAATGAAATGAAGTGAAGCGAAGCCAAGCCAAGCCAAGCCAATCCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAACTCAATTCAATCCAATCAAATCATATCATCTCATCGCATCGAATCGAGTCGAGGCGAGGCGAGGCTAGGCTAGGCTACGCTACCCTACCCTACCCTACCCTGCCCTGCCCTGCCCTGCCATGCCATGCCATCCCATCTCATCTTATCTTGTCTTGTCTTGTGTTGTGGTGTGGCGTGGCCTGGCCAGGCCATGCCATGCCATGTCATGTGATGTGATGTGAGGTGAGGTGAGGGGAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACGACACGTCACGTGACGTGTCGTGTAGTGTAGTGTAGAGTAGATTAGATCAGATCAGATCAAATCAATTCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAGCTCAGCTCAGCACAGCATAGCATCGCATCACATCACATCACA\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"AGAGACGAGACTAGACTGGACTGGACTGGCCTGGCATGGCAAGGCAAGGCAAGGCAAGGAAAGGACAGGACAGGACAGGACAGGACAGGCCAGGCTAGGCTCGGCTCGGCTCGCCTCGCCTCGCCCCGCCCTGCCCTTCCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTAGTGTAGAGTAGAGTAGAGCAGAGCCGAGCCTAGCCTCGCCTCGCCTCGCCTCGCATCGCATCGCATTGCATTGCATTGGATTGGCTTGGCCTGGCCAGGCCACGCCACCCCACCACACCAGACCAGGCCAGGACAGGAGAGGAGGGGAGGCGAGGCAAGGCAGGGCAGTGCAGTGCAGTGTAGTGTTGTGTTGTGTTGTGTTGTCTTGTCTTGTCTGGTCTGCTCTGCCCTGCCTTGCCTCGCCTCTCCTCTCCTCTCGTCTCGACTCGAATCGAACCGAACTGAACTTAACTTGACTTGGCTTGGCTTGGCTTGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAACCCAAGCCAAGGCAAGGTAAGGTGAGGTGAGGTGAGGTGAGATGAGAAGAGAAG\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"TGGCGAGGCGACGCGACCCGACCCGACCCCACCCCACCCCAACCCAACCCAACCCAACCTAACCTGACCTGCCCTGCCCTGCCCTGCCCTGCCCTTCCCTTGCCTTGCCTTGCTTTGCTTTGCTTCGCTTCGCTTCGGTTCGGATCGGACCGGACAGGACACGACACTACACTGCACTGCACTGCACTGCAGTGCAGCGCAGCACAGCACAGCACCGCACCCCACCCAACCCAACCCAATCCAATGCAATGGAATGGCATGGCGTGGCGCGGCGCCGCGCCCCGCCCAGCCCAGCCCAGACCAGAACAGAACAGAACCGAACCCAACCCGACCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCCCTGCCGTGCCGCGCCGCGCCGCGGCGCGGGGCGGGCCGGGCAGGGCAGGGCAGTGCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCCAGCCCGGCCCGGCCCGGGCCGGGACGGGATGGGATAGGATAGGATAGCATAGCGTAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCACCCCAA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"CTGTGTTGTGTAGTGTATTGTATAGTATATTATATCATATCTTATCTGATCTGTTCTGTACTGTAATGTAAAGTAAAGTAAAGTAAAGTTAAGTTAAGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGTCCAGTCCAGTCAAGTCAAGTCAACTCAACGCAACGCAACGCTACGCTACGCTAGGCTAGGCTAGGGTAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACTACACTCCACTCTACTCTGCTCTGCTCTGCACTGCAATGCAACGCAACACAACACAACACTACACTCCACTCTACTCTACTCTAGTCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCCTGGCCTGGCCTTGCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTATTGTATAGTATATTATATAATATATTATATGATATGGTATGGCATGGCATGGCAGGGCAGAGCAGAACAGAAAAGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAAAAGAAAGGAAAGTAAAGTAAAGTAAAGTAAAGTAAAT\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"CTATATTATATTATATTTTATTTGATTTGGTTTGGATTGGACTGGACAGGACAAGACAATACAATCCAATCGAATCGCATCGCCTCGCCGCGCCGTGCCGTGCCGTGACGTGATGTGATTTGATTAGATTAAATTAAATTAAACTAAACGAAACGAAACGAGACGAGTCGAGTGGAGTGTAGTGTAGTGTATTGTATGGTATGATATGAAATGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGTGGCGTCGCGTCTCGTCTGGTCTGATCTGAACTGAAGTGAAGCGAAGCTAAGCTAAGCTAGGCTAGGCTAGGGTAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTACGCTAGGCTAGACTAGATTAGATAAGATAAGATAAAATAAACTAAACAAAACACAACACTACACTGCACTGAACTGATCTGATTTGATTTGATTTCATTTCCTTTCCCTTCCCCTCCCCTCCCCTTCCCTTTCCTTTACTTTAGTTTAGGTTAGGGTAGGGAAGGGAAGGGAAAGGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAATAGAATG\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"AACTGCACTGCACTGCAGTGCAGGGCAGGACAGGATAGGATGGGATGCGATGCTATGCTCTGCTCTGCTCTTCTCTTGTCTTGGCTTGGATTGGAGTGGAGTGGAGTTGAGTTCAGTTCTGTTCTGTTCTGGTCTGGTCTGGTCTGGTCTGGTCTAGTCTACTCTACTCTACTCTACTCTACTCTGCTCTGCTCTGCGCTGCGATGCGATGCGATGCGATGCGATGCTATGCTTTGCTTGGCTTGTCTTGTTTTGTTTTGTTTGGTTTGCTTTGCATTGCAATGCAAAGCAAAACAAAACAAAACCAAACCCAACCCTACCCTGCCCTGTCCTGTCCTGTCATGTCATGTCATGTCATGACATGAGATGAGATGAGAAGAGAAGAGAAGGGAAGGTAAGGTCAGGTCCGGTCCAGTCCACTCCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGCATGGCATGGCAGGGCAGGGCAGGTCAGGTTAGGTTTGGTTTCGTTTCATTTCAGTTCAGCTCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCTGTGCTATGCTAC\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"GCTTTCCTTTCCTTTCCTTTCCTGTCCTGGCCTGGCCTGGCCTGGCCCGGCCCCGCCCCCCCCCCACCCCAACCCAAGCCAAGACAAGAGAAGAGTAGAGTGGAGTGCAGTGCAGTGCAGTGCAGGGCAGGGCAGGGAAGGGATGGGATGGGATGCGATGCCATGCCCTGCCCAGCCCAGCCCAGGCCAGGTCAGGTCAGGTCTGGTCTGGTCTGCTCTGCACTGCAATGCAACGCAACCCAACCAAACCACACCACCCCACCACACCACACCACTCCACTGCACTGGACTGGGCTGGGTTGGGTGGGGTGGGGTGGCGTGGCTTGGCTGGGCTGCGCTGCACTGCAGTGCAGCGCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCCGTGCCCTGCCCAGCCCAGCCCAGGCCAGGACAGGAGAGGAGGGGAGGGGAGGGTAGGGTGGGGTGGGGTGGGGTGGGATGGGACGGGACTGGACTCGACTCCACTCCTCTCCTGTCCTGCCCTGCCCTGCCCTGCCCCGCCCCACCCCACCCCACCCCACCACACCAAACCAACCCAACTCAACTC\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"TGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCCGTGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTG\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGAAGGGAAGGGAAGTGAAGTGAAGTGCAGTGCAGTGCATTGCATCGCATCCCATCCCATCCCTTCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAAAGAAACGAAACAAAACATAACATGACATGGCATGGGATGGGTTGGGTAGGGTAAGGTAAGGTAAGCTAAGCGAAGCGTAGCGTGGCGTGTCGTGTGGTGTGCTGTGCAGTGCAGTGCAGAGCAGACCAGACGAGACGTGACGTGACGTGGCGTGGAGTGGAGTGGAGAGGAGAGGAGAGGAGAGGGGAGGGCAGGGCGGGGCGTGGCGTGGCGTGGCGTGGGGTGGGGTGGGGTGGGGTGGGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGACGGGACGGGACGCGACGCCACGCCCCGCCCAGCCCACCCCACCCCACCCCACCCCACCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"AAGTTTAGTTTTGTTTTTTTTTTCTTTTCCTTTCCATTCCACTCCACCCCACCTCACCTGACCTGCCCTGCCCTGCCATGCCACGCCACTCCACTTCACTTCACTTCACTTCACTTCACATCACAACACAATACAATGCAATGAAATGACATGACCTGACCCGACCCTACCCTCCCCTCCCCTCCACTCCAGTCCAGCCCAGCGCAGCGCAGCGCCGCGCCCCGCCCTGCCCTCCCCTCTCCTCTACTCTACTCTACTCTACTGTACTGGACTGGCCTGGCATGGCAGGGCAGAGCAGAGCAGAGAAGAGACGAGACTAGACTAGACTAGACTAGCCTAGCATAGCATAGCATCGCATCACATCAAATCAAGTCAAGCCAAGCCAAGCCAAGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGCCTGGCATGGCAAGGCAAAGCAAACCAAACCAAACCAAACCAGACCAGACCAGAGCAGAGGAGAGGCGAGGCGAGGCGTGGCGTCGCGTCCCGTCCTGTCCTTTCCTTTCCTTTACTTTAATTTAAGTTAAGGTAAGGTAAGGTCAGGTCC\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"TTTTTTTTTTTTTTTTTGTTTTGCTTTGCGTTGCGGTGCGGGGCGGGGCGGGGCGGGGCGGGGCGCGGCGCAGCGCAGCGCAGTGCAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGTCCTGTACTGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACGAGACGGGACGGGACGGGCCGGGCAGGGCAGGGCAGAGCAGATCAGATCAGATCAGATCACATCACGTCACGACACGAGACGAGGCGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGAGGAGATGAGATCAGATCGGATCGAATCGAGTCGAGACGAGACGAGACTAGACTAGACTATACTATCCTATCCTATCCTATCCTGTCCTGGCCTGGCCTGGCTTGGCTAGGCTAA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"CCGCCTCGCCTTGCCTTCCCTTCCCTTCCCTTCCCTTCCCTCCCCTCTCCTCTGCTCTGTTCTGTTCTGTTTTGTTTTGTTTTTTTTTTGTTTTGGTTTGGCTTGGCATGGCATGGCATAGCATAACATAAGATAAGATAAGAAAAGAAAAGAAACGAAACAAAACAAAACAATACAATTCAATTCAATTCAATTCAGTTCAGGTCAGGTCAGGTTAGGTTTGGTTTAGTTTATTTTATCTTATCATATCAAATCAAGTCAAGGCAAGGAAAGGAGAGGAGAGGAGAGGAGAGTAGAGTCGAGTCCAGTCCAGTCCAGTCCAGGCCAGGGCAGGGTAGGGTCGGGTCAGGTCAGGTCAGATCAGAACAGAATAGAATTGAATTTAATTTTATTTTTTTTTTCTTTTCTTTTCTATTCTAATCTAACCTAACCTAACCAAACCACACCACCCCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCAGCGCAGCGCAGAGCAGAGCAGAGCAGAGCAGAGCAAAGCAAGGCAAGCCAAGCTAAGCTTAGCTTA\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"ACAAAACAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAGAAAAGCAAAGCGAAGCGGAGCGGGGCGGGACGGGAAGGGAAGGGAAGCGAAGCAAAGCAGAGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGGCTTGGGTTGGGTTGGGTTGGGTTAGGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGCCCAGCACAGCACAGCACAGCACACCACACCACACCCCACCCAACCCACCCCACCCCACCACACCAGACCAGACCAGAGCAGAGGAGAGGGGAGGGCAGGGCAGGGCAGGGCAGCGCAGCACAGCAGAGCAGAGCAGACCAGACAAGACACGACACTACACTGCACTGGACTGGCCTGGCTTGGCTAGGCTAAGCTAAACTAAAGTAAAGCAAAGCTAAGCTCAGCTCTGCTCTTCTCTTATCTTAGCTTAGTTTAGTCTAGTCAAGTCATGTCATATCATAACATAAGATAAGTTAAGTCAAGTCCAGTCCTGTCCTGTCCTGACCTGAGCTGAGTTGAGTGGAGTGCAGTGCT\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"ACTGGACTGGAATGGAAAGGAAAAGAAAATAAAATTAAATTTAATTTTATTTTATTTTAATTTAAATTAAATTAAATGAAATGAAATGAAATGAATTGAATGGAATGAAATGATATGATGTGATGTGATGTGATGTGATGTGATGTGATTTGATTCGATTCTATTCTGTTCTGTTCTGTGCTGTGGTGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCTCTCCTGTCCTGTCCTGTGCTGTGGTGTGGGGTGGGCTGGGCAGGGCAGGGCAGCGCAGCACAGCACAGCACTGCACTGCACTGGACTGGCCTGGCCTGGCCTGGCCTGGCCTGACCTGAACTGAAGTGAAGCGAAGCAAAGCACAGCACAGCACAACACAAAACAAACCAAACCAAACCTAACCTGACCTGGCCTGGACTGGAGTGGAGCGGAGCCGAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGCCCAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAG\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"AGAGCTGAGCTGAGCTGTGCTGTCCTGTCTTGTCTGGTCTGCTCTGCTCTGCTGTGCTGGGCTGGGCTGGGGTGGGGGGGGGGCGGGGCAGGGCAGGGCAGGGCAGGGCAGGGCAGGGCGGGGCGCGGCGCTGCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTGTTCTGGTCTGGGCTGGGCTGGGCAGGGCACGGCACTGCACTGCACTGTACTGTACTGTAGTGTAGGGTAGGATAGGATAGGATGGGATGTGATGTTATGTTATGTTAGGTTAGCTTAGCATAGCAGAGCAGCGCAGCGCAGCGAAGCGACGCGACCCGACCCGACCCTACCCTGCCCTGGCCTGGCCTGGCCTGGCCTGGCCTCGCCTCTCCTCTACTCTACTCTACCCTACCATACCACACCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGCGATGCCATGCCATGCCAGGCCAGTCCAGTACAGTAGAGTAGCGTAGCATAGCACAGCACCGCACCCCACCCTACCCTCCCCTCCCCTCCTCTCCTCTCCTCTCCTCTCCTCTCCTCTCCACTCCAG\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"GCTTTGCTTTGTTTTGTTTTGTTATGTTACGTTACATTACAGTACAGGACAGGTCAGGTGAGGTGTGGTGTCGTGTCTTGTCTGGTCTGTTCTGTTCTGTTATGTTAAGTTAACTTAACATAACATAACATTACATTCCATTCCATTCCATTCCATTCCATGCCATGGCATGGAATGGACTGGACCGGACCAGACCAAACCAAACCAAAACAAAACAAAACAAAACAAAACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAAGCCAAACCAAACCAAACCAAACCCAACCCAACCCAACCCAAACCAAAACAAAATAAAATCAAATCAAATCAAATCAAGTCAAGGCAAGGGAAGGGAAGGGACGGGACAGGACAGGACAGGACAGGACAGGAAAGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGACTAGACTAGACTCGACTCCACTCCACTCCACTCCACCCCACCCCACCCAACCCATCCCATGCCATGCCATGCAATGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGAGTGGAATGGAAGGGAAGGGAAGGGAAGGGGAGGGGA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GGTGATGTGATGTGATGCGATGCTATGCTATGCTACGCTACACTACAGTACAGGACAGGGCAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCTGCCCTCCCCTCACCTCATCTCATATCATAGCATAGGATAGGATAGGACAGGACAGGACAGGACAGGACAGGTCAGGTGAGGTGCGGTGCTGTGCTCTGCTCAGCTCACCTCACCTCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGAGCGGAGCGGAGGGGAGGAGAGGACAGGACAGGACAAGACAACACAACCCAACCCAACCCGACCCGTCCCGTCCCGTCCCGTCCGGTCCGGTCCGGGCCGGGCCGGGCTGGGCTGGGCTGGGCTGGACTGGAGTGGAGCGGAGCAGAGCAGAGCAGGGCAGGTCAGGTCAGGTCAGGTCAAGTCAAGTCAAGACAAGAGAAGAGGAGAGGCGAGGCTAGGCTCGGCTCTGCTCTGCTCTGGTCTGGGCTGGGATGGGAGGGGAGAGGAGACGAGACAAGACACGACACTACACTTCACTTCACTTCC\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GGGGCAGGGCAGGGCAGGGCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCCAAGCCCAGCCCAGCCCATCCCATCCCATCTCATCTAATCTACTCTACACTACAATACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAGGCCAGTCCAGTGCAGTGGAGTGGCGTGGCTTGGCTTGGCTTTGCTTTTCTTTTCTTTTCCTTTCCCTTCCCCTCCCCCCCCCCACCCCAACCCAACCCAACCCAACCCAACCCAACCCAGCCCAGTCCAGTCCAGTCCAGTCCTGTCCTTTCCTTCCCTTCCCTTCCCTTCCCATCCCAACCCAAACCAAATCAAATTAAATTCAATTCCATTCCCTTCCCATCCCACCCCACACCACAGCACAGCACAGCCCAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGACCAGATCAGATCAGATCCGATCCCATCCCTTCCCTGCCCTGCCCTGCACTGCAATGCAACGCAACCCAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGCCCCGCTCCGCTT\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"ATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTCGTTTCCTTTCCGTTCCGTTCCGTGCCGTGGCGTGGGGTGGGATGGGAGGGGAGAGGAGAGGAGAGGAGAGGTGAGGTAAGGTAAGGTAACGTAACATAACACAACACAACACAACACAATACAATACAATAGAATAGCATAGCTTAGCTTAGCTTGGCTTGTCTTGTATTGTATTGTATCGTATCATATCAGATCAGTTCAGTCCAGTCAAGTCATGTCATTTCATTACATTACATTACCTTACCATACCACACCACTCCACTTCACTTGACTTGACTTGAGTTGAGTTGAGTGGAGTGTAGTGTGGTGTGATGTGAAGTGAAGTGAAGCGAAGCAAAGCAGAGCAGTGCAGTTCAGTTAAGTTAGGTTAGTTTAGTCTAGTCAAGTCAAGTCAAATCAAAGCAAAGTAAAGTCAAGTCTAGTCTGGTCTGGTCTGGGCTGGGATGGGAGGGGAGTGGAGTGGAGTGAAGTGAAGTGAATTGAATGGAATGAAATGAGATGAGATGAGAGGAGAGTAGAGTAGAGTAGAGTAGAGTAGAA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"AACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAAAAAAAAAAAAAACAAAACAAAACACAACACAACACAGCACAGCACAGCACAGCAAAGCAAAGCAAACCAAACCAAACCTAACCTGACCTGTCCTGTACTGTATTGTATGGTATGTTATGTTATGTTGTGTTGTGTTGTCTTGTCCTGTCCCGTCCCTTCCCTTCCCTTCCCTTCCCTTCCATTCCAGTCCAGGCCAGGTCAGGTCAGGTCCGGTCCCGTCCCCTCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGGCTGGGCTGGGCAGGGCATGGCATTGCATTTCATTTGATTTGCTTTGCATTGCAGTGCAGAGCAGAACAGAACAGAACCGAACCGAACCGCACCGCACCGCAGCGCAGCGCAGCACAGCATAGCATCGCATCCCATCCCATCCCATCCCAGCCCAGACCAGATCAGATCAGATCAGATCACATCACTTCACTCCACTCGACTCGTCTCGTTTCGTTACGTTAAGTTAAATTAAAATAAAAAAAAAAAAAAAATAAAATT\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"TCCTGACCTGATCTGATATGATAAGATAAAATAAACTAAACCAAACCCAACCCAACCCATCCCATGCCATGGCATGGGATGGGATGGGATGGGATCGGATCTGATCTCATCTCATCTCATCTCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATTGAATTAAATTATATTATTTTATTCTATTCAATTCATTTCATTTCATTACATTATATTATCTTATCATATCATATCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATAGAATAGAATAGTATAGTATAGTATAGTATGGTATGGTATGGGATGGGATGGGAAGGGAAAGGAAAGGAAAGAAAAGACAAGACCAGACCAGACCAGACCAGTCCAGTCCAGTCCAGTCCCGTCCCCTCCCCACCCCATCCCATGCCATGACATGATATGATTTGATTCGATTCAATTCAATTCAATTCAATTCAATTAAATTACATTACCTTACCTTACCTCACCTCCCCTCCCCTCCCCTCCCCCCCCCCTCCCCTGCCCTGGCCTGGGCTGGGTTGGGTCGGGTCCGGTCCCGTCCCT\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GTCATCTCATCGCATCGTATCGTATCGTAGCGTAGTGTAGTATAGTACAGTACTGTACTATACTACACTACACTACATTACATTACATTTCATTTTATTTTATTTTAATTTAAATTAAACTAAACAAAACATAACATGACATGTCATGTAATGTAATGTAAAGTAAAGTAAAGAAAAGAGAAGAGCAGAGCTGAGCTCAGCTCAGCTCAGCTCAGTTCAGTGCAGTGGAGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCACTCCAATCCAAGCCAAGACAAGAAAAGAAGAGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGACAAGACTAGACTTGACTTTACTTTGCTTTGGTTTGGTTTGGTATGGTAGGGTAGAGTAGAGTAGAGAAGAGACGAGACGAGACGGGACGGCACGGCCCGGCCGGGCCGCGCCGCTCCGCTTCGCTTGGCTTGCCTTGCTTTGCTCTGCTCCGCTCCCCTCCCATCCCAACCCAAACCAAATCAAATAAAATATAATATCATATCATATCATATCATGTCATGCCATGCTATGCTGTGCTGA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"ACTGCGCTGCGCTGCGCGGCGCGCCGCGCCGCGCCGCGCCGAGCCGACCCGACGCGACGGGACGGTACGGTGCGGTGGGGTGGGGTGGGCTGGGCTGGGCTGGGCTGGGCTGGCCTGGCGTGGCGGGGCGGGGCGGGACGGGACGGGACCGGACCAGACCAGACCAGGCCAGGACAGGACAGGACAGGACAGGACAGGACAGGACAGGAAAGGAACGGAACAGAACAAAACAATACAATGCAATGGAATGGGATGGGATGGGATGGGATTGGATTCGATTCCATTCCGTTCCGATCCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGCGCCGCACCGCAACGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTCCGCTCCGCTCCTCTCCTTTCCTTCCCTTCTCTTCTGTTCTGCTCTGCGCTGCGGTGCGGGGCGGGTCGGGTTGGGTTGGGTTGGGTTGGGTTGGGGTGGGGTGGGGTGGGGTGCGGTGCGGTGCGATGCGAGGCGAGGCGAGGCGAGGCCAGGCCGGGCCGGGCCGGA\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"TGTGCTGTGCTGTGCTGAGCTGATCTGATGTGATGCGATGCCATGCCTTGCCTGGCCTGTCCTGTGCTGTGGTGTGGTGTGGTTTGGTTTGGTTTGGTTTGGTTTGGTTTGGTGTGGTGGGGTGGGGTGGGGTGGGGCGGGGCTGGGCTAGGCTACGCTACACTACAATACAACACAACACAACAGAACAGGACAGGACAGGAAAGGAAAGGAAATGAAATTAAATTCAATTCCATTCCTTTCCTGTCCTGCCCTGCTCTGCTTTGCTTTGCTTTGCTTTGGTTTGGATTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACAGGACAGAACAGAACAGAAAAGAAAGGAAAGCAAAGCAAAGCAGAGCAGAGCAGATCAGATAAGATAGGATAGCATAGCCTAGCCAAGCCAAGCCAAACCAAATCAAATTAAATTCAATTCTATTCTCTTCTCTTCTCTCCTCTCTTCTCTACTCTACTCTACCCTACCATACCACACCACACCACATCACATTACATTTCATTTTATTTTGTTTTGGTTTGGATTGGAATGGAAAGGAAACGAAACT\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GGTTCCGTTCCCTTCCCGTCCCGCCCCGCTCCGCTTCGCTTCGCTTCCCTTCCATTCCACTCCACCCCACCGCACCGAACCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGAGCCGACCCGACTCGACTGGACTGCACTGCGCTGCGATGCGAGGCGAGGCGAGGTGAGGTGAGGTGCGGTGCAGTGCATTGCATGGCATGCCATGCTATGCTGTGCTGGGCTGGGCTGGGATGGGAGGGGAGTGGAGTCGAGTCGAGTCGTGTCGTATCGTAGCGTAGTGTAGTATAGTACAGTACCGTACCGTACCGCACCGCACCGCACCGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCGGGGCGGGGCGGAGCGGAACGGAACGGAACAGAACAGAACAGCACAGCTCAGCTCAGCTCCGCTCCGCTCCGCTCCGCCCCGCCCCGCCCCGCCCCGCCCCGGCCCGGCCCGGCGCGGCGGGGCGGAGCGGATCGGATGGGATGGGATGGTATGGTGTGGTGTGGTGTTGTGTTTTGTTTCGTTTCCTTTCCATTCCAGTCCAGA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GCCCGGCCCGGGCCGGGACGGGAGGGGAGCGGAGCGGAGCGTAGCGTCGCGTCGCGTCGCGTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCCCTCCCCTCCCCCCCCCCGCCCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCCCCGCCCCGCCCCGCCCCGCCTCGCCTCGCCTCGCCTCGGCTCGGGTCGGGGCGGGGAGGGGACGGGACTGGACTCGACTCGACTCGTCTCGTCTCGTCCCGTCCCGTCCCTTCCCTCCCCTCCCCTCCACTCCACTCCACACCACAGCACAGCACAGCCCAGCCCAGCCCCGCCCCTCCCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGTCCCGTCCCGTCGCGTCGGGTCGGATCGGAACGGAATGGAATTGAATTCAATTCGATTCGCTTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCGCTCCGCTCCGCCCCGCCGCGCCGTGCCGTTCCGTTCCGTTCTGTTCTT\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"CTGGCTTGGCTGGGCTGCGCTGCTCTGCTCTGCTCCGCTCCTCTCCTTTCCTTACCTTACCTTACATTACAATACAAAACAAACCAAACCAAACCTAACCTGACCTGTCCTGTGCTGTGGTGTGGAGTGGAGTGGAGTGGAGTTGAGTTGAGTTGGGTTGGATTGGACTGGACTGGACTTGACTTGACTTGCCTTGCTTTGCTGTGCTGTGCTGTTCTGTTTTGTTTTGTTTTTTTTTTCTTTTCCTTTCCTTTCCTCTCCTCTCCTCTTCTCTTGTCTTGCCTTGCCTTGCCATGCCACGCCACTCCACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGTAGGGTAGGGTAGGGTAGAGTAGAATAGAATAGAATAGAATATAATATGATATGATATGAAATGAAATGAAAAGAAAAGAAAAGAAAAGAAAAGAAGAGAAGAGAAGATAAGATTAGATTAGATTAGATTAGCTTAGCATAGCATAGCATGGCATGTCATGTTATGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATCTCATCCCATCCTATCCTTTCCTTGCCTTGC\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"CCCTGCCCTGCTCTGCTATGCTACGCTACACTACAGTACAGTACAGTTCAGTTTAGTTTTGTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTGTTTTGATTTGATTTGATATGATATGATATAATATACTATACAATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTTGTTTTGTTTTGCTTTGCCTTGCCATGCCAGGCCAGCCCAGCTCAGCTTAGCTTTGCTTTTCTTTTTTTTTTTTTTTTCTTTTCATTTCACTTCACTTCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGTATGGTGTGGTGCGGTGCTGTGCTATGCTAAGCTAATCTAATTTAATTCAATTCCATTCCCTTCCCTTCCCTTCCCTTTCCTTTACTTTAGTTTAGTTTAGTGTAGTGAAGTGACGTGACCTGACCTGACCTAACCTAGCCTAGGCTAGGCTAGGCTAGGCTGGGCTGTGCTGTTCTGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATATCATACCATACGATACGATACGATACGATGCGATGTGATGTTATGTTTTGTTTA\",
        \"label\": \"promoter\"
    }
    ],
    \"validation\": [
    {
        \"seq\": \"GTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGGGTGGGCTGGGCCGGGCCTGGCCTGGCCTGTCCTGTACTGTAGTGTAGCGTAGCATAGCAGAGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACCGCACCTCACCTGACCTGACCTGAGCTGAGGTGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGCAGGGCTGGGCTGGGCTGGGCTGGCCTGGCATGGCAGGGCAGCGCAGCCCAGCCCAGCCCCGCCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTG\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GTGTGGTGTGGGGTGGGATGGGATGGGATCGGATCAGATCATATCATGTCATGTCATGTAATGTATTGTATCGTATCATATCAGATCAGTTCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGACTTGACTTGACCTGACCTGACCTCACCTCCCCTCCTCTCCTGTCCTGGCCTGGGCTGGGCTGGGCTGGGCTCGGCTCAGCTCAACTCAAGTCAAGCCAAGCAAAGCATAGCATTGCATTCCATTCTATTCTTTTCTTCTCTTCCCTTCCCTTCCCATCCCACCCCACCCCACCTCACCTCACCTCACCTCAACTCAACTCAACCCAACCTAACCTCACCTCTCCTCTTCTCTTGTCTTGACTTGAGTTGAGTTGAGTAGAGTAGAGTAGCGTAGCTTAGCTGAGCTGAGCTGAACTGAAATGAAATGAAATTAAATTAAATTACATTACATTACAGTACAGGACAGGACAGGAAAGGAACGGAACAGAACATAACATGACATGCCATGCCATGCCATGCCACGCCACCCCACCACACCACACCACA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"CCCTGCCCTGCACTGCATTGCATGGCATGCCATGCCATGCCATGCCACGCCACACCACATCACATAACATAGCATAGCATAGCATAGCAAAGCAAGGCAAGGCAAGGTAAGGTGAGGTGCGGTGCTGTGCTGTGCTGGGCTGGGCTGGGTTGGGTCGGGTCAGGTCACGTCACTTCACTGCACTGAACTGATCTGATGTGATGCGATGCTATGCTATGCTAAGCTAACCTAACATAACATAACATCACATCTCATCTAATCTAATCTAAACTAAACTAAACAAAACAGAACAGGACAGGGCAGGGGAGGGGCGGGGCCGGGCCAGGCCAGGCCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGTGCGGTGCGGTGGGGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACTTCACTCCACTCTACTCTTCTCTTGTCTTGACTTGAATTGAAATGAAATGAAATCAAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGC\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"TTGGAGTGGAGCGGAGCAGAGCAAAGCAAGGCAAGGCAAGGCAAGGCTAGGCTAGGCTATGCTATGCTATGCTATGCAATGCACTGCACCGCACCACACCATACCATACCATACCATACAATACATTACATGACATGCCATGCTATGCTCTGCTCTGCTCTGCTCTGATCTGAGCTGAGTTGAGTGGAGTGGAGTGGGGTGGGCTGGGCTGGGCTTGGCTTGGCTTGACTTGATTTGATTTGATTCGATTCCATTCCTTTCCTCTCCTCCCCTCCACTCCAGTCCAGGCCAGGGCAGGGAAGGGAAGGGAAGGGAAGAGAAGAGAAGAGGAGAGGCGAGGCCAGGCCAGGCCAGGCCAGGCCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCAAAGCATAGCATTGCATTGCATTGAATTGATTTGATGTGATGTGATGTGATGTGATGTGAAGTGAAATGAAAAGAAAACAAAACAAAACAGAACAGCACAGCCCAGCCTAGCCTTGCCTTTCCTTTCCTTTCCTTTCCCTTCCCTTCCCTTCCCTTGCCTTGCCTTGCCTTGCCATGCCAT\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"AGCACAGCACAGCACAGGACAGGGCAGGGCAGGGCAGGGCACGGCACTGCACTGCACTGGACTGGTCTGGTGTGGTGGGGTGGAGTGGAGTGGAGGGGAGGGGAGGGAAGGGAGGGGAGCGGAGCCGAGCCCAGCCCTGCCCTGCCCTGCCCTGCGCTGCGGTGCGGGGCGGGGCGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAACTCAAGTCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGATCGGATGGGATGGGATGGGATGGGTTGGGTGGGGTGTGGTGTGGTGTGATGTGAGGTGAGATGAGAGGAGAGGAGAGGCGAGGCAAGGCACGGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCTGGGCTGGGCTGAGCTGAACTGAAGTGAAGCGAAGCAAAGCAGAGCAGCGCAGCACAGCATAGCATCGCATCTCATCTGATCTGGTCTGGGCTGGGTTGGGTTGGGTTTGGTTTGGTTTGATTTGAGTTGAGGTGAGGAGAGGAA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"AGGCCAGGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGGTGGGGTGGGGTCGGGTCAGGTCAAGTCAAGTCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAGCCCAGCCCAGCACAGCACAGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGTGTGGTTTGGTTCGGTTCTGTTCTGTTCTGCTCTGCTCTGCTCTGCTCCGCTCCACTCCAGTCCAGACCAGAGCAGAGGAGAGGTGAGGTGAGGTGCGGTGCAGTGCAGTGCAGTGCAGTCCAGTCAAGTCAGGTCAGATCAGACCAGACTAGACTGGACTGCACTGCCCTGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTTGGGTTGGGTTGGGTTGGCTTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTCGGGTCTGGTCTGGTCTGTTCTGTCCTGTCATGTCATGTCATT\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GTGCGATGCGAGGCGAGACGAGATGAGATGAGATGAGATGACATGACGTGACGCGACGCAACGCACCGCACTGCACTTCACTTCACTTCCCTTCCTTTCCTGTCCTGCCCTGCCCTGCCTTGCCTGGCCTGACCTGAGCTGAGGTGAGGCGAGGCGAGGCGGGGCGGCGCGGCCCGGCCGGGCCGCGCCGCTCCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTCTTCTCGTCTCGCCTCGCGTCGCGGCGCGGCGCGGCTCGGCTTGGCTTCGCTTCCCTTCCGTTCCGGTCCGGCCCGGCACGGCAGGGCAGGGCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCTGCGCTGCGCTGAGCTGAGCTGAGATGAGACGAGACCAGACCAGACCACACCACGCCACGGCACGGGACGGGACGGGAAGGGAAGGGAAGCGAAGCCAAGCCAAGCCAGGCCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCTTGGCTGGGCTGTGCTGTCCTGTCGTGTCGGGTCGGTTCGGTTCGGTTAGGTTAGGTTAGCTTAGCC\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"GTTCTTTTCTTGTCTTGGCTTGGATTGGATTGGATCGGATCAGATCACATCACATCACACCACACTACACTCCACTCGACTCGACTCGAGTCGAGGCGAGGAGAGGAAAGGAAAGGAAAGGAAAGCAAAGCTAAGCTCAGCTCCGCTCCACTCCAGTCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGCTGGGCCGGGCCCGGCCCAGCCCAGCCCAGACCAGATCAGATTAGATTTGATTTGATTTGGTTTGGGTTGGGGTGGGGCGGGGCTGGGCTTGGCTTCGCTTCTCTTCTGTTCTGTTCTGTCCTGTCCTGTCCTGTCCTGTCCTGACCTGAACTGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGCGGCGCTGCGCTGCGCTGGGCTGGCCTGGCTTGGCTCGGCTCCGCTCCTCTCCTGTCCTGGCCTGGTCTGGTGTGGTGTGGTGTGGTGTGATGTGAAGTGAATTGAATGGAATGGAATGGGATGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCAGCCCAGCCCAGGCCAGGGCAGGGCAGGGCTGGGCTG\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"GGCCAGGCCAGGCCAGGGCAGGGGAGGGGAGGGGACGGGACCGGACCAGACCAGACCAGGCCAGGCCAGGCTAGGCTGGGCTGGGCTGGGCTGGGATGGGAGGGGAGAGGAGAGGAGAGCAGAGCTGAGCTGAGCTGCGCTGCCCTGCCATGCCAAGCCAACCCAACCCAACCGAACCGCACCGCACCGCACCGCACCGCACCTCACCTGACCTGTCCTGTGCTGTGATGTGAAGTGAAGTGAAGGGAAGGAAAGGAAAGGAATGGAATGGAATGGAATGGTATGGTCTGGTCAGGTCAGGTCAGGTCAGGACAGGAAAGGAACGGAACCGAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCATCCCACCCCACCCCACCCCACCCTACCCTGCCCTGGCCTGGGCTGGGATGGGATGGGATGGGATGCGATGCAATGCATTGCATTGCATTCCATTCCATTCCTTTCCTGTCCTGGCCTGGCCTGGCTTGGCTTGGCTTTGCTTTTCTTTTATTTTACTTTACCTTACCATACCAGACCAGTCCAGTTCAGTTAAGTTATGTTATTTTATTC\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"ATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGAGCCGAGCCGAGGCGAGGTGAGGTGAGGTGGGGTGGGGTGGGTTGGGTGGGGTGGGGTGGAGTGGATTGGATCGGATCAGATCATATCATCTCATCTCATCTGATCTGATCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTGGTTTGATTTGAATTGAAATGAAACGAAACCAAACCAAACCAGACCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCAGGCCAAGCCAACCCAACACAACATAACATGACATGGCATGGCATGGCATGGCAAGGCAAAGCAAAACAAAACAAAACCAAACCCAACCCCACCCCGCCCCGTCCCGTCCCGTCTCGTCTCGTCTCTTCTCTACTCTACTCTACTCTACTATACTAAACTAAACTAAAATAAAAAAAAAATAAAATAAAATAC\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"CCGCCACGCCAGGCCAGGCCAGGCCAGGCTAGGCTCGGCTCCGCTCCTCTCCTCTCCTCTCCTCTGCTCTGCTCTGCACTGCAGTGCAGCGCAGCGCAGCGCAGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGCCGCGCAGCGCAGCGCAGCGCAGCGCAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCGCCTCGCCTCGCGTCGCGGCGCGGTGCGGTACGGTAGGGTAGGGTAGGCTAGGCGAGGCGCGGCGCGGCGCGGCGCGGAGCGGAGCGGAGGGGAGGAGAGGAAAGGAAGGGAAGCGAAGCGAAGCGGAGCGGCGCGGCCCGGCCAGGCCACGCCACACCACAGCACAGGACAGGGCAGGGCAGGGCTGGGCTGGGCTGCGCTGCCCTGCCGTGCCGCGCCGCCCCGCCCCGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAACAGAATAGAATCGAATCGAATCGCATCGCATCGCAGCGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACGGCACGACACGATACGATC\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"CGCAGTGCAGTGCAGTGGAGTGGTGTGGTCTGGTCTGGTCTTGTCTTGTCTTGGCTTGGCTTGGCATGGCAGGGCAGCGCAGCTCAGCTGAGCTGCGCTGCCCTGCCATGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTAGAGTAGAGTAGGGTAGGTTAGGTAAGGTAGGGTAGTGTAGTGTAGTGCAGTGCCGTGCCTTGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGACCGGACCGGACCCGACCCTACCCTCCCCTCGCCTCGCCTCGCTTCGCTACGCTAGGCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCATGGCACGGCACTGCACTACACTATACTATCCTATCATATCACATCACATCACAACACAAAACAAAGCAAAGGAAAGGAAAGGACAGGACAGGACAAGACAAAACAAAGCAAAGCAAAGCTAAGCTGAGCTGGGCTGGTCTGGTTTGGTTGGGTTGAGTTGAATTGAAGTGAAGCGAAGCTAAGCTAAGCTACGCTACACTACAGTACAGAACAGAACAGAAAAGAAATGAAATCAAATCCAATCCC\",
        \"label\": \"promoter\"
    },
    {
        \"seq\": \"TACCGGACCGGACCGGAGCGGAGAGGAGACGAGACCAGACCGGACCGCACCGCACCGCACCGCACTGCACTGCACTGAACTGAACTGAAGTGAAGAGAAGACAAGACTAGACTGGACTGTACTGTTCTGTTTTGTTTTGTTTTATTTTAGTTTAGATTAGAGTAGAGTAGAGTTGAGTTGAGTTGAGTTGACTTGACTTGACTGGACTGAACTGACCTGACATGACAGGACAGTACAGTGCAGTGGAGTGGCGTGGCATGGCAGGGCAGCGCAGCGCAGCGAAGCGATGCGATTCGATTCGATTCTATTCTCTTCTCCTCTCCTCTCCTGTCCTGTCCTGTCCTGTCTTGTCTCGTCTCCTCTCCACTCCAGTCCAGCCCAGCCCAGCCCAGCCCTGCCCTCCCCTCACCTCAGCTCAGCTCAGCACAGCAGAGCAGTGCAGTGCAGTGTAGTGTCGTGTCCTGTCCCGTCCCTTCCCTTCCCTTTCCTTTGCTTTGGTTTGGGTTGGGCTGGGCAGGGCACGGCACCGCACCCCACCCAACCCAGCCCAGCCCAGCCCAGCCCAGCCCCGCCCCA\",
        \"label\": \"non-promoter\"
    },
    {
        \"seq\": \"CAGAATAGAATCGAATCGAATCGCATCGCATCGCAACGCAAGGCAAGACAAGAAAAGAATAGAATCGAATCAAATCATATCATGTCATGCCATGCAATGCAGTGCAGAGCAGAGCAGAGCAGAGCGGAGCGAAGCGACGCGACCCGACCTGACCTGACCTGACCTGATCTGATTTGATTTGATTTAATTTACTTTACGTTACGCTACGCTACGCTTCGCTTCGCTTCACTTCACTTCACCTCACCTCACCTAACCTAGCCTAGACTAGATTAGATTAGATTGGATTGAATTGACTTGACTTGACTTGACTTTACTTTTCTTTTTTTTTTATTTTATTTTATTTTATTCTATTCTATTCTGTTCTGCTCTGCACTGCATTGCATCGCATCGCATCGTATCGTTTCGTTGCGTTGTGTTGTGTTGTGTTGTGTTGTGTTCTGTTCTGTTCTTTTCTTCTCTTCCCTTCCCTTCCCCTCCCCCCCCCCACCCCACCCCACTCCACTTCACTTCACTTCCCTTCCTTTCCTCTCCTCTCCTCTTCTCTTCTCTTCTCTTCTTTTCTTGTCTTGCCTTGCT\",
        \"label\": \"non-promoter\"
    }
    ],
    \"epochs\": 1
}
}"
res <- postForm("https://biolm.ai/api/v1/finetune_run/", .opts=list(postfields = params, httpheader = headers, followlocation = TRUE), style = "httppost")
cat(res)

JSON Response#

Expand Example Response
{
"id": "129",
"pipeline": {
    "id": "3",
    "pipeline_slug": "finetune_DNABERT_classifier"
},
"start_time": null,
"created_at": "2023-04-01T12:41:21.734731-07:00",
"end_time": null,
"status": "scheduled",
"algorithm": null,
"hyperopt": false
}

Request Definitions#

hyperopt:

False specifies whether or not to perform hyperparameter optimization (hyperopt). If set to false, no optimization will be performed.

input_json:

Is a nested JSON object that contains the data for training and validation, as well as configuration details like max_train and train (below)

max_train:

40000 and “max_validate”: 20000” set the maximum number of training and validation examples, respectively.

train:

Is an array of objects, each containing a DNA sequence (“seq”) and a corresponding label (“label”). These are the training examples for the fine-tuning process. The sequences are strings of characters representing nucleotide bases (adenine (A), cytosine (C), guanine (G), and thymine (T)), and the labels indicate the classification category for each sequence (e.g., “non-promoter” or “promoter”

seq:

This key is associated with a string value that represents a DNA sequence. Each character in the string corresponds to a nucleotide base. The sequence provided is what the model will analyze and learn from during the fine-tuning process.

label:

Each seq comes with a corresponding label, which is a string that categorizes the sequence. In the context of the provided example, the labels are “non-promoter” or “promoter”. These labels are used as the target outputs for the classifier, meaning that the DNABERT model will learn to predict these labels from unseen DNA sequences after being trained on the provided examples. The classifier’s goal is to determine whether a given DNA sequence functions as a promoter (a region of DNA that initiates transcription of a particular gene) or not.

Response Definitions#

Start_time:

This field records the time when the task started processing. Null indicates that the process has not started yet.

created_at:

The timestamp when the task was created or submitted to the system. It is in ISO 8601 date and time format with timezone information.

end_time:

Similar to start_time, this would record when the task finished processing. Null indicates it has not finished yet or has not started.

status:

This indicates the current state of the task. “scheduled” means that the task has been scheduled to run but has not yet started.

algorithm:

This indicates which algorithm or method is being used for the task. null suggests that this information is either not applicable, not decided yet, or simply not provided in the response.

hyperopt:

Indicates whether hyperparameter optimization is enabled for the task. false means that hyperparameter optimization is not being used. Hyperparameter optimization is a process to automatically select the best hyperparameters (settings) for a machine learning model to maximize its performance.