DNABERT Fine-Tuning#
Zeeshan Siddiqui
Nov 7, 2023
6 min read
On this page, we will show and explain the use of DNABERT. As well as document the BioLM API for fine-tuning, demonstrate no-code and code interfaces.
Description#
The gene regulatory code is governed not only by individual DNA sequences, but also by intricate interactions between regulatory elements and other cellular components. Elucidating these complex relationships is imperative for deciphering genomic regulation. While substantial data exists for protein-coding regions, annotations for non-coding regulatory regions can be sparse, presenting modeling challenges. Furthermore, non-coding DNA may exhibit polysemy, with a single sequence associated with multiple functions, alongside distant semantic ties.
Standard bioinformatics tools often struggle to capture such intricacies, necessitating advanced computational methods to model the multidimensional connections within genomics data. As described by Li et al. (2021), The DNABERT model aims to address these needs by learning meaningful representations of non-coding DNA for predictive tasks. DNABERT implements the Transformer architecture utilized in BERT with 12 layers, 768 hidden units, and 12 attention heads. The same model topology and training methodology are consistently applied across DNABERT variants. Through easy fine-tuning, DNABERT (a pre-trained bidirectional encoder representation model) achieved state-of-the-art performance on diverse regulatory predictions (promoters, splice sites and transcription factor binding sites), highlighting the power of pretraining on the complex patterns within non-coding DNA. In addition, the researchers showed that DNABERT, originally pretrained on the human genome, achieved excellent performance when fine-tuned and applied to model non-human genomic sequences ( cross-organism transferability).
API Usage#
The endpoint to Finetune DNABERT Classifier: https://biolm.ai/api/v1/finetune_run/.
Making Requests#
curl --location 'https://biolm.ai/api/v1/finetune_run/' \
--header "Authorization: Token $BIOLMAI_TOKEN" \
--header 'Content-Type: application/json' \
--data '{
"pipeline": "finetune_DNABERT_classifier",
"hyperopt": false,
"input_json": {
"max_train": 40000,
"max_validate": 20000,
"train": [{"seq":"CACAGCACAGCCCAGCCAAGCCAGGCCAGCCCAGCCCAGCCAAGCCACGCCACTCCACTACACTAGACTAGGCTAGGCTAGGCCAGGCCCGGCCCTGCCCTGCCCTGTCCTGTCCTGTCCTGTCCTGTCCTGTCCTGCCCTGCACTGCAGTGCAGCGCAGCCCAGCCCAGCCCCGCCCCCCCCCCTCCCCTGCCCTGTCCTGTACTGTAGTGTAGGGTAGGGTAGGGGAGGGGTGGGGTCGGGTCTGGTCTGGTCTGGTCTGGACTGGAATGGAACGGAACAGAACAGAACAGCACAGCCCAGCCAAGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTGGAGTGGAGTGGTGTGGTTTGGTTTGGTTTAGTTTAATTTAAGTTAAGATAAGAGAAGAGGAGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGGAGGGGAGGGGAGGGGAGTGGAGTCGAGTCGAGTCGCGTCGCCTCGCCTCGCCTTGCCTTGCCTTGCCTTGCCTTGCCCTGCCCTGCCCTGCCCTGTCCTGTGCTGTGCTGTGCCGTGCCATGCCACGCCACACCACAC","label":"non-promoter"},{"seq":"CTAATCTAATCTAATCTAATCTAGTCTAGTCTAGTATAGTAAAGTAATGTAATGTAATGCAATGCCATGCCGTGCCGCGCCGCGCCGCGTCGCGTTGCGTTGCGTTGGGTTGGTTTGGTGTGGTGGGGTGGAGTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACATGACATGACATGACATGACATGACATGACATGACATAACATACCATACCATACCTTACCTCACCTCACCTCAACTCAAATCAAACCAAACAAAACAGAACAGCACAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCCGCGCCACGCCATGCCATGCCATGCCATGCGATGCGCTGCGCCGCGCCACGCCAAGCCAAGCCAAGCCAAGCCAAGCCCAGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAGCAGAGGAGAGGGGAGGGTAGGGTTGGGTTGGGTTGTGTTGTCTTGTCCTGTCCAGTCCAATCCAACCCAACTCAACTCAACTCCACTCCTCTCCTATCCTATCCTATTCTATTCTATTCCATTCCT","label":"promoter"},{"seq":"GGAAGAGAAGAGAAGAGGAGAGGGGAGGGAAGGGAAGGGAAGGGAAGGGAAGGAAAGGAAAGGAAAGGAAATGAAATGAAATGCAATGCCATGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTCGGGTCCGGTCCCGTCCCATCCCAGCCCAGGCCAGGCCAGGCGAGGCGGGGCGGGGCGGGGCGGGGCGGGGCCGGGCCTGGCCTCGCCTCGCCTCGACTCGAGTCGAGCCGAGCGGAGCGTAGCGTGGCGTGCCGTGCCGTGCCCTGCCCAGCCCACCCCACGCCACGCCACGCCACGCCGCGCCGCGCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGCCTGGCCTGGCCAGGCCACGCCACGCCACGCCACGCCACGCCTCGCCTGGCCTGGCCTGGACTGGAGTGGAGTGGAGTTGAGTTGAGTTGCGTTGCATTGCAGTGCAGGGCAGGACAGGAAAGGAACGGAACCGAACCGAACCGGACCGGGCCGGGCCGGGCGGGGCGCGGCGCCGCGCCGCGCCGGGCCGGG","label":"promoter"},{"seq":"CGAAAGGAAAGCAAAGCAAAGCAAAGCAATGCAATCCAATCAAATCAGATCAGTTCAGTGCAGTGGAGTGGCGTGGCCTGGCCTGGCCTGGCCTGGCCTGGACTGGACTGGACCGGACCAGACCATACCATGCCATGTCATGTGATGTGTTGTGTAGTGTAGTGTAGTGTAGTATAGTATAGTATAGTATAGTATAGAATAGAGTAGAGAAGAGAGGAGAGCAGAGCAGAGCAAAGCAACGCAACACAACAGAACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCATGCCATCCCATCTCATCTAATCTATTCTATGCTATGCTATGCTATGCTTTGCTTAGCTTAACTTAATTTAATTTAATTTAATTTGATTTGGTTTGGCTTGGCATGGCAAGGCAACGCAACACAACATAACATTACATTACATTACATTACATTACATTACATGACATGTCATGTAATGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCGTCCCGGCCCGGACCGGAACGGAAAGGAAAAGAAAATAAAATCAAATCTAATCTTATCTTTTCTTTTCTTTTATTTTAA","label":"promoter"},{"seq":"TGACTCGACTCCACTCCCCTCCCATCCCAACCCAAACCAAACCAAACCAAACCAAACCAAACCAACCCAACACAACAAAACAAAACAAAACAAAAGAAAAGGAAAGGGAAGGGGAGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGTGGAGTTGAGTTCAGTTCAGTTCATTTCATCTCATCACATCACATCACCTCACCACACCACACCACTCCACTACACTAGACTAGACTAGACTAGACTAGACTTGACTTTACTTTCCTTTCCTTTCCTTTCCTTTCCTTACCTTATCTTATATTATAATATAAAATAAAATAAAAAAAAAAAAAAAACAAAACAAAACACAACACTACACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGAAGGGAGGGGAGTGGAGTGGAGTGCAGTGCTGTGCTTTGCTTAGCTTAACTTAAGTTAAGCTAAGCAAAGCAGAGCAGAGCAGAACAGAAAAGAAAGGAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAAAAAAATAAAATAAAATACAATACTATACTATACTAA","label":"promoter"},{"seq":"AAGCATAGCATGGCATGACATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAAGGAAAGAAAAGACAAGACTAGACTGGACTGGACTGGGCTGGGCTGGGCTGGGCTAGGCTAGGCTAGGCTAGGCTAGGCAAGGCACGGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACATCACACCACACCACACCTCACCTGACCTGTCCTGTACTGTAATGTAATGTAATCTAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACAAGACAGGACAGGACAGGCCAGGCAAGGCAGGGCAGAGCAGATCAGATGAGATGAGATGACATGACCTGACCTGACCTGACCTGACCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTG","label":"non-promoter"},{"seq":"GGCTTTGCTTTGCTTTGTTTTGTTTTGTTTTGTTTTGTTTTCTTTTCTTTTCTGTTCTGTTCTGTGCTGTGATGTGAGGTGAGTTGAGTTGAGTTAAGTTACGTTACGTTACGGTACGGGACGGGGCGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCATGCCACGCCACCCCACCTCACCTGACCTGCCCTGCACTGCAGTGCAGGGCAGGTCAGGTAAGGTAAGGTAAAGTAAAATAAAATAAAATCAAATCTAATCTGATCTGGTCTGGACTGGACTGGACAGGACATGACATTACATTGCATTGCATTGCCTTGCCCTGCCCTGCCCTGCCCTGACCTGAACTGAAATGAAATGAAATTAAATTGAATTGAATTGACTTGACCTGACCGGACCGAACCGAACCGAACCGAACCGAACCTAACCTTACCTTGCCTTGGCTTGGATTGGATTGGATAGGATACGATACAATACAATACAAAACAAACCAAACCAAACCCAACCCGACCCGGCCCGGCCCGGCCCGGCCTGGCCTGGCCTGACCTGACCTGACATGACAGGACAGTACAGTG","label":"promoter"},{"seq":"TCACCGCACCGTACCGTTCCGTTACGTTACGTTACTTTACTGTACTGCACTGCCCTGCCTTGCCTCGCCTCCCCTCCTCTCCTATCCTAGCCTAGTCTAGTGTAGTGGAGTGGCGTGGCGTGGCGGGGCGGAGCGGATCGGATAGGATACGATACGATACGGTACGGCACGGCGCGGCGGGGCGGCGCGGCACGGCAAGGCAATGCAATACAATAGAATAGTATAGTGTAGTGGAGTGGCGTGGCGTGGCGCGGCGCAGCGCACCGCACAGCACATCACATTACATTCCATTCAATTCAATTCAAGTCAAGGCAAGGCAAGGCAAGGCAGGGCAGGGCAGGACAGGAAAGGAAGGGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGAGAAGAGGAGAGGAGAGGAAAGGAACGGAACAGAACAGAACAGAACAGAGCAGAGCAGAGCCGAGCCAAGCCACGCCACCCCACCACACCAGACCAGCCCAGCACAGCAGAGCAGGGCAGGTCAGGTTAGGTTTGGTTTGGTTTGGTTTGGCTTGGCCTGGCCCGGCCCAGCCCAGCCCAGTCCAGTG","label":"promoter"},{"seq":"AGAAAAGAAAACAAAACAAAACAAAACAAAACAAAACAAAAGAAAAGCAAAGCTAAGCTCAGCTCCGCTCCGCTCCGGTCCGGACCGGAGCGGAGTGGAGTAGAGTAGAGTAGGGTAGGATAGGAAAGGAAAGGAAAGGAAAGTAAAGTGAAGTGAAGTGACGTGACATGACACGACACAACACAGCACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCACGCCACCCCACCTCACCTCACCTCCCCTCCCCTCCCGTCCCGGCCCGGTCCGGTACGGTAGGGTAGCGTAGCCTAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCGGGCCGGGCCGGCCCGGCCCGGCCAGGCCAAGCCAAGCCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCTTGGCTTGGCTTAGCTTAACTTAAGTTAAGCTAAGCGAAGCGGAGCGGGGCGGGCCGGGCCGGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCCTGGCCTGGCCTGGCCTGCCCTGCCCTGCCATGCCAAGCCAAACCAAAA","label":"promoter"},{"seq":"AAGTAGAGTAGAGTAGAGTAGAGGAGAGGCGAGGCCAGGCCTGGCCTCGCCTCCCCTCCTCTCCTGTCCTGCCCTGCTCTGCTTTGCTTCGCTTCACTTCAGTTCAGGTCAGGGCAGGGAAGGGAAGGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGAGTAGAGCAGAGCCGAGCCGAGCCGGGCCGGTCCGGTGCGGTGTGGTGTCGTGTCTTGTCTCGTCTCGTCTCGCCTCGCATCGCACCGCACCGCACCACACCAGACCAGACCAGAGCAGAGCAGAGCCGAGCCCAGCCCCGCCCCACCCCAGCCCAGACCAGATCAGATGAGATGGGATGGAATGGAATGGAACGGAACTGAACTCAACTCTACTCTGCTCTGTTCTGTCCTGTCCTGTCCCGTCCCATCCCATCCCATTCCATTCCATTCAATTCACTTCACATCACATCACATTACATTACATTAAATTAATTTAATTTAATTGAATTGAATTGAATTGAATTGAATCGAATCCAATCCAATCCAGTCCAGTCCAGTACAGTACAGTACTGTACTTTACTTTACTTTGCTTTGA","label":"non-promoter"},{"seq":"GGTGCAGTGCAATGCAAGGCAAGGCAAGGAAAGGAAAGGAATGGAATGGAATGAAATGAAATGAAGTGAAGCGAAGCCAAGCCAAGCCAAGCCAATCCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAACTCAATTCAATCCAATCAAATCATATCATCTCATCGCATCGAATCGAGTCGAGGCGAGGCGAGGCTAGGCTAGGCTACGCTACCCTACCCTACCCTACCCTGCCCTGCCCTGCCCTGCCATGCCATGCCATCCCATCTCATCTTATCTTGTCTTGTCTTGTGTTGTGGTGTGGCGTGGCCTGGCCAGGCCATGCCATGCCATGTCATGTGATGTGATGTGAGGTGAGGTGAGGGGAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACGACACGTCACGTGACGTGTCGTGTAGTGTAGTGTAGAGTAGATTAGATCAGATCAGATCAAATCAATTCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAGCTCAGCTCAGCACAGCATAGCATCGCATCACATCACATCACA","label":"promoter"},{"seq":"AGAGACGAGACTAGACTGGACTGGACTGGCCTGGCATGGCAAGGCAAGGCAAGGCAAGGAAAGGACAGGACAGGACAGGACAGGACAGGCCAGGCTAGGCTCGGCTCGGCTCGCCTCGCCTCGCCCCGCCCTGCCCTTCCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTAGTGTAGAGTAGAGTAGAGCAGAGCCGAGCCTAGCCTCGCCTCGCCTCGCCTCGCATCGCATCGCATTGCATTGCATTGGATTGGCTTGGCCTGGCCAGGCCACGCCACCCCACCACACCAGACCAGGCCAGGACAGGAGAGGAGGGGAGGCGAGGCAAGGCAGGGCAGTGCAGTGCAGTGTAGTGTTGTGTTGTGTTGTGTTGTCTTGTCTTGTCTGGTCTGCTCTGCCCTGCCTTGCCTCGCCTCTCCTCTCCTCTCGTCTCGACTCGAATCGAACCGAACTGAACTTAACTTGACTTGGCTTGGCTTGGCTTGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAACCCAAGCCAAGGCAAGGTAAGGTGAGGTGAGGTGAGGTGAGATGAGAAGAGAAG","label":"promoter"},{"seq":"TGGCGAGGCGACGCGACCCGACCCGACCCCACCCCACCCCAACCCAACCCAACCCAACCTAACCTGACCTGCCCTGCCCTGCCCTGCCCTGCCCTTCCCTTGCCTTGCCTTGCTTTGCTTTGCTTCGCTTCGCTTCGGTTCGGATCGGACCGGACAGGACACGACACTACACTGCACTGCACTGCACTGCAGTGCAGCGCAGCACAGCACAGCACCGCACCCCACCCAACCCAACCCAATCCAATGCAATGGAATGGCATGGCGTGGCGCGGCGCCGCGCCCCGCCCAGCCCAGCCCAGACCAGAACAGAACAGAACCGAACCCAACCCGACCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCCCTGCCGTGCCGCGCCGCGCCGCGGCGCGGGGCGGGCCGGGCAGGGCAGGGCAGTGCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCCAGCCCGGCCCGGCCCGGGCCGGGACGGGATGGGATAGGATAGGATAGCATAGCGTAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCACCCCAA","label":"non-promoter"},{"seq":"CTGTGTTGTGTAGTGTATTGTATAGTATATTATATCATATCTTATCTGATCTGTTCTGTACTGTAATGTAAAGTAAAGTAAAGTAAAGTTAAGTTAAGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGTCCAGTCCAGTCAAGTCAAGTCAACTCAACGCAACGCAACGCTACGCTACGCTAGGCTAGGCTAGGGTAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACTACACTCCACTCTACTCTGCTCTGCTCTGCACTGCAATGCAACGCAACACAACACAACACTACACTCCACTCTACTCTACTCTAGTCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCCTGGCCTGGCCTTGCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTATTGTATAGTATATTATATAATATATTATATGATATGGTATGGCATGGCATGGCAGGGCAGAGCAGAACAGAAAAGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAAAAGAAAGGAAAGTAAAGTAAAGTAAAGTAAAGTAAAT","label":"non-promoter"},{"seq":"CTATATTATATTATATTTTATTTGATTTGGTTTGGATTGGACTGGACAGGACAAGACAATACAATCCAATCGAATCGCATCGCCTCGCCGCGCCGTGCCGTGCCGTGACGTGATGTGATTTGATTAGATTAAATTAAATTAAACTAAACGAAACGAAACGAGACGAGTCGAGTGGAGTGTAGTGTAGTGTATTGTATGGTATGATATGAAATGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGTGGCGTCGCGTCTCGTCTGGTCTGATCTGAACTGAAGTGAAGCGAAGCTAAGCTAAGCTAGGCTAGGCTAGGGTAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTACGCTAGGCTAGACTAGATTAGATAAGATAAGATAAAATAAACTAAACAAAACACAACACTACACTGCACTGAACTGATCTGATTTGATTTGATTTCATTTCCTTTCCCTTCCCCTCCCCTCCCCTTCCCTTTCCTTTACTTTAGTTTAGGTTAGGGTAGGGAAGGGAAGGGAAAGGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAATAGAATG","label":"promoter"},{"seq":"AACTGCACTGCACTGCAGTGCAGGGCAGGACAGGATAGGATGGGATGCGATGCTATGCTCTGCTCTGCTCTTCTCTTGTCTTGGCTTGGATTGGAGTGGAGTGGAGTTGAGTTCAGTTCTGTTCTGTTCTGGTCTGGTCTGGTCTGGTCTGGTCTAGTCTACTCTACTCTACTCTACTCTACTCTGCTCTGCTCTGCGCTGCGATGCGATGCGATGCGATGCGATGCTATGCTTTGCTTGGCTTGTCTTGTTTTGTTTTGTTTGGTTTGCTTTGCATTGCAATGCAAAGCAAAACAAAACAAAACCAAACCCAACCCTACCCTGCCCTGTCCTGTCCTGTCATGTCATGTCATGTCATGACATGAGATGAGATGAGAAGAGAAGAGAAGGGAAGGTAAGGTCAGGTCCGGTCCAGTCCACTCCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGCATGGCATGGCAGGGCAGGGCAGGTCAGGTTAGGTTTGGTTTCGTTTCATTTCAGTTCAGCTCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCTGTGCTATGCTAC","label":"promoter"},{"seq":"GCTTTCCTTTCCTTTCCTTTCCTGTCCTGGCCTGGCCTGGCCTGGCCCGGCCCCGCCCCCCCCCCACCCCAACCCAAGCCAAGACAAGAGAAGAGTAGAGTGGAGTGCAGTGCAGTGCAGTGCAGGGCAGGGCAGGGAAGGGATGGGATGGGATGCGATGCCATGCCCTGCCCAGCCCAGCCCAGGCCAGGTCAGGTCAGGTCTGGTCTGGTCTGCTCTGCACTGCAATGCAACGCAACCCAACCAAACCACACCACCCCACCACACCACACCACTCCACTGCACTGGACTGGGCTGGGTTGGGTGGGGTGGGGTGGCGTGGCTTGGCTGGGCTGCGCTGCACTGCAGTGCAGCGCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCCGTGCCCTGCCCAGCCCAGCCCAGGCCAGGACAGGAGAGGAGGGGAGGGGAGGGTAGGGTGGGGTGGGGTGGGGTGGGATGGGACGGGACTGGACTCGACTCCACTCCTCTCCTGTCCTGCCCTGCCCTGCCCTGCCCCGCCCCACCCCACCCCACCCCACCACACCAAACCAACCCAACTCAACTC","label":"non-promoter"},{"seq":"TGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCCGTGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTG","label":"non-promoter"},{"seq":"GGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGAAGGGAAGGGAAGTGAAGTGAAGTGCAGTGCAGTGCATTGCATCGCATCCCATCCCATCCCTTCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAAAGAAACGAAACAAAACATAACATGACATGGCATGGGATGGGTTGGGTAGGGTAAGGTAAGGTAAGCTAAGCGAAGCGTAGCGTGGCGTGTCGTGTGGTGTGCTGTGCAGTGCAGTGCAGAGCAGACCAGACGAGACGTGACGTGACGTGGCGTGGAGTGGAGTGGAGAGGAGAGGAGAGGAGAGGGGAGGGCAGGGCGGGGCGTGGCGTGGCGTGGCGTGGGGTGGGGTGGGGTGGGGTGGGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGACGGGACGGGACGCGACGCCACGCCCCGCCCAGCCCACCCCACCCCACCCCACCCCACCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAA","label":"non-promoter"},{"seq":"AAGTTTAGTTTTGTTTTTTTTTTCTTTTCCTTTCCATTCCACTCCACCCCACCTCACCTGACCTGCCCTGCCCTGCCATGCCACGCCACTCCACTTCACTTCACTTCACTTCACTTCACATCACAACACAATACAATGCAATGAAATGACATGACCTGACCCGACCCTACCCTCCCCTCCCCTCCACTCCAGTCCAGCCCAGCGCAGCGCAGCGCCGCGCCCCGCCCTGCCCTCCCCTCTCCTCTACTCTACTCTACTCTACTGTACTGGACTGGCCTGGCATGGCAGGGCAGAGCAGAGCAGAGAAGAGACGAGACTAGACTAGACTAGACTAGCCTAGCATAGCATAGCATCGCATCACATCAAATCAAGTCAAGCCAAGCCAAGCCAAGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGCCTGGCATGGCAAGGCAAAGCAAACCAAACCAAACCAAACCAGACCAGACCAGAGCAGAGGAGAGGCGAGGCGAGGCGTGGCGTCGCGTCCCGTCCTGTCCTTTCCTTTCCTTTACTTTAATTTAAGTTAAGGTAAGGTAAGGTCAGGTCC","label":"promoter"},{"seq":"TTTTTTTTTTTTTTTTTGTTTTGCTTTGCGTTGCGGTGCGGGGCGGGGCGGGGCGGGGCGGGGCGCGGCGCAGCGCAGCGCAGTGCAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGTCCTGTACTGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACGAGACGGGACGGGACGGGCCGGGCAGGGCAGGGCAGAGCAGATCAGATCAGATCAGATCACATCACGTCACGACACGAGACGAGGCGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGAGGAGATGAGATCAGATCGGATCGAATCGAGTCGAGACGAGACGAGACTAGACTAGACTATACTATCCTATCCTATCCTATCCTGTCCTGGCCTGGCCTGGCTTGGCTAGGCTAA","label":"non-promoter"},{"seq":"CCGCCTCGCCTTGCCTTCCCTTCCCTTCCCTTCCCTTCCCTCCCCTCTCCTCTGCTCTGTTCTGTTCTGTTTTGTTTTGTTTTTTTTTTGTTTTGGTTTGGCTTGGCATGGCATGGCATAGCATAACATAAGATAAGATAAGAAAAGAAAAGAAACGAAACAAAACAAAACAATACAATTCAATTCAATTCAATTCAGTTCAGGTCAGGTCAGGTTAGGTTTGGTTTAGTTTATTTTATCTTATCATATCAAATCAAGTCAAGGCAAGGAAAGGAGAGGAGAGGAGAGGAGAGTAGAGTCGAGTCCAGTCCAGTCCAGTCCAGGCCAGGGCAGGGTAGGGTCGGGTCAGGTCAGGTCAGATCAGAACAGAATAGAATTGAATTTAATTTTATTTTTTTTTTCTTTTCTTTTCTATTCTAATCTAACCTAACCTAACCAAACCACACCACCCCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCAGCGCAGCGCAGAGCAGAGCAGAGCAGAGCAGAGCAAAGCAAGGCAAGCCAAGCTAAGCTTAGCTTA","label":"promoter"},{"seq":"ACAAAACAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAGAAAAGCAAAGCGAAGCGGAGCGGGGCGGGACGGGAAGGGAAGGGAAGCGAAGCAAAGCAGAGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGGCTTGGGTTGGGTTGGGTTGGGTTAGGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGCCCAGCACAGCACAGCACAGCACACCACACCACACCCCACCCAACCCACCCCACCCCACCACACCAGACCAGACCAGAGCAGAGGAGAGGGGAGGGCAGGGCAGGGCAGGGCAGCGCAGCACAGCAGAGCAGAGCAGACCAGACAAGACACGACACTACACTGCACTGGACTGGCCTGGCTTGGCTAGGCTAAGCTAAACTAAAGTAAAGCAAAGCTAAGCTCAGCTCTGCTCTTCTCTTATCTTAGCTTAGTTTAGTCTAGTCAAGTCATGTCATATCATAACATAAGATAAGTTAAGTCAAGTCCAGTCCTGTCCTGTCCTGACCTGAGCTGAGTTGAGTGGAGTGCAGTGCT","label":"promoter"},{"seq":"ACTGGACTGGAATGGAAAGGAAAAGAAAATAAAATTAAATTTAATTTTATTTTATTTTAATTTAAATTAAATTAAATGAAATGAAATGAAATGAATTGAATGGAATGAAATGATATGATGTGATGTGATGTGATGTGATGTGATGTGATTTGATTCGATTCTATTCTGTTCTGTTCTGTGCTGTGGTGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCTCTCCTGTCCTGTCCTGTGCTGTGGTGTGGGGTGGGCTGGGCAGGGCAGGGCAGCGCAGCACAGCACAGCACTGCACTGCACTGGACTGGCCTGGCCTGGCCTGGCCTGGCCTGACCTGAACTGAAGTGAAGCGAAGCAAAGCACAGCACAGCACAACACAAAACAAACCAAACCAAACCTAACCTGACCTGGCCTGGACTGGAGTGGAGCGGAGCCGAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGCCCAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAG","label":"promoter"},{"seq":"AGAGCTGAGCTGAGCTGTGCTGTCCTGTCTTGTCTGGTCTGCTCTGCTCTGCTGTGCTGGGCTGGGCTGGGGTGGGGGGGGGGCGGGGCAGGGCAGGGCAGGGCAGGGCAGGGCAGGGCGGGGCGCGGCGCTGCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTGTTCTGGTCTGGGCTGGGCTGGGCAGGGCACGGCACTGCACTGCACTGTACTGTACTGTAGTGTAGGGTAGGATAGGATAGGATGGGATGTGATGTTATGTTATGTTAGGTTAGCTTAGCATAGCAGAGCAGCGCAGCGCAGCGAAGCGACGCGACCCGACCCGACCCTACCCTGCCCTGGCCTGGCCTGGCCTGGCCTGGCCTCGCCTCTCCTCTACTCTACTCTACCCTACCATACCACACCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGCGATGCCATGCCATGCCAGGCCAGTCCAGTACAGTAGAGTAGCGTAGCATAGCACAGCACCGCACCCCACCCTACCCTCCCCTCCCCTCCTCTCCTCTCCTCTCCTCTCCTCTCCTCTCCACTCCAG","label":"promoter"},{"seq":"GCTTTGCTTTGTTTTGTTTTGTTATGTTACGTTACATTACAGTACAGGACAGGTCAGGTGAGGTGTGGTGTCGTGTCTTGTCTGGTCTGTTCTGTTCTGTTATGTTAAGTTAACTTAACATAACATAACATTACATTCCATTCCATTCCATTCCATTCCATGCCATGGCATGGAATGGACTGGACCGGACCAGACCAAACCAAACCAAAACAAAACAAAACAAAACAAAACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAAGCCAAACCAAACCAAACCAAACCCAACCCAACCCAACCCAAACCAAAACAAAATAAAATCAAATCAAATCAAATCAAGTCAAGGCAAGGGAAGGGAAGGGACGGGACAGGACAGGACAGGACAGGACAGGAAAGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGACTAGACTAGACTCGACTCCACTCCACTCCACTCCACCCCACCCCACCCAACCCATCCCATGCCATGCCATGCAATGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGAGTGGAATGGAAGGGAAGGGAAGGGAAGGGGAGGGGA","label":"non-promoter"},{"seq":"GGTGATGTGATGTGATGCGATGCTATGCTATGCTACGCTACACTACAGTACAGGACAGGGCAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCTGCCCTCCCCTCACCTCATCTCATATCATAGCATAGGATAGGATAGGACAGGACAGGACAGGACAGGACAGGTCAGGTGAGGTGCGGTGCTGTGCTCTGCTCAGCTCACCTCACCTCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGAGCGGAGCGGAGGGGAGGAGAGGACAGGACAGGACAAGACAACACAACCCAACCCAACCCGACCCGTCCCGTCCCGTCCCGTCCGGTCCGGTCCGGGCCGGGCCGGGCTGGGCTGGGCTGGGCTGGACTGGAGTGGAGCGGAGCAGAGCAGAGCAGGGCAGGTCAGGTCAGGTCAGGTCAAGTCAAGTCAAGACAAGAGAAGAGGAGAGGCGAGGCTAGGCTCGGCTCTGCTCTGCTCTGGTCTGGGCTGGGATGGGAGGGGAGAGGAGACGAGACAAGACACGACACTACACTTCACTTCACTTCC","label":"non-promoter"},{"seq":"GGGGCAGGGCAGGGCAGGGCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCCAAGCCCAGCCCAGCCCATCCCATCCCATCTCATCTAATCTACTCTACACTACAATACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAGGCCAGTCCAGTGCAGTGGAGTGGCGTGGCTTGGCTTGGCTTTGCTTTTCTTTTCTTTTCCTTTCCCTTCCCCTCCCCCCCCCCACCCCAACCCAACCCAACCCAACCCAACCCAACCCAGCCCAGTCCAGTCCAGTCCAGTCCTGTCCTTTCCTTCCCTTCCCTTCCCTTCCCATCCCAACCCAAACCAAATCAAATTAAATTCAATTCCATTCCCTTCCCATCCCACCCCACACCACAGCACAGCACAGCCCAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGACCAGATCAGATCAGATCCGATCCCATCCCTTCCCTGCCCTGCCCTGCACTGCAATGCAACGCAACCCAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGCCCCGCTCCGCTT","label":"non-promoter"},{"seq":"ATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTCGTTTCCTTTCCGTTCCGTTCCGTGCCGTGGCGTGGGGTGGGATGGGAGGGGAGAGGAGAGGAGAGGAGAGGTGAGGTAAGGTAAGGTAACGTAACATAACACAACACAACACAACACAATACAATACAATAGAATAGCATAGCTTAGCTTAGCTTGGCTTGTCTTGTATTGTATTGTATCGTATCATATCAGATCAGTTCAGTCCAGTCAAGTCATGTCATTTCATTACATTACATTACCTTACCATACCACACCACTCCACTTCACTTGACTTGACTTGAGTTGAGTTGAGTGGAGTGTAGTGTGGTGTGATGTGAAGTGAAGTGAAGCGAAGCAAAGCAGAGCAGTGCAGTTCAGTTAAGTTAGGTTAGTTTAGTCTAGTCAAGTCAAGTCAAATCAAAGCAAAGTAAAGTCAAGTCTAGTCTGGTCTGGTCTGGGCTGGGATGGGAGGGGAGTGGAGTGGAGTGAAGTGAAGTGAATTGAATGGAATGAAATGAGATGAGATGAGAGGAGAGTAGAGTAGAGTAGAGTAGAGTAGAA","label":"non-promoter"},{"seq":"AACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAAAAAAAAAAAAAACAAAACAAAACACAACACAACACAGCACAGCACAGCACAGCAAAGCAAAGCAAACCAAACCAAACCTAACCTGACCTGTCCTGTACTGTATTGTATGGTATGTTATGTTATGTTGTGTTGTGTTGTCTTGTCCTGTCCCGTCCCTTCCCTTCCCTTCCCTTCCCTTCCATTCCAGTCCAGGCCAGGTCAGGTCAGGTCCGGTCCCGTCCCCTCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGGCTGGGCTGGGCAGGGCATGGCATTGCATTTCATTTGATTTGCTTTGCATTGCAGTGCAGAGCAGAACAGAACAGAACCGAACCGAACCGCACCGCACCGCAGCGCAGCGCAGCACAGCATAGCATCGCATCCCATCCCATCCCATCCCAGCCCAGACCAGATCAGATCAGATCAGATCACATCACTTCACTCCACTCGACTCGTCTCGTTTCGTTACGTTAAGTTAAATTAAAATAAAAAAAAAAAAAAAATAAAATT","label":"promoter"},{"seq":"TCCTGACCTGATCTGATATGATAAGATAAAATAAACTAAACCAAACCCAACCCAACCCATCCCATGCCATGGCATGGGATGGGATGGGATGGGATCGGATCTGATCTCATCTCATCTCATCTCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATTGAATTAAATTATATTATTTTATTCTATTCAATTCATTTCATTTCATTACATTATATTATCTTATCATATCATATCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATAGAATAGAATAGTATAGTATAGTATAGTATGGTATGGTATGGGATGGGATGGGAAGGGAAAGGAAAGGAAAGAAAAGACAAGACCAGACCAGACCAGACCAGTCCAGTCCAGTCCAGTCCCGTCCCCTCCCCACCCCATCCCATGCCATGACATGATATGATTTGATTCGATTCAATTCAATTCAATTCAATTCAATTAAATTACATTACCTTACCTTACCTCACCTCCCCTCCCCTCCCCTCCCCCCCCCCTCCCCTGCCCTGGCCTGGGCTGGGTTGGGTCGGGTCCGGTCCCGTCCCT","label":"non-promoter"},{"seq":"GTCATCTCATCGCATCGTATCGTATCGTAGCGTAGTGTAGTATAGTACAGTACTGTACTATACTACACTACACTACATTACATTACATTTCATTTTATTTTATTTTAATTTAAATTAAACTAAACAAAACATAACATGACATGTCATGTAATGTAATGTAAAGTAAAGTAAAGAAAAGAGAAGAGCAGAGCTGAGCTCAGCTCAGCTCAGCTCAGTTCAGTGCAGTGGAGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCACTCCAATCCAAGCCAAGACAAGAAAAGAAGAGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGACAAGACTAGACTTGACTTTACTTTGCTTTGGTTTGGTTTGGTATGGTAGGGTAGAGTAGAGTAGAGAAGAGACGAGACGAGACGGGACGGCACGGCCCGGCCGGGCCGCGCCGCTCCGCTTCGCTTGGCTTGCCTTGCTTTGCTCTGCTCCGCTCCCCTCCCATCCCAACCCAAACCAAATCAAATAAAATATAATATCATATCATATCATATCATGTCATGCCATGCTATGCTGTGCTGA","label":"non-promoter"},{"seq":"ACTGCGCTGCGCTGCGCGGCGCGCCGCGCCGCGCCGCGCCGAGCCGACCCGACGCGACGGGACGGTACGGTGCGGTGGGGTGGGGTGGGCTGGGCTGGGCTGGGCTGGGCTGGCCTGGCGTGGCGGGGCGGGGCGGGACGGGACGGGACCGGACCAGACCAGACCAGGCCAGGACAGGACAGGACAGGACAGGACAGGACAGGACAGGAAAGGAACGGAACAGAACAAAACAATACAATGCAATGGAATGGGATGGGATGGGATGGGATTGGATTCGATTCCATTCCGTTCCGATCCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGCGCCGCACCGCAACGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTCCGCTCCGCTCCTCTCCTTTCCTTCCCTTCTCTTCTGTTCTGCTCTGCGCTGCGGTGCGGGGCGGGTCGGGTTGGGTTGGGTTGGGTTGGGTTGGGGTGGGGTGGGGTGGGGTGCGGTGCGGTGCGATGCGAGGCGAGGCGAGGCGAGGCCAGGCCGGGCCGGGCCGGA","label":"promoter"},{"seq":"TGTGCTGTGCTGTGCTGAGCTGATCTGATGTGATGCGATGCCATGCCTTGCCTGGCCTGTCCTGTGCTGTGGTGTGGTGTGGTTTGGTTTGGTTTGGTTTGGTTTGGTTTGGTGTGGTGGGGTGGGGTGGGGTGGGGCGGGGCTGGGCTAGGCTACGCTACACTACAATACAACACAACACAACAGAACAGGACAGGACAGGAAAGGAAAGGAAATGAAATTAAATTCAATTCCATTCCTTTCCTGTCCTGCCCTGCTCTGCTTTGCTTTGCTTTGCTTTGGTTTGGATTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACAGGACAGAACAGAACAGAAAAGAAAGGAAAGCAAAGCAAAGCAGAGCAGAGCAGATCAGATAAGATAGGATAGCATAGCCTAGCCAAGCCAAGCCAAACCAAATCAAATTAAATTCAATTCTATTCTCTTCTCTTCTCTCCTCTCTTCTCTACTCTACTCTACCCTACCATACCACACCACACCACATCACATTACATTTCATTTTATTTTGTTTTGGTTTGGATTGGAATGGAAAGGAAACGAAACT","label":"non-promoter"},{"seq":"GGTTCCGTTCCCTTCCCGTCCCGCCCCGCTCCGCTTCGCTTCGCTTCCCTTCCATTCCACTCCACCCCACCGCACCGAACCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGAGCCGACCCGACTCGACTGGACTGCACTGCGCTGCGATGCGAGGCGAGGCGAGGTGAGGTGAGGTGCGGTGCAGTGCATTGCATGGCATGCCATGCTATGCTGTGCTGGGCTGGGCTGGGATGGGAGGGGAGTGGAGTCGAGTCGAGTCGTGTCGTATCGTAGCGTAGTGTAGTATAGTACAGTACCGTACCGTACCGCACCGCACCGCACCGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCGGGGCGGGGCGGAGCGGAACGGAACGGAACAGAACAGAACAGCACAGCTCAGCTCAGCTCCGCTCCGCTCCGCTCCGCCCCGCCCCGCCCCGCCCCGCCCCGGCCCGGCCCGGCGCGGCGGGGCGGAGCGGATCGGATGGGATGGGATGGTATGGTGTGGTGTGGTGTTGTGTTTTGTTTCGTTTCCTTTCCATTCCAGTCCAGA","label":"non-promoter"},{"seq":"GCCCGGCCCGGGCCGGGACGGGAGGGGAGCGGAGCGGAGCGTAGCGTCGCGTCGCGTCGCGTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCCCTCCCCTCCCCCCCCCCGCCCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCCCCGCCCCGCCCCGCCCCGCCTCGCCTCGCCTCGCCTCGGCTCGGGTCGGGGCGGGGAGGGGACGGGACTGGACTCGACTCGACTCGTCTCGTCTCGTCCCGTCCCGTCCCTTCCCTCCCCTCCCCTCCACTCCACTCCACACCACAGCACAGCACAGCCCAGCCCAGCCCCGCCCCTCCCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGTCCCGTCCCGTCGCGTCGGGTCGGATCGGAACGGAATGGAATTGAATTCAATTCGATTCGCTTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCGCTCCGCTCCGCCCCGCCGCGCCGTGCCGTTCCGTTCCGTTCTGTTCTT","label":"non-promoter"},{"seq":"CTGGCTTGGCTGGGCTGCGCTGCTCTGCTCTGCTCCGCTCCTCTCCTTTCCTTACCTTACCTTACATTACAATACAAAACAAACCAAACCAAACCTAACCTGACCTGTCCTGTGCTGTGGTGTGGAGTGGAGTGGAGTGGAGTTGAGTTGAGTTGGGTTGGATTGGACTGGACTGGACTTGACTTGACTTGCCTTGCTTTGCTGTGCTGTGCTGTTCTGTTTTGTTTTGTTTTTTTTTTCTTTTCCTTTCCTTTCCTCTCCTCTCCTCTTCTCTTGTCTTGCCTTGCCTTGCCATGCCACGCCACTCCACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGTAGGGTAGGGTAGGGTAGAGTAGAATAGAATAGAATAGAATATAATATGATATGATATGAAATGAAATGAAAAGAAAAGAAAAGAAAAGAAAAGAAGAGAAGAGAAGATAAGATTAGATTAGATTAGATTAGCTTAGCATAGCATAGCATGGCATGTCATGTTATGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATCTCATCCCATCCTATCCTTTCCTTGCCTTGC","label":"promoter"},{"seq":"CCCTGCCCTGCTCTGCTATGCTACGCTACACTACAGTACAGTACAGTTCAGTTTAGTTTTGTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTGTTTTGATTTGATTTGATATGATATGATATAATATACTATACAATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTTGTTTTGTTTTGCTTTGCCTTGCCATGCCAGGCCAGCCCAGCTCAGCTTAGCTTTGCTTTTCTTTTTTTTTTTTTTTTCTTTTCATTTCACTTCACTTCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGTATGGTGTGGTGCGGTGCTGTGCTATGCTAAGCTAATCTAATTTAATTCAATTCCATTCCCTTCCCTTCCCTTCCCTTTCCTTTACTTTAGTTTAGTTTAGTGTAGTGAAGTGACGTGACCTGACCTGACCTAACCTAGCCTAGGCTAGGCTAGGCTAGGCTGGGCTGTGCTGTTCTGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATATCATACCATACGATACGATACGATACGATGCGATGTGATGTTATGTTTTGTTTA","label":"promoter"}],
"validation": [{"seq":"GTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGGGTGGGCTGGGCCGGGCCTGGCCTGGCCTGTCCTGTACTGTAGTGTAGCGTAGCATAGCAGAGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACCGCACCTCACCTGACCTGACCTGAGCTGAGGTGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGCAGGGCTGGGCTGGGCTGGGCTGGCCTGGCATGGCAGGGCAGCGCAGCCCAGCCCAGCCCCGCCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTG","label":"non-promoter"},{"seq":"GTGTGGTGTGGGGTGGGATGGGATGGGATCGGATCAGATCATATCATGTCATGTCATGTAATGTATTGTATCGTATCATATCAGATCAGTTCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGACTTGACTTGACCTGACCTGACCTCACCTCCCCTCCTCTCCTGTCCTGGCCTGGGCTGGGCTGGGCTGGGCTCGGCTCAGCTCAACTCAAGTCAAGCCAAGCAAAGCATAGCATTGCATTCCATTCTATTCTTTTCTTCTCTTCCCTTCCCTTCCCATCCCACCCCACCCCACCTCACCTCACCTCACCTCAACTCAACTCAACCCAACCTAACCTCACCTCTCCTCTTCTCTTGTCTTGACTTGAGTTGAGTTGAGTAGAGTAGAGTAGCGTAGCTTAGCTGAGCTGAGCTGAACTGAAATGAAATGAAATTAAATTAAATTACATTACATTACAGTACAGGACAGGACAGGAAAGGAACGGAACAGAACATAACATGACATGCCATGCCATGCCATGCCACGCCACCCCACCACACCACACCACA","label":"non-promoter"},{"seq":"CCCTGCCCTGCACTGCATTGCATGGCATGCCATGCCATGCCATGCCACGCCACACCACATCACATAACATAGCATAGCATAGCATAGCAAAGCAAGGCAAGGCAAGGTAAGGTGAGGTGCGGTGCTGTGCTGTGCTGGGCTGGGCTGGGTTGGGTCGGGTCAGGTCACGTCACTTCACTGCACTGAACTGATCTGATGTGATGCGATGCTATGCTATGCTAAGCTAACCTAACATAACATAACATCACATCTCATCTAATCTAATCTAAACTAAACTAAACAAAACAGAACAGGACAGGGCAGGGGAGGGGCGGGGCCGGGCCAGGCCAGGCCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGTGCGGTGCGGTGGGGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACTTCACTCCACTCTACTCTTCTCTTGTCTTGACTTGAATTGAAATGAAATGAAATCAAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGC","label":"non-promoter"},{"seq":"TTGGAGTGGAGCGGAGCAGAGCAAAGCAAGGCAAGGCAAGGCAAGGCTAGGCTAGGCTATGCTATGCTATGCTATGCAATGCACTGCACCGCACCACACCATACCATACCATACCATACAATACATTACATGACATGCCATGCTATGCTCTGCTCTGCTCTGCTCTGATCTGAGCTGAGTTGAGTGGAGTGGAGTGGGGTGGGCTGGGCTGGGCTTGGCTTGGCTTGACTTGATTTGATTTGATTCGATTCCATTCCTTTCCTCTCCTCCCCTCCACTCCAGTCCAGGCCAGGGCAGGGAAGGGAAGGGAAGGGAAGAGAAGAGAAGAGGAGAGGCGAGGCCAGGCCAGGCCAGGCCAGGCCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCAAAGCATAGCATTGCATTGCATTGAATTGATTTGATGTGATGTGATGTGATGTGATGTGAAGTGAAATGAAAAGAAAACAAAACAAAACAGAACAGCACAGCCCAGCCTAGCCTTGCCTTTCCTTTCCTTTCCTTTCCCTTCCCTTCCCTTCCCTTGCCTTGCCTTGCCTTGCCATGCCAT","label":"non-promoter"},{"seq":"AGCACAGCACAGCACAGGACAGGGCAGGGCAGGGCAGGGCACGGCACTGCACTGCACTGGACTGGTCTGGTGTGGTGGGGTGGAGTGGAGTGGAGGGGAGGGGAGGGAAGGGAGGGGAGCGGAGCCGAGCCCAGCCCTGCCCTGCCCTGCCCTGCGCTGCGGTGCGGGGCGGGGCGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAACTCAAGTCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGATCGGATGGGATGGGATGGGATGGGTTGGGTGGGGTGTGGTGTGGTGTGATGTGAGGTGAGATGAGAGGAGAGGAGAGGCGAGGCAAGGCACGGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCTGGGCTGGGCTGAGCTGAACTGAAGTGAAGCGAAGCAAAGCAGAGCAGCGCAGCACAGCATAGCATCGCATCTCATCTGATCTGGTCTGGGCTGGGTTGGGTTGGGTTTGGTTTGGTTTGATTTGAGTTGAGGTGAGGAGAGGAA","label":"non-promoter"},{"seq":"AGGCCAGGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGGTGGGGTGGGGTCGGGTCAGGTCAAGTCAAGTCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAGCCCAGCCCAGCACAGCACAGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGTGTGGTTTGGTTCGGTTCTGTTCTGTTCTGCTCTGCTCTGCTCTGCTCCGCTCCACTCCAGTCCAGACCAGAGCAGAGGAGAGGTGAGGTGAGGTGCGGTGCAGTGCAGTGCAGTGCAGTCCAGTCAAGTCAGGTCAGATCAGACCAGACTAGACTGGACTGCACTGCCCTGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTTGGGTTGGGTTGGGTTGGCTTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTCGGGTCTGGTCTGGTCTGTTCTGTCCTGTCATGTCATGTCATT","label":"non-promoter"},{"seq":"GTGCGATGCGAGGCGAGACGAGATGAGATGAGATGAGATGACATGACGTGACGCGACGCAACGCACCGCACTGCACTTCACTTCACTTCCCTTCCTTTCCTGTCCTGCCCTGCCCTGCCTTGCCTGGCCTGACCTGAGCTGAGGTGAGGCGAGGCGAGGCGGGGCGGCGCGGCCCGGCCGGGCCGCGCCGCTCCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTCTTCTCGTCTCGCCTCGCGTCGCGGCGCGGCGCGGCTCGGCTTGGCTTCGCTTCCCTTCCGTTCCGGTCCGGCCCGGCACGGCAGGGCAGGGCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCTGCGCTGCGCTGAGCTGAGCTGAGATGAGACGAGACCAGACCAGACCACACCACGCCACGGCACGGGACGGGACGGGAAGGGAAGGGAAGCGAAGCCAAGCCAAGCCAGGCCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCTTGGCTGGGCTGTGCTGTCCTGTCGTGTCGGGTCGGTTCGGTTCGGTTAGGTTAGGTTAGCTTAGCC","label":"promoter"},{"seq":"GTTCTTTTCTTGTCTTGGCTTGGATTGGATTGGATCGGATCAGATCACATCACATCACACCACACTACACTCCACTCGACTCGACTCGAGTCGAGGCGAGGAGAGGAAAGGAAAGGAAAGGAAAGCAAAGCTAAGCTCAGCTCCGCTCCACTCCAGTCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGCTGGGCCGGGCCCGGCCCAGCCCAGCCCAGACCAGATCAGATTAGATTTGATTTGATTTGGTTTGGGTTGGGGTGGGGCGGGGCTGGGCTTGGCTTCGCTTCTCTTCTGTTCTGTTCTGTCCTGTCCTGTCCTGTCCTGTCCTGACCTGAACTGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGCGGCGCTGCGCTGCGCTGGGCTGGCCTGGCTTGGCTCGGCTCCGCTCCTCTCCTGTCCTGGCCTGGTCTGGTGTGGTGTGGTGTGGTGTGATGTGAAGTGAATTGAATGGAATGGAATGGGATGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCAGCCCAGCCCAGGCCAGGGCAGGGCAGGGCTGGGCTG","label":"non-promoter"},{"seq":"GGCCAGGCCAGGCCAGGGCAGGGGAGGGGAGGGGACGGGACCGGACCAGACCAGACCAGGCCAGGCCAGGCTAGGCTGGGCTGGGCTGGGCTGGGATGGGAGGGGAGAGGAGAGGAGAGCAGAGCTGAGCTGAGCTGCGCTGCCCTGCCATGCCAAGCCAACCCAACCCAACCGAACCGCACCGCACCGCACCGCACCGCACCTCACCTGACCTGTCCTGTGCTGTGATGTGAAGTGAAGTGAAGGGAAGGAAAGGAAAGGAATGGAATGGAATGGAATGGTATGGTCTGGTCAGGTCAGGTCAGGTCAGGACAGGAAAGGAACGGAACCGAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCATCCCACCCCACCCCACCCCACCCTACCCTGCCCTGGCCTGGGCTGGGATGGGATGGGATGGGATGCGATGCAATGCATTGCATTGCATTCCATTCCATTCCTTTCCTGTCCTGGCCTGGCCTGGCTTGGCTTGGCTTTGCTTTTCTTTTATTTTACTTTACCTTACCATACCAGACCAGTCCAGTTCAGTTAAGTTATGTTATTTTATTC","label":"non-promoter"},{"seq":"ATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGAGCCGAGCCGAGGCGAGGTGAGGTGAGGTGGGGTGGGGTGGGTTGGGTGGGGTGGGGTGGAGTGGATTGGATCGGATCAGATCATATCATCTCATCTCATCTGATCTGATCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTGGTTTGATTTGAATTGAAATGAAACGAAACCAAACCAAACCAGACCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCAGGCCAAGCCAACCCAACACAACATAACATGACATGGCATGGCATGGCATGGCAAGGCAAAGCAAAACAAAACAAAACCAAACCCAACCCCACCCCGCCCCGTCCCGTCCCGTCTCGTCTCGTCTCTTCTCTACTCTACTCTACTCTACTATACTAAACTAAACTAAAATAAAAAAAAAATAAAATAAAATAC","label":"non-promoter"},{"seq":"CCGCCACGCCAGGCCAGGCCAGGCCAGGCTAGGCTCGGCTCCGCTCCTCTCCTCTCCTCTCCTCTGCTCTGCTCTGCACTGCAGTGCAGCGCAGCGCAGCGCAGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGCCGCGCAGCGCAGCGCAGCGCAGCGCAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCGCCTCGCCTCGCGTCGCGGCGCGGTGCGGTACGGTAGGGTAGGGTAGGCTAGGCGAGGCGCGGCGCGGCGCGGCGCGGAGCGGAGCGGAGGGGAGGAGAGGAAAGGAAGGGAAGCGAAGCGAAGCGGAGCGGCGCGGCCCGGCCAGGCCACGCCACACCACAGCACAGGACAGGGCAGGGCAGGGCTGGGCTGGGCTGCGCTGCCCTGCCGTGCCGCGCCGCCCCGCCCCGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAACAGAATAGAATCGAATCGAATCGCATCGCATCGCAGCGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACGGCACGACACGATACGATC","label":"promoter"},{"seq":"CGCAGTGCAGTGCAGTGGAGTGGTGTGGTCTGGTCTGGTCTTGTCTTGTCTTGGCTTGGCTTGGCATGGCAGGGCAGCGCAGCTCAGCTGAGCTGCGCTGCCCTGCCATGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTAGAGTAGAGTAGGGTAGGTTAGGTAAGGTAGGGTAGTGTAGTGTAGTGCAGTGCCGTGCCTTGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGACCGGACCGGACCCGACCCTACCCTCCCCTCGCCTCGCCTCGCTTCGCTACGCTAGGCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCATGGCACGGCACTGCACTACACTATACTATCCTATCATATCACATCACATCACAACACAAAACAAAGCAAAGGAAAGGAAAGGACAGGACAGGACAAGACAAAACAAAGCAAAGCAAAGCTAAGCTGAGCTGGGCTGGTCTGGTTTGGTTGGGTTGAGTTGAATTGAAGTGAAGCGAAGCTAAGCTAAGCTACGCTACACTACAGTACAGAACAGAACAGAAAAGAAATGAAATCAAATCCAATCCC","label":"promoter"},{"seq":"TACCGGACCGGACCGGAGCGGAGAGGAGACGAGACCAGACCGGACCGCACCGCACCGCACCGCACTGCACTGCACTGAACTGAACTGAAGTGAAGAGAAGACAAGACTAGACTGGACTGTACTGTTCTGTTTTGTTTTGTTTTATTTTAGTTTAGATTAGAGTAGAGTAGAGTTGAGTTGAGTTGAGTTGACTTGACTTGACTGGACTGAACTGACCTGACATGACAGGACAGTACAGTGCAGTGGAGTGGCGTGGCATGGCAGGGCAGCGCAGCGCAGCGAAGCGATGCGATTCGATTCGATTCTATTCTCTTCTCCTCTCCTCTCCTGTCCTGTCCTGTCCTGTCTTGTCTCGTCTCCTCTCCACTCCAGTCCAGCCCAGCCCAGCCCAGCCCTGCCCTCCCCTCACCTCAGCTCAGCTCAGCACAGCAGAGCAGTGCAGTGCAGTGTAGTGTCGTGTCCTGTCCCGTCCCTTCCCTTCCCTTTCCTTTGCTTTGGTTTGGGTTGGGCTGGGCAGGGCACGGCACCGCACCCCACCCAACCCAGCCCAGCCCAGCCCAGCCCAGCCCCGCCCCA","label":"non-promoter"},{"seq":"CAGAATAGAATCGAATCGAATCGCATCGCATCGCAACGCAAGGCAAGACAAGAAAAGAATAGAATCGAATCAAATCATATCATGTCATGCCATGCAATGCAGTGCAGAGCAGAGCAGAGCAGAGCGGAGCGAAGCGACGCGACCCGACCTGACCTGACCTGACCTGATCTGATTTGATTTGATTTAATTTACTTTACGTTACGCTACGCTACGCTTCGCTTCGCTTCACTTCACTTCACCTCACCTCACCTAACCTAGCCTAGACTAGATTAGATTAGATTGGATTGAATTGACTTGACTTGACTTGACTTTACTTTTCTTTTTTTTTTATTTTATTTTATTTTATTCTATTCTATTCTGTTCTGCTCTGCACTGCATTGCATCGCATCGCATCGTATCGTTTCGTTGCGTTGTGTTGTGTTGTGTTGTGTTGTGTTCTGTTCTGTTCTTTTCTTCTCTTCCCTTCCCTTCCCCTCCCCCCCCCCACCCCACCCCACTCCACTTCACTTCACTTCCCTTCCTTTCCTCTCCTCTCCTCTTCTCTTCTCTTCTCTTCTTTTCTTGTCTTGCCTTGCT","label":"non-promoter"}],
"epochs": 1
}
}'
import requests
import json
url = "https://biolm.ai/api/v1/finetune_run/"
payload = json.dumps({
"pipeline": "finetune_DNABERT_classifier",
"hyperopt": False,
"input_json": {
"max_train": 40000,
"max_validate": 20000,
"train": [
{
"seq": "CACAGCACAGCCCAGCCAAGCCAGGCCAGCCCAGCCCAGCCAAGCCACGCCACTCCACTACACTAGACTAGGCTAGGCTAGGCCAGGCCCGGCCCTGCCCTGCCCTGTCCTGTCCTGTCCTGTCCTGTCCTGTCCTGCCCTGCACTGCAGTGCAGCGCAGCCCAGCCCAGCCCCGCCCCCCCCCCTCCCCTGCCCTGTCCTGTACTGTAGTGTAGGGTAGGGTAGGGGAGGGGTGGGGTCGGGTCTGGTCTGGTCTGGTCTGGACTGGAATGGAACGGAACAGAACAGAACAGCACAGCCCAGCCAAGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTGGAGTGGAGTGGTGTGGTTTGGTTTGGTTTAGTTTAATTTAAGTTAAGATAAGAGAAGAGGAGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGGAGGGGAGGGGAGGGGAGTGGAGTCGAGTCGAGTCGCGTCGCCTCGCCTCGCCTTGCCTTGCCTTGCCTTGCCTTGCCCTGCCCTGCCCTGCCCTGTCCTGTGCTGTGCTGTGCCGTGCCATGCCACGCCACACCACAC",
"label": "non-promoter"
},
{
"seq": "CTAATCTAATCTAATCTAATCTAGTCTAGTCTAGTATAGTAAAGTAATGTAATGTAATGCAATGCCATGCCGTGCCGCGCCGCGCCGCGTCGCGTTGCGTTGCGTTGGGTTGGTTTGGTGTGGTGGGGTGGAGTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACATGACATGACATGACATGACATGACATGACATGACATAACATACCATACCATACCTTACCTCACCTCACCTCAACTCAAATCAAACCAAACAAAACAGAACAGCACAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCCGCGCCACGCCATGCCATGCCATGCCATGCGATGCGCTGCGCCGCGCCACGCCAAGCCAAGCCAAGCCAAGCCAAGCCCAGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAGCAGAGGAGAGGGGAGGGTAGGGTTGGGTTGGGTTGTGTTGTCTTGTCCTGTCCAGTCCAATCCAACCCAACTCAACTCAACTCCACTCCTCTCCTATCCTATCCTATTCTATTCTATTCCATTCCT",
"label": "promoter"
},
{
"seq": "GGAAGAGAAGAGAAGAGGAGAGGGGAGGGAAGGGAAGGGAAGGGAAGGGAAGGAAAGGAAAGGAAAGGAAATGAAATGAAATGCAATGCCATGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTCGGGTCCGGTCCCGTCCCATCCCAGCCCAGGCCAGGCCAGGCGAGGCGGGGCGGGGCGGGGCGGGGCGGGGCCGGGCCTGGCCTCGCCTCGCCTCGACTCGAGTCGAGCCGAGCGGAGCGTAGCGTGGCGTGCCGTGCCGTGCCCTGCCCAGCCCACCCCACGCCACGCCACGCCACGCCGCGCCGCGCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGCCTGGCCTGGCCAGGCCACGCCACGCCACGCCACGCCACGCCTCGCCTGGCCTGGCCTGGACTGGAGTGGAGTGGAGTTGAGTTGAGTTGCGTTGCATTGCAGTGCAGGGCAGGACAGGAAAGGAACGGAACCGAACCGAACCGGACCGGGCCGGGCCGGGCGGGGCGCGGCGCCGCGCCGCGCCGGGCCGGG",
"label": "promoter"
},
{
"seq": "CGAAAGGAAAGCAAAGCAAAGCAAAGCAATGCAATCCAATCAAATCAGATCAGTTCAGTGCAGTGGAGTGGCGTGGCCTGGCCTGGCCTGGCCTGGCCTGGACTGGACTGGACCGGACCAGACCATACCATGCCATGTCATGTGATGTGTTGTGTAGTGTAGTGTAGTGTAGTATAGTATAGTATAGTATAGTATAGAATAGAGTAGAGAAGAGAGGAGAGCAGAGCAGAGCAAAGCAACGCAACACAACAGAACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCATGCCATCCCATCTCATCTAATCTATTCTATGCTATGCTATGCTATGCTTTGCTTAGCTTAACTTAATTTAATTTAATTTAATTTGATTTGGTTTGGCTTGGCATGGCAAGGCAACGCAACACAACATAACATTACATTACATTACATTACATTACATTACATGACATGTCATGTAATGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCGTCCCGGCCCGGACCGGAACGGAAAGGAAAAGAAAATAAAATCAAATCTAATCTTATCTTTTCTTTTCTTTTATTTTAA",
"label": "promoter"
},
{
"seq": "TGACTCGACTCCACTCCCCTCCCATCCCAACCCAAACCAAACCAAACCAAACCAAACCAAACCAACCCAACACAACAAAACAAAACAAAACAAAAGAAAAGGAAAGGGAAGGGGAGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGTGGAGTTGAGTTCAGTTCAGTTCATTTCATCTCATCACATCACATCACCTCACCACACCACACCACTCCACTACACTAGACTAGACTAGACTAGACTAGACTTGACTTTACTTTCCTTTCCTTTCCTTTCCTTTCCTTACCTTATCTTATATTATAATATAAAATAAAATAAAAAAAAAAAAAAAACAAAACAAAACACAACACTACACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGAAGGGAGGGGAGTGGAGTGGAGTGCAGTGCTGTGCTTTGCTTAGCTTAACTTAAGTTAAGCTAAGCAAAGCAGAGCAGAGCAGAACAGAAAAGAAAGGAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAAAAAAATAAAATAAAATACAATACTATACTATACTAA",
"label": "promoter"
},
{
"seq": "AAGCATAGCATGGCATGACATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAAGGAAAGAAAAGACAAGACTAGACTGGACTGGACTGGGCTGGGCTGGGCTGGGCTAGGCTAGGCTAGGCTAGGCTAGGCAAGGCACGGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACATCACACCACACCACACCTCACCTGACCTGTCCTGTACTGTAATGTAATGTAATCTAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACAAGACAGGACAGGACAGGCCAGGCAAGGCAGGGCAGAGCAGATCAGATGAGATGAGATGACATGACCTGACCTGACCTGACCTGACCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTG",
"label": "non-promoter"
},
{
"seq": "GGCTTTGCTTTGCTTTGTTTTGTTTTGTTTTGTTTTGTTTTCTTTTCTTTTCTGTTCTGTTCTGTGCTGTGATGTGAGGTGAGTTGAGTTGAGTTAAGTTACGTTACGTTACGGTACGGGACGGGGCGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCATGCCACGCCACCCCACCTCACCTGACCTGCCCTGCACTGCAGTGCAGGGCAGGTCAGGTAAGGTAAGGTAAAGTAAAATAAAATAAAATCAAATCTAATCTGATCTGGTCTGGACTGGACTGGACAGGACATGACATTACATTGCATTGCATTGCCTTGCCCTGCCCTGCCCTGCCCTGACCTGAACTGAAATGAAATGAAATTAAATTGAATTGAATTGACTTGACCTGACCGGACCGAACCGAACCGAACCGAACCGAACCTAACCTTACCTTGCCTTGGCTTGGATTGGATTGGATAGGATACGATACAATACAATACAAAACAAACCAAACCAAACCCAACCCGACCCGGCCCGGCCCGGCCCGGCCTGGCCTGGCCTGACCTGACCTGACATGACAGGACAGTACAGTG",
"label": "promoter"
},
{
"seq": "TCACCGCACCGTACCGTTCCGTTACGTTACGTTACTTTACTGTACTGCACTGCCCTGCCTTGCCTCGCCTCCCCTCCTCTCCTATCCTAGCCTAGTCTAGTGTAGTGGAGTGGCGTGGCGTGGCGGGGCGGAGCGGATCGGATAGGATACGATACGATACGGTACGGCACGGCGCGGCGGGGCGGCGCGGCACGGCAAGGCAATGCAATACAATAGAATAGTATAGTGTAGTGGAGTGGCGTGGCGTGGCGCGGCGCAGCGCACCGCACAGCACATCACATTACATTCCATTCAATTCAATTCAAGTCAAGGCAAGGCAAGGCAAGGCAGGGCAGGGCAGGACAGGAAAGGAAGGGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGAGAAGAGGAGAGGAGAGGAAAGGAACGGAACAGAACAGAACAGAACAGAGCAGAGCAGAGCCGAGCCAAGCCACGCCACCCCACCACACCAGACCAGCCCAGCACAGCAGAGCAGGGCAGGTCAGGTTAGGTTTGGTTTGGTTTGGTTTGGCTTGGCCTGGCCCGGCCCAGCCCAGCCCAGTCCAGTG",
"label": "promoter"
},
{
"seq": "AGAAAAGAAAACAAAACAAAACAAAACAAAACAAAACAAAAGAAAAGCAAAGCTAAGCTCAGCTCCGCTCCGCTCCGGTCCGGACCGGAGCGGAGTGGAGTAGAGTAGAGTAGGGTAGGATAGGAAAGGAAAGGAAAGGAAAGTAAAGTGAAGTGAAGTGACGTGACATGACACGACACAACACAGCACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCACGCCACCCCACCTCACCTCACCTCCCCTCCCCTCCCGTCCCGGCCCGGTCCGGTACGGTAGGGTAGCGTAGCCTAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCGGGCCGGGCCGGCCCGGCCCGGCCAGGCCAAGCCAAGCCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCTTGGCTTGGCTTAGCTTAACTTAAGTTAAGCTAAGCGAAGCGGAGCGGGGCGGGCCGGGCCGGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCCTGGCCTGGCCTGGCCTGCCCTGCCCTGCCATGCCAAGCCAAACCAAAA",
"label": "promoter"
},
{
"seq": "AAGTAGAGTAGAGTAGAGTAGAGGAGAGGCGAGGCCAGGCCTGGCCTCGCCTCCCCTCCTCTCCTGTCCTGCCCTGCTCTGCTTTGCTTCGCTTCACTTCAGTTCAGGTCAGGGCAGGGAAGGGAAGGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGAGTAGAGCAGAGCCGAGCCGAGCCGGGCCGGTCCGGTGCGGTGTGGTGTCGTGTCTTGTCTCGTCTCGTCTCGCCTCGCATCGCACCGCACCGCACCACACCAGACCAGACCAGAGCAGAGCAGAGCCGAGCCCAGCCCCGCCCCACCCCAGCCCAGACCAGATCAGATGAGATGGGATGGAATGGAATGGAACGGAACTGAACTCAACTCTACTCTGCTCTGTTCTGTCCTGTCCTGTCCCGTCCCATCCCATCCCATTCCATTCCATTCAATTCACTTCACATCACATCACATTACATTACATTAAATTAATTTAATTTAATTGAATTGAATTGAATTGAATTGAATCGAATCCAATCCAATCCAGTCCAGTCCAGTACAGTACAGTACTGTACTTTACTTTACTTTGCTTTGA",
"label": "non-promoter"
},
{
"seq": "GGTGCAGTGCAATGCAAGGCAAGGCAAGGAAAGGAAAGGAATGGAATGGAATGAAATGAAATGAAGTGAAGCGAAGCCAAGCCAAGCCAAGCCAATCCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAACTCAATTCAATCCAATCAAATCATATCATCTCATCGCATCGAATCGAGTCGAGGCGAGGCGAGGCTAGGCTAGGCTACGCTACCCTACCCTACCCTACCCTGCCCTGCCCTGCCCTGCCATGCCATGCCATCCCATCTCATCTTATCTTGTCTTGTCTTGTGTTGTGGTGTGGCGTGGCCTGGCCAGGCCATGCCATGCCATGTCATGTGATGTGATGTGAGGTGAGGTGAGGGGAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACGACACGTCACGTGACGTGTCGTGTAGTGTAGTGTAGAGTAGATTAGATCAGATCAGATCAAATCAATTCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAGCTCAGCTCAGCACAGCATAGCATCGCATCACATCACATCACA",
"label": "promoter"
},
{
"seq": "AGAGACGAGACTAGACTGGACTGGACTGGCCTGGCATGGCAAGGCAAGGCAAGGCAAGGAAAGGACAGGACAGGACAGGACAGGACAGGCCAGGCTAGGCTCGGCTCGGCTCGCCTCGCCTCGCCCCGCCCTGCCCTTCCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTAGTGTAGAGTAGAGTAGAGCAGAGCCGAGCCTAGCCTCGCCTCGCCTCGCCTCGCATCGCATCGCATTGCATTGCATTGGATTGGCTTGGCCTGGCCAGGCCACGCCACCCCACCACACCAGACCAGGCCAGGACAGGAGAGGAGGGGAGGCGAGGCAAGGCAGGGCAGTGCAGTGCAGTGTAGTGTTGTGTTGTGTTGTGTTGTCTTGTCTTGTCTGGTCTGCTCTGCCCTGCCTTGCCTCGCCTCTCCTCTCCTCTCGTCTCGACTCGAATCGAACCGAACTGAACTTAACTTGACTTGGCTTGGCTTGGCTTGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAACCCAAGCCAAGGCAAGGTAAGGTGAGGTGAGGTGAGGTGAGATGAGAAGAGAAG",
"label": "promoter"
},
{
"seq": "TGGCGAGGCGACGCGACCCGACCCGACCCCACCCCACCCCAACCCAACCCAACCCAACCTAACCTGACCTGCCCTGCCCTGCCCTGCCCTGCCCTTCCCTTGCCTTGCCTTGCTTTGCTTTGCTTCGCTTCGCTTCGGTTCGGATCGGACCGGACAGGACACGACACTACACTGCACTGCACTGCACTGCAGTGCAGCGCAGCACAGCACAGCACCGCACCCCACCCAACCCAACCCAATCCAATGCAATGGAATGGCATGGCGTGGCGCGGCGCCGCGCCCCGCCCAGCCCAGCCCAGACCAGAACAGAACAGAACCGAACCCAACCCGACCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCCCTGCCGTGCCGCGCCGCGCCGCGGCGCGGGGCGGGCCGGGCAGGGCAGGGCAGTGCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCCAGCCCGGCCCGGCCCGGGCCGGGACGGGATGGGATAGGATAGGATAGCATAGCGTAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCACCCCAA",
"label": "non-promoter"
},
{
"seq": "CTGTGTTGTGTAGTGTATTGTATAGTATATTATATCATATCTTATCTGATCTGTTCTGTACTGTAATGTAAAGTAAAGTAAAGTAAAGTTAAGTTAAGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGTCCAGTCCAGTCAAGTCAAGTCAACTCAACGCAACGCAACGCTACGCTACGCTAGGCTAGGCTAGGGTAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACTACACTCCACTCTACTCTGCTCTGCTCTGCACTGCAATGCAACGCAACACAACACAACACTACACTCCACTCTACTCTACTCTAGTCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCCTGGCCTGGCCTTGCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTATTGTATAGTATATTATATAATATATTATATGATATGGTATGGCATGGCATGGCAGGGCAGAGCAGAACAGAAAAGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAAAAGAAAGGAAAGTAAAGTAAAGTAAAGTAAAGTAAAT",
"label": "non-promoter"
},
{
"seq": "CTATATTATATTATATTTTATTTGATTTGGTTTGGATTGGACTGGACAGGACAAGACAATACAATCCAATCGAATCGCATCGCCTCGCCGCGCCGTGCCGTGCCGTGACGTGATGTGATTTGATTAGATTAAATTAAATTAAACTAAACGAAACGAAACGAGACGAGTCGAGTGGAGTGTAGTGTAGTGTATTGTATGGTATGATATGAAATGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGTGGCGTCGCGTCTCGTCTGGTCTGATCTGAACTGAAGTGAAGCGAAGCTAAGCTAAGCTAGGCTAGGCTAGGGTAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTACGCTAGGCTAGACTAGATTAGATAAGATAAGATAAAATAAACTAAACAAAACACAACACTACACTGCACTGAACTGATCTGATTTGATTTGATTTCATTTCCTTTCCCTTCCCCTCCCCTCCCCTTCCCTTTCCTTTACTTTAGTTTAGGTTAGGGTAGGGAAGGGAAGGGAAAGGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAATAGAATG",
"label": "promoter"
},
{
"seq": "AACTGCACTGCACTGCAGTGCAGGGCAGGACAGGATAGGATGGGATGCGATGCTATGCTCTGCTCTGCTCTTCTCTTGTCTTGGCTTGGATTGGAGTGGAGTGGAGTTGAGTTCAGTTCTGTTCTGTTCTGGTCTGGTCTGGTCTGGTCTGGTCTAGTCTACTCTACTCTACTCTACTCTACTCTGCTCTGCTCTGCGCTGCGATGCGATGCGATGCGATGCGATGCTATGCTTTGCTTGGCTTGTCTTGTTTTGTTTTGTTTGGTTTGCTTTGCATTGCAATGCAAAGCAAAACAAAACAAAACCAAACCCAACCCTACCCTGCCCTGTCCTGTCCTGTCATGTCATGTCATGTCATGACATGAGATGAGATGAGAAGAGAAGAGAAGGGAAGGTAAGGTCAGGTCCGGTCCAGTCCACTCCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGCATGGCATGGCAGGGCAGGGCAGGTCAGGTTAGGTTTGGTTTCGTTTCATTTCAGTTCAGCTCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCTGTGCTATGCTAC",
"label": "promoter"
},
{
"seq": "GCTTTCCTTTCCTTTCCTTTCCTGTCCTGGCCTGGCCTGGCCTGGCCCGGCCCCGCCCCCCCCCCACCCCAACCCAAGCCAAGACAAGAGAAGAGTAGAGTGGAGTGCAGTGCAGTGCAGTGCAGGGCAGGGCAGGGAAGGGATGGGATGGGATGCGATGCCATGCCCTGCCCAGCCCAGCCCAGGCCAGGTCAGGTCAGGTCTGGTCTGGTCTGCTCTGCACTGCAATGCAACGCAACCCAACCAAACCACACCACCCCACCACACCACACCACTCCACTGCACTGGACTGGGCTGGGTTGGGTGGGGTGGGGTGGCGTGGCTTGGCTGGGCTGCGCTGCACTGCAGTGCAGCGCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCCGTGCCCTGCCCAGCCCAGCCCAGGCCAGGACAGGAGAGGAGGGGAGGGGAGGGTAGGGTGGGGTGGGGTGGGGTGGGATGGGACGGGACTGGACTCGACTCCACTCCTCTCCTGTCCTGCCCTGCCCTGCCCTGCCCCGCCCCACCCCACCCCACCCCACCACACCAAACCAACCCAACTCAACTC",
"label": "non-promoter"
},
{
"seq": "TGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCCGTGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTG",
"label": "non-promoter"
},
{
"seq": "GGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGAAGGGAAGGGAAGTGAAGTGAAGTGCAGTGCAGTGCATTGCATCGCATCCCATCCCATCCCTTCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAAAGAAACGAAACAAAACATAACATGACATGGCATGGGATGGGTTGGGTAGGGTAAGGTAAGGTAAGCTAAGCGAAGCGTAGCGTGGCGTGTCGTGTGGTGTGCTGTGCAGTGCAGTGCAGAGCAGACCAGACGAGACGTGACGTGACGTGGCGTGGAGTGGAGTGGAGAGGAGAGGAGAGGAGAGGGGAGGGCAGGGCGGGGCGTGGCGTGGCGTGGCGTGGGGTGGGGTGGGGTGGGGTGGGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGACGGGACGGGACGCGACGCCACGCCCCGCCCAGCCCACCCCACCCCACCCCACCCCACCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAA",
"label": "non-promoter"
},
{
"seq": "AAGTTTAGTTTTGTTTTTTTTTTCTTTTCCTTTCCATTCCACTCCACCCCACCTCACCTGACCTGCCCTGCCCTGCCATGCCACGCCACTCCACTTCACTTCACTTCACTTCACTTCACATCACAACACAATACAATGCAATGAAATGACATGACCTGACCCGACCCTACCCTCCCCTCCCCTCCACTCCAGTCCAGCCCAGCGCAGCGCAGCGCCGCGCCCCGCCCTGCCCTCCCCTCTCCTCTACTCTACTCTACTCTACTGTACTGGACTGGCCTGGCATGGCAGGGCAGAGCAGAGCAGAGAAGAGACGAGACTAGACTAGACTAGACTAGCCTAGCATAGCATAGCATCGCATCACATCAAATCAAGTCAAGCCAAGCCAAGCCAAGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGCCTGGCATGGCAAGGCAAAGCAAACCAAACCAAACCAAACCAGACCAGACCAGAGCAGAGGAGAGGCGAGGCGAGGCGTGGCGTCGCGTCCCGTCCTGTCCTTTCCTTTCCTTTACTTTAATTTAAGTTAAGGTAAGGTAAGGTCAGGTCC",
"label": "promoter"
},
{
"seq": "TTTTTTTTTTTTTTTTTGTTTTGCTTTGCGTTGCGGTGCGGGGCGGGGCGGGGCGGGGCGGGGCGCGGCGCAGCGCAGCGCAGTGCAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGTCCTGTACTGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACGAGACGGGACGGGACGGGCCGGGCAGGGCAGGGCAGAGCAGATCAGATCAGATCAGATCACATCACGTCACGACACGAGACGAGGCGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGAGGAGATGAGATCAGATCGGATCGAATCGAGTCGAGACGAGACGAGACTAGACTAGACTATACTATCCTATCCTATCCTATCCTGTCCTGGCCTGGCCTGGCTTGGCTAGGCTAA",
"label": "non-promoter"
},
{
"seq": "CCGCCTCGCCTTGCCTTCCCTTCCCTTCCCTTCCCTTCCCTCCCCTCTCCTCTGCTCTGTTCTGTTCTGTTTTGTTTTGTTTTTTTTTTGTTTTGGTTTGGCTTGGCATGGCATGGCATAGCATAACATAAGATAAGATAAGAAAAGAAAAGAAACGAAACAAAACAAAACAATACAATTCAATTCAATTCAATTCAGTTCAGGTCAGGTCAGGTTAGGTTTGGTTTAGTTTATTTTATCTTATCATATCAAATCAAGTCAAGGCAAGGAAAGGAGAGGAGAGGAGAGGAGAGTAGAGTCGAGTCCAGTCCAGTCCAGTCCAGGCCAGGGCAGGGTAGGGTCGGGTCAGGTCAGGTCAGATCAGAACAGAATAGAATTGAATTTAATTTTATTTTTTTTTTCTTTTCTTTTCTATTCTAATCTAACCTAACCTAACCAAACCACACCACCCCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCAGCGCAGCGCAGAGCAGAGCAGAGCAGAGCAGAGCAAAGCAAGGCAAGCCAAGCTAAGCTTAGCTTA",
"label": "promoter"
},
{
"seq": "ACAAAACAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAGAAAAGCAAAGCGAAGCGGAGCGGGGCGGGACGGGAAGGGAAGGGAAGCGAAGCAAAGCAGAGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGGCTTGGGTTGGGTTGGGTTGGGTTAGGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGCCCAGCACAGCACAGCACAGCACACCACACCACACCCCACCCAACCCACCCCACCCCACCACACCAGACCAGACCAGAGCAGAGGAGAGGGGAGGGCAGGGCAGGGCAGGGCAGCGCAGCACAGCAGAGCAGAGCAGACCAGACAAGACACGACACTACACTGCACTGGACTGGCCTGGCTTGGCTAGGCTAAGCTAAACTAAAGTAAAGCAAAGCTAAGCTCAGCTCTGCTCTTCTCTTATCTTAGCTTAGTTTAGTCTAGTCAAGTCATGTCATATCATAACATAAGATAAGTTAAGTCAAGTCCAGTCCTGTCCTGTCCTGACCTGAGCTGAGTTGAGTGGAGTGCAGTGCT",
"label": "promoter"
},
{
"seq": "ACTGGACTGGAATGGAAAGGAAAAGAAAATAAAATTAAATTTAATTTTATTTTATTTTAATTTAAATTAAATTAAATGAAATGAAATGAAATGAATTGAATGGAATGAAATGATATGATGTGATGTGATGTGATGTGATGTGATGTGATTTGATTCGATTCTATTCTGTTCTGTTCTGTGCTGTGGTGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCTCTCCTGTCCTGTCCTGTGCTGTGGTGTGGGGTGGGCTGGGCAGGGCAGGGCAGCGCAGCACAGCACAGCACTGCACTGCACTGGACTGGCCTGGCCTGGCCTGGCCTGGCCTGACCTGAACTGAAGTGAAGCGAAGCAAAGCACAGCACAGCACAACACAAAACAAACCAAACCAAACCTAACCTGACCTGGCCTGGACTGGAGTGGAGCGGAGCCGAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGCCCAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAG",
"label": "promoter"
},
{
"seq": "AGAGCTGAGCTGAGCTGTGCTGTCCTGTCTTGTCTGGTCTGCTCTGCTCTGCTGTGCTGGGCTGGGCTGGGGTGGGGGGGGGGCGGGGCAGGGCAGGGCAGGGCAGGGCAGGGCAGGGCGGGGCGCGGCGCTGCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTGTTCTGGTCTGGGCTGGGCTGGGCAGGGCACGGCACTGCACTGCACTGTACTGTACTGTAGTGTAGGGTAGGATAGGATAGGATGGGATGTGATGTTATGTTATGTTAGGTTAGCTTAGCATAGCAGAGCAGCGCAGCGCAGCGAAGCGACGCGACCCGACCCGACCCTACCCTGCCCTGGCCTGGCCTGGCCTGGCCTGGCCTCGCCTCTCCTCTACTCTACTCTACCCTACCATACCACACCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGCGATGCCATGCCATGCCAGGCCAGTCCAGTACAGTAGAGTAGCGTAGCATAGCACAGCACCGCACCCCACCCTACCCTCCCCTCCCCTCCTCTCCTCTCCTCTCCTCTCCTCTCCTCTCCACTCCAG",
"label": "promoter"
},
{
"seq": "GCTTTGCTTTGTTTTGTTTTGTTATGTTACGTTACATTACAGTACAGGACAGGTCAGGTGAGGTGTGGTGTCGTGTCTTGTCTGGTCTGTTCTGTTCTGTTATGTTAAGTTAACTTAACATAACATAACATTACATTCCATTCCATTCCATTCCATTCCATGCCATGGCATGGAATGGACTGGACCGGACCAGACCAAACCAAACCAAAACAAAACAAAACAAAACAAAACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAAGCCAAACCAAACCAAACCAAACCCAACCCAACCCAACCCAAACCAAAACAAAATAAAATCAAATCAAATCAAATCAAGTCAAGGCAAGGGAAGGGAAGGGACGGGACAGGACAGGACAGGACAGGACAGGAAAGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGACTAGACTAGACTCGACTCCACTCCACTCCACTCCACCCCACCCCACCCAACCCATCCCATGCCATGCCATGCAATGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGAGTGGAATGGAAGGGAAGGGAAGGGAAGGGGAGGGGA",
"label": "non-promoter"
},
{
"seq": "GGTGATGTGATGTGATGCGATGCTATGCTATGCTACGCTACACTACAGTACAGGACAGGGCAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCTGCCCTCCCCTCACCTCATCTCATATCATAGCATAGGATAGGATAGGACAGGACAGGACAGGACAGGACAGGTCAGGTGAGGTGCGGTGCTGTGCTCTGCTCAGCTCACCTCACCTCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGAGCGGAGCGGAGGGGAGGAGAGGACAGGACAGGACAAGACAACACAACCCAACCCAACCCGACCCGTCCCGTCCCGTCCCGTCCGGTCCGGTCCGGGCCGGGCCGGGCTGGGCTGGGCTGGGCTGGACTGGAGTGGAGCGGAGCAGAGCAGAGCAGGGCAGGTCAGGTCAGGTCAGGTCAAGTCAAGTCAAGACAAGAGAAGAGGAGAGGCGAGGCTAGGCTCGGCTCTGCTCTGCTCTGGTCTGGGCTGGGATGGGAGGGGAGAGGAGACGAGACAAGACACGACACTACACTTCACTTCACTTCC",
"label": "non-promoter"
},
{
"seq": "GGGGCAGGGCAGGGCAGGGCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCCAAGCCCAGCCCAGCCCATCCCATCCCATCTCATCTAATCTACTCTACACTACAATACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAGGCCAGTCCAGTGCAGTGGAGTGGCGTGGCTTGGCTTGGCTTTGCTTTTCTTTTCTTTTCCTTTCCCTTCCCCTCCCCCCCCCCACCCCAACCCAACCCAACCCAACCCAACCCAACCCAGCCCAGTCCAGTCCAGTCCAGTCCTGTCCTTTCCTTCCCTTCCCTTCCCTTCCCATCCCAACCCAAACCAAATCAAATTAAATTCAATTCCATTCCCTTCCCATCCCACCCCACACCACAGCACAGCACAGCCCAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGACCAGATCAGATCAGATCCGATCCCATCCCTTCCCTGCCCTGCCCTGCACTGCAATGCAACGCAACCCAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGCCCCGCTCCGCTT",
"label": "non-promoter"
},
{
"seq": "ATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTCGTTTCCTTTCCGTTCCGTTCCGTGCCGTGGCGTGGGGTGGGATGGGAGGGGAGAGGAGAGGAGAGGAGAGGTGAGGTAAGGTAAGGTAACGTAACATAACACAACACAACACAACACAATACAATACAATAGAATAGCATAGCTTAGCTTAGCTTGGCTTGTCTTGTATTGTATTGTATCGTATCATATCAGATCAGTTCAGTCCAGTCAAGTCATGTCATTTCATTACATTACATTACCTTACCATACCACACCACTCCACTTCACTTGACTTGACTTGAGTTGAGTTGAGTGGAGTGTAGTGTGGTGTGATGTGAAGTGAAGTGAAGCGAAGCAAAGCAGAGCAGTGCAGTTCAGTTAAGTTAGGTTAGTTTAGTCTAGTCAAGTCAAGTCAAATCAAAGCAAAGTAAAGTCAAGTCTAGTCTGGTCTGGTCTGGGCTGGGATGGGAGGGGAGTGGAGTGGAGTGAAGTGAAGTGAATTGAATGGAATGAAATGAGATGAGATGAGAGGAGAGTAGAGTAGAGTAGAGTAGAGTAGAA",
"label": "non-promoter"
},
{
"seq": "AACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAAAAAAAAAAAAAACAAAACAAAACACAACACAACACAGCACAGCACAGCACAGCAAAGCAAAGCAAACCAAACCAAACCTAACCTGACCTGTCCTGTACTGTATTGTATGGTATGTTATGTTATGTTGTGTTGTGTTGTCTTGTCCTGTCCCGTCCCTTCCCTTCCCTTCCCTTCCCTTCCATTCCAGTCCAGGCCAGGTCAGGTCAGGTCCGGTCCCGTCCCCTCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGGCTGGGCTGGGCAGGGCATGGCATTGCATTTCATTTGATTTGCTTTGCATTGCAGTGCAGAGCAGAACAGAACAGAACCGAACCGAACCGCACCGCACCGCAGCGCAGCGCAGCACAGCATAGCATCGCATCCCATCCCATCCCATCCCAGCCCAGACCAGATCAGATCAGATCAGATCACATCACTTCACTCCACTCGACTCGTCTCGTTTCGTTACGTTAAGTTAAATTAAAATAAAAAAAAAAAAAAAATAAAATT",
"label": "promoter"
},
{
"seq": "TCCTGACCTGATCTGATATGATAAGATAAAATAAACTAAACCAAACCCAACCCAACCCATCCCATGCCATGGCATGGGATGGGATGGGATGGGATCGGATCTGATCTCATCTCATCTCATCTCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATTGAATTAAATTATATTATTTTATTCTATTCAATTCATTTCATTTCATTACATTATATTATCTTATCATATCATATCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATAGAATAGAATAGTATAGTATAGTATAGTATGGTATGGTATGGGATGGGATGGGAAGGGAAAGGAAAGGAAAGAAAAGACAAGACCAGACCAGACCAGACCAGTCCAGTCCAGTCCAGTCCCGTCCCCTCCCCACCCCATCCCATGCCATGACATGATATGATTTGATTCGATTCAATTCAATTCAATTCAATTCAATTAAATTACATTACCTTACCTTACCTCACCTCCCCTCCCCTCCCCTCCCCCCCCCCTCCCCTGCCCTGGCCTGGGCTGGGTTGGGTCGGGTCCGGTCCCGTCCCT",
"label": "non-promoter"
},
{
"seq": "GTCATCTCATCGCATCGTATCGTATCGTAGCGTAGTGTAGTATAGTACAGTACTGTACTATACTACACTACACTACATTACATTACATTTCATTTTATTTTATTTTAATTTAAATTAAACTAAACAAAACATAACATGACATGTCATGTAATGTAATGTAAAGTAAAGTAAAGAAAAGAGAAGAGCAGAGCTGAGCTCAGCTCAGCTCAGCTCAGTTCAGTGCAGTGGAGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCACTCCAATCCAAGCCAAGACAAGAAAAGAAGAGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGACAAGACTAGACTTGACTTTACTTTGCTTTGGTTTGGTTTGGTATGGTAGGGTAGAGTAGAGTAGAGAAGAGACGAGACGAGACGGGACGGCACGGCCCGGCCGGGCCGCGCCGCTCCGCTTCGCTTGGCTTGCCTTGCTTTGCTCTGCTCCGCTCCCCTCCCATCCCAACCCAAACCAAATCAAATAAAATATAATATCATATCATATCATATCATGTCATGCCATGCTATGCTGTGCTGA",
"label": "non-promoter"
},
{
"seq": "ACTGCGCTGCGCTGCGCGGCGCGCCGCGCCGCGCCGCGCCGAGCCGACCCGACGCGACGGGACGGTACGGTGCGGTGGGGTGGGGTGGGCTGGGCTGGGCTGGGCTGGGCTGGCCTGGCGTGGCGGGGCGGGGCGGGACGGGACGGGACCGGACCAGACCAGACCAGGCCAGGACAGGACAGGACAGGACAGGACAGGACAGGACAGGAAAGGAACGGAACAGAACAAAACAATACAATGCAATGGAATGGGATGGGATGGGATGGGATTGGATTCGATTCCATTCCGTTCCGATCCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGCGCCGCACCGCAACGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTCCGCTCCGCTCCTCTCCTTTCCTTCCCTTCTCTTCTGTTCTGCTCTGCGCTGCGGTGCGGGGCGGGTCGGGTTGGGTTGGGTTGGGTTGGGTTGGGGTGGGGTGGGGTGGGGTGCGGTGCGGTGCGATGCGAGGCGAGGCGAGGCGAGGCCAGGCCGGGCCGGGCCGGA",
"label": "promoter"
},
{
"seq": "TGTGCTGTGCTGTGCTGAGCTGATCTGATGTGATGCGATGCCATGCCTTGCCTGGCCTGTCCTGTGCTGTGGTGTGGTGTGGTTTGGTTTGGTTTGGTTTGGTTTGGTTTGGTGTGGTGGGGTGGGGTGGGGTGGGGCGGGGCTGGGCTAGGCTACGCTACACTACAATACAACACAACACAACAGAACAGGACAGGACAGGAAAGGAAAGGAAATGAAATTAAATTCAATTCCATTCCTTTCCTGTCCTGCCCTGCTCTGCTTTGCTTTGCTTTGCTTTGGTTTGGATTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACAGGACAGAACAGAACAGAAAAGAAAGGAAAGCAAAGCAAAGCAGAGCAGAGCAGATCAGATAAGATAGGATAGCATAGCCTAGCCAAGCCAAGCCAAACCAAATCAAATTAAATTCAATTCTATTCTCTTCTCTTCTCTCCTCTCTTCTCTACTCTACTCTACCCTACCATACCACACCACACCACATCACATTACATTTCATTTTATTTTGTTTTGGTTTGGATTGGAATGGAAAGGAAACGAAACT",
"label": "non-promoter"
},
{
"seq": "GGTTCCGTTCCCTTCCCGTCCCGCCCCGCTCCGCTTCGCTTCGCTTCCCTTCCATTCCACTCCACCCCACCGCACCGAACCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGAGCCGACCCGACTCGACTGGACTGCACTGCGCTGCGATGCGAGGCGAGGCGAGGTGAGGTGAGGTGCGGTGCAGTGCATTGCATGGCATGCCATGCTATGCTGTGCTGGGCTGGGCTGGGATGGGAGGGGAGTGGAGTCGAGTCGAGTCGTGTCGTATCGTAGCGTAGTGTAGTATAGTACAGTACCGTACCGTACCGCACCGCACCGCACCGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCGGGGCGGGGCGGAGCGGAACGGAACGGAACAGAACAGAACAGCACAGCTCAGCTCAGCTCCGCTCCGCTCCGCTCCGCCCCGCCCCGCCCCGCCCCGCCCCGGCCCGGCCCGGCGCGGCGGGGCGGAGCGGATCGGATGGGATGGGATGGTATGGTGTGGTGTGGTGTTGTGTTTTGTTTCGTTTCCTTTCCATTCCAGTCCAGA",
"label": "non-promoter"
},
{
"seq": "GCCCGGCCCGGGCCGGGACGGGAGGGGAGCGGAGCGGAGCGTAGCGTCGCGTCGCGTCGCGTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCCCTCCCCTCCCCCCCCCCGCCCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCCCCGCCCCGCCCCGCCCCGCCTCGCCTCGCCTCGCCTCGGCTCGGGTCGGGGCGGGGAGGGGACGGGACTGGACTCGACTCGACTCGTCTCGTCTCGTCCCGTCCCGTCCCTTCCCTCCCCTCCCCTCCACTCCACTCCACACCACAGCACAGCACAGCCCAGCCCAGCCCCGCCCCTCCCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGTCCCGTCCCGTCGCGTCGGGTCGGATCGGAACGGAATGGAATTGAATTCAATTCGATTCGCTTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCGCTCCGCTCCGCCCCGCCGCGCCGTGCCGTTCCGTTCCGTTCTGTTCTT",
"label": "non-promoter"
},
{
"seq": "CTGGCTTGGCTGGGCTGCGCTGCTCTGCTCTGCTCCGCTCCTCTCCTTTCCTTACCTTACCTTACATTACAATACAAAACAAACCAAACCAAACCTAACCTGACCTGTCCTGTGCTGTGGTGTGGAGTGGAGTGGAGTGGAGTTGAGTTGAGTTGGGTTGGATTGGACTGGACTGGACTTGACTTGACTTGCCTTGCTTTGCTGTGCTGTGCTGTTCTGTTTTGTTTTGTTTTTTTTTTCTTTTCCTTTCCTTTCCTCTCCTCTCCTCTTCTCTTGTCTTGCCTTGCCTTGCCATGCCACGCCACTCCACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGTAGGGTAGGGTAGGGTAGAGTAGAATAGAATAGAATAGAATATAATATGATATGATATGAAATGAAATGAAAAGAAAAGAAAAGAAAAGAAAAGAAGAGAAGAGAAGATAAGATTAGATTAGATTAGATTAGCTTAGCATAGCATAGCATGGCATGTCATGTTATGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATCTCATCCCATCCTATCCTTTCCTTGCCTTGC",
"label": "promoter"
},
{
"seq": "CCCTGCCCTGCTCTGCTATGCTACGCTACACTACAGTACAGTACAGTTCAGTTTAGTTTTGTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTGTTTTGATTTGATTTGATATGATATGATATAATATACTATACAATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTTGTTTTGTTTTGCTTTGCCTTGCCATGCCAGGCCAGCCCAGCTCAGCTTAGCTTTGCTTTTCTTTTTTTTTTTTTTTTCTTTTCATTTCACTTCACTTCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGTATGGTGTGGTGCGGTGCTGTGCTATGCTAAGCTAATCTAATTTAATTCAATTCCATTCCCTTCCCTTCCCTTCCCTTTCCTTTACTTTAGTTTAGTTTAGTGTAGTGAAGTGACGTGACCTGACCTGACCTAACCTAGCCTAGGCTAGGCTAGGCTAGGCTGGGCTGTGCTGTTCTGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATATCATACCATACGATACGATACGATACGATGCGATGTGATGTTATGTTTTGTTTA",
"label": "promoter"
}
],
"validation": [
{
"seq": "GTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGGGTGGGCTGGGCCGGGCCTGGCCTGGCCTGTCCTGTACTGTAGTGTAGCGTAGCATAGCAGAGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACCGCACCTCACCTGACCTGACCTGAGCTGAGGTGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGCAGGGCTGGGCTGGGCTGGGCTGGCCTGGCATGGCAGGGCAGCGCAGCCCAGCCCAGCCCCGCCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTG",
"label": "non-promoter"
},
{
"seq": "GTGTGGTGTGGGGTGGGATGGGATGGGATCGGATCAGATCATATCATGTCATGTCATGTAATGTATTGTATCGTATCATATCAGATCAGTTCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGACTTGACTTGACCTGACCTGACCTCACCTCCCCTCCTCTCCTGTCCTGGCCTGGGCTGGGCTGGGCTGGGCTCGGCTCAGCTCAACTCAAGTCAAGCCAAGCAAAGCATAGCATTGCATTCCATTCTATTCTTTTCTTCTCTTCCCTTCCCTTCCCATCCCACCCCACCCCACCTCACCTCACCTCACCTCAACTCAACTCAACCCAACCTAACCTCACCTCTCCTCTTCTCTTGTCTTGACTTGAGTTGAGTTGAGTAGAGTAGAGTAGCGTAGCTTAGCTGAGCTGAGCTGAACTGAAATGAAATGAAATTAAATTAAATTACATTACATTACAGTACAGGACAGGACAGGAAAGGAACGGAACAGAACATAACATGACATGCCATGCCATGCCATGCCACGCCACCCCACCACACCACACCACA",
"label": "non-promoter"
},
{
"seq": "CCCTGCCCTGCACTGCATTGCATGGCATGCCATGCCATGCCATGCCACGCCACACCACATCACATAACATAGCATAGCATAGCATAGCAAAGCAAGGCAAGGCAAGGTAAGGTGAGGTGCGGTGCTGTGCTGTGCTGGGCTGGGCTGGGTTGGGTCGGGTCAGGTCACGTCACTTCACTGCACTGAACTGATCTGATGTGATGCGATGCTATGCTATGCTAAGCTAACCTAACATAACATAACATCACATCTCATCTAATCTAATCTAAACTAAACTAAACAAAACAGAACAGGACAGGGCAGGGGAGGGGCGGGGCCGGGCCAGGCCAGGCCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGTGCGGTGCGGTGGGGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACTTCACTCCACTCTACTCTTCTCTTGTCTTGACTTGAATTGAAATGAAATGAAATCAAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGC",
"label": "non-promoter"
},
{
"seq": "TTGGAGTGGAGCGGAGCAGAGCAAAGCAAGGCAAGGCAAGGCAAGGCTAGGCTAGGCTATGCTATGCTATGCTATGCAATGCACTGCACCGCACCACACCATACCATACCATACCATACAATACATTACATGACATGCCATGCTATGCTCTGCTCTGCTCTGCTCTGATCTGAGCTGAGTTGAGTGGAGTGGAGTGGGGTGGGCTGGGCTGGGCTTGGCTTGGCTTGACTTGATTTGATTTGATTCGATTCCATTCCTTTCCTCTCCTCCCCTCCACTCCAGTCCAGGCCAGGGCAGGGAAGGGAAGGGAAGGGAAGAGAAGAGAAGAGGAGAGGCGAGGCCAGGCCAGGCCAGGCCAGGCCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCAAAGCATAGCATTGCATTGCATTGAATTGATTTGATGTGATGTGATGTGATGTGATGTGAAGTGAAATGAAAAGAAAACAAAACAAAACAGAACAGCACAGCCCAGCCTAGCCTTGCCTTTCCTTTCCTTTCCTTTCCCTTCCCTTCCCTTCCCTTGCCTTGCCTTGCCTTGCCATGCCAT",
"label": "non-promoter"
},
{
"seq": "AGCACAGCACAGCACAGGACAGGGCAGGGCAGGGCAGGGCACGGCACTGCACTGCACTGGACTGGTCTGGTGTGGTGGGGTGGAGTGGAGTGGAGGGGAGGGGAGGGAAGGGAGGGGAGCGGAGCCGAGCCCAGCCCTGCCCTGCCCTGCCCTGCGCTGCGGTGCGGGGCGGGGCGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAACTCAAGTCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGATCGGATGGGATGGGATGGGATGGGTTGGGTGGGGTGTGGTGTGGTGTGATGTGAGGTGAGATGAGAGGAGAGGAGAGGCGAGGCAAGGCACGGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCTGGGCTGGGCTGAGCTGAACTGAAGTGAAGCGAAGCAAAGCAGAGCAGCGCAGCACAGCATAGCATCGCATCTCATCTGATCTGGTCTGGGCTGGGTTGGGTTGGGTTTGGTTTGGTTTGATTTGAGTTGAGGTGAGGAGAGGAA",
"label": "non-promoter"
},
{
"seq": "AGGCCAGGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGGTGGGGTGGGGTCGGGTCAGGTCAAGTCAAGTCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAGCCCAGCCCAGCACAGCACAGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGTGTGGTTTGGTTCGGTTCTGTTCTGTTCTGCTCTGCTCTGCTCTGCTCCGCTCCACTCCAGTCCAGACCAGAGCAGAGGAGAGGTGAGGTGAGGTGCGGTGCAGTGCAGTGCAGTGCAGTCCAGTCAAGTCAGGTCAGATCAGACCAGACTAGACTGGACTGCACTGCCCTGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTTGGGTTGGGTTGGGTTGGCTTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTCGGGTCTGGTCTGGTCTGTTCTGTCCTGTCATGTCATGTCATT",
"label": "non-promoter"
},
{
"seq": "GTGCGATGCGAGGCGAGACGAGATGAGATGAGATGAGATGACATGACGTGACGCGACGCAACGCACCGCACTGCACTTCACTTCACTTCCCTTCCTTTCCTGTCCTGCCCTGCCCTGCCTTGCCTGGCCTGACCTGAGCTGAGGTGAGGCGAGGCGAGGCGGGGCGGCGCGGCCCGGCCGGGCCGCGCCGCTCCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTCTTCTCGTCTCGCCTCGCGTCGCGGCGCGGCGCGGCTCGGCTTGGCTTCGCTTCCCTTCCGTTCCGGTCCGGCCCGGCACGGCAGGGCAGGGCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCTGCGCTGCGCTGAGCTGAGCTGAGATGAGACGAGACCAGACCAGACCACACCACGCCACGGCACGGGACGGGACGGGAAGGGAAGGGAAGCGAAGCCAAGCCAAGCCAGGCCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCTTGGCTGGGCTGTGCTGTCCTGTCGTGTCGGGTCGGTTCGGTTCGGTTAGGTTAGGTTAGCTTAGCC",
"label": "promoter"
},
{
"seq": "GTTCTTTTCTTGTCTTGGCTTGGATTGGATTGGATCGGATCAGATCACATCACATCACACCACACTACACTCCACTCGACTCGACTCGAGTCGAGGCGAGGAGAGGAAAGGAAAGGAAAGGAAAGCAAAGCTAAGCTCAGCTCCGCTCCACTCCAGTCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGCTGGGCCGGGCCCGGCCCAGCCCAGCCCAGACCAGATCAGATTAGATTTGATTTGATTTGGTTTGGGTTGGGGTGGGGCGGGGCTGGGCTTGGCTTCGCTTCTCTTCTGTTCTGTTCTGTCCTGTCCTGTCCTGTCCTGTCCTGACCTGAACTGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGCGGCGCTGCGCTGCGCTGGGCTGGCCTGGCTTGGCTCGGCTCCGCTCCTCTCCTGTCCTGGCCTGGTCTGGTGTGGTGTGGTGTGGTGTGATGTGAAGTGAATTGAATGGAATGGAATGGGATGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCAGCCCAGCCCAGGCCAGGGCAGGGCAGGGCTGGGCTG",
"label": "non-promoter"
},
{
"seq": "GGCCAGGCCAGGCCAGGGCAGGGGAGGGGAGGGGACGGGACCGGACCAGACCAGACCAGGCCAGGCCAGGCTAGGCTGGGCTGGGCTGGGCTGGGATGGGAGGGGAGAGGAGAGGAGAGCAGAGCTGAGCTGAGCTGCGCTGCCCTGCCATGCCAAGCCAACCCAACCCAACCGAACCGCACCGCACCGCACCGCACCGCACCTCACCTGACCTGTCCTGTGCTGTGATGTGAAGTGAAGTGAAGGGAAGGAAAGGAAAGGAATGGAATGGAATGGAATGGTATGGTCTGGTCAGGTCAGGTCAGGTCAGGACAGGAAAGGAACGGAACCGAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCATCCCACCCCACCCCACCCCACCCTACCCTGCCCTGGCCTGGGCTGGGATGGGATGGGATGGGATGCGATGCAATGCATTGCATTGCATTCCATTCCATTCCTTTCCTGTCCTGGCCTGGCCTGGCTTGGCTTGGCTTTGCTTTTCTTTTATTTTACTTTACCTTACCATACCAGACCAGTCCAGTTCAGTTAAGTTATGTTATTTTATTC",
"label": "non-promoter"
},
{
"seq": "ATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGAGCCGAGCCGAGGCGAGGTGAGGTGAGGTGGGGTGGGGTGGGTTGGGTGGGGTGGGGTGGAGTGGATTGGATCGGATCAGATCATATCATCTCATCTCATCTGATCTGATCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTGGTTTGATTTGAATTGAAATGAAACGAAACCAAACCAAACCAGACCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCAGGCCAAGCCAACCCAACACAACATAACATGACATGGCATGGCATGGCATGGCAAGGCAAAGCAAAACAAAACAAAACCAAACCCAACCCCACCCCGCCCCGTCCCGTCCCGTCTCGTCTCGTCTCTTCTCTACTCTACTCTACTCTACTATACTAAACTAAACTAAAATAAAAAAAAAATAAAATAAAATAC",
"label": "non-promoter"
},
{
"seq": "CCGCCACGCCAGGCCAGGCCAGGCCAGGCTAGGCTCGGCTCCGCTCCTCTCCTCTCCTCTCCTCTGCTCTGCTCTGCACTGCAGTGCAGCGCAGCGCAGCGCAGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGCCGCGCAGCGCAGCGCAGCGCAGCGCAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCGCCTCGCCTCGCGTCGCGGCGCGGTGCGGTACGGTAGGGTAGGGTAGGCTAGGCGAGGCGCGGCGCGGCGCGGCGCGGAGCGGAGCGGAGGGGAGGAGAGGAAAGGAAGGGAAGCGAAGCGAAGCGGAGCGGCGCGGCCCGGCCAGGCCACGCCACACCACAGCACAGGACAGGGCAGGGCAGGGCTGGGCTGGGCTGCGCTGCCCTGCCGTGCCGCGCCGCCCCGCCCCGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAACAGAATAGAATCGAATCGAATCGCATCGCATCGCAGCGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACGGCACGACACGATACGATC",
"label": "promoter"
},
{
"seq": "CGCAGTGCAGTGCAGTGGAGTGGTGTGGTCTGGTCTGGTCTTGTCTTGTCTTGGCTTGGCTTGGCATGGCAGGGCAGCGCAGCTCAGCTGAGCTGCGCTGCCCTGCCATGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTAGAGTAGAGTAGGGTAGGTTAGGTAAGGTAGGGTAGTGTAGTGTAGTGCAGTGCCGTGCCTTGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGACCGGACCGGACCCGACCCTACCCTCCCCTCGCCTCGCCTCGCTTCGCTACGCTAGGCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCATGGCACGGCACTGCACTACACTATACTATCCTATCATATCACATCACATCACAACACAAAACAAAGCAAAGGAAAGGAAAGGACAGGACAGGACAAGACAAAACAAAGCAAAGCAAAGCTAAGCTGAGCTGGGCTGGTCTGGTTTGGTTGGGTTGAGTTGAATTGAAGTGAAGCGAAGCTAAGCTAAGCTACGCTACACTACAGTACAGAACAGAACAGAAAAGAAATGAAATCAAATCCAATCCC",
"label": "promoter"
},
{
"seq": "TACCGGACCGGACCGGAGCGGAGAGGAGACGAGACCAGACCGGACCGCACCGCACCGCACCGCACTGCACTGCACTGAACTGAACTGAAGTGAAGAGAAGACAAGACTAGACTGGACTGTACTGTTCTGTTTTGTTTTGTTTTATTTTAGTTTAGATTAGAGTAGAGTAGAGTTGAGTTGAGTTGAGTTGACTTGACTTGACTGGACTGAACTGACCTGACATGACAGGACAGTACAGTGCAGTGGAGTGGCGTGGCATGGCAGGGCAGCGCAGCGCAGCGAAGCGATGCGATTCGATTCGATTCTATTCTCTTCTCCTCTCCTCTCCTGTCCTGTCCTGTCCTGTCTTGTCTCGTCTCCTCTCCACTCCAGTCCAGCCCAGCCCAGCCCAGCCCTGCCCTCCCCTCACCTCAGCTCAGCTCAGCACAGCAGAGCAGTGCAGTGCAGTGTAGTGTCGTGTCCTGTCCCGTCCCTTCCCTTCCCTTTCCTTTGCTTTGGTTTGGGTTGGGCTGGGCAGGGCACGGCACCGCACCCCACCCAACCCAGCCCAGCCCAGCCCAGCCCAGCCCCGCCCCA",
"label": "non-promoter"
},
{
"seq": "CAGAATAGAATCGAATCGAATCGCATCGCATCGCAACGCAAGGCAAGACAAGAAAAGAATAGAATCGAATCAAATCATATCATGTCATGCCATGCAATGCAGTGCAGAGCAGAGCAGAGCAGAGCGGAGCGAAGCGACGCGACCCGACCTGACCTGACCTGACCTGATCTGATTTGATTTGATTTAATTTACTTTACGTTACGCTACGCTACGCTTCGCTTCGCTTCACTTCACTTCACCTCACCTCACCTAACCTAGCCTAGACTAGATTAGATTAGATTGGATTGAATTGACTTGACTTGACTTGACTTTACTTTTCTTTTTTTTTTATTTTATTTTATTTTATTCTATTCTATTCTGTTCTGCTCTGCACTGCATTGCATCGCATCGCATCGTATCGTTTCGTTGCGTTGTGTTGTGTTGTGTTGTGTTGTGTTCTGTTCTGTTCTTTTCTTCTCTTCCCTTCCCTTCCCCTCCCCCCCCCCACCCCACCCCACTCCACTTCACTTCACTTCCCTTCCTTTCCTCTCCTCTCCTCTTCTCTTCTCTTCTCTTCTTTTCTTGTCTTGCCTTGCT",
"label": "non-promoter"
}
],
"epochs": 1
}
})
headers = {
'Content-Type': 'application/json',
'Authorization': 'Token {}'.format(os.environ['BIOLMAI_TOKEN'])
}
response = requests.request("POST", url, headers=headers, data=payload)
print(response.text)
library(RCurl)
headers = c(
"Content-Type" = "application/json",
'Authorization' = paste('Token', Sys.getenv('BIOLMAI_TOKEN'))
)
params = "{
\"pipeline\": \"finetune_DNABERT_classifier\",
\"hyperopt\": false,
\"input_json\": {
\"max_train\": 40000,
\"max_validate\": 20000,
\"train\": [
{
\"seq\": \"CACAGCACAGCCCAGCCAAGCCAGGCCAGCCCAGCCCAGCCAAGCCACGCCACTCCACTACACTAGACTAGGCTAGGCTAGGCCAGGCCCGGCCCTGCCCTGCCCTGTCCTGTCCTGTCCTGTCCTGTCCTGTCCTGCCCTGCACTGCAGTGCAGCGCAGCCCAGCCCAGCCCCGCCCCCCCCCCTCCCCTGCCCTGTCCTGTACTGTAGTGTAGGGTAGGGTAGGGGAGGGGTGGGGTCGGGTCTGGTCTGGTCTGGTCTGGACTGGAATGGAACGGAACAGAACAGAACAGCACAGCCCAGCCAAGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTGGAGTGGAGTGGTGTGGTTTGGTTTGGTTTAGTTTAATTTAAGTTAAGATAAGAGAAGAGGAGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGGAGGGGAGGGGAGGGGAGTGGAGTCGAGTCGAGTCGCGTCGCCTCGCCTCGCCTTGCCTTGCCTTGCCTTGCCTTGCCCTGCCCTGCCCTGCCCTGTCCTGTGCTGTGCTGTGCCGTGCCATGCCACGCCACACCACAC\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"CTAATCTAATCTAATCTAATCTAGTCTAGTCTAGTATAGTAAAGTAATGTAATGTAATGCAATGCCATGCCGTGCCGCGCCGCGCCGCGTCGCGTTGCGTTGCGTTGGGTTGGTTTGGTGTGGTGGGGTGGAGTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACATGACATGACATGACATGACATGACATGACATGACATAACATACCATACCATACCTTACCTCACCTCACCTCAACTCAAATCAAACCAAACAAAACAGAACAGCACAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCCGCGCCACGCCATGCCATGCCATGCCATGCGATGCGCTGCGCCGCGCCACGCCAAGCCAAGCCAAGCCAAGCCAAGCCCAGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAGCAGAGGAGAGGGGAGGGTAGGGTTGGGTTGGGTTGTGTTGTCTTGTCCTGTCCAGTCCAATCCAACCCAACTCAACTCAACTCCACTCCTCTCCTATCCTATCCTATTCTATTCTATTCCATTCCT\",
\"label\": \"promoter\"
},
{
\"seq\": \"GGAAGAGAAGAGAAGAGGAGAGGGGAGGGAAGGGAAGGGAAGGGAAGGGAAGGAAAGGAAAGGAAAGGAAATGAAATGAAATGCAATGCCATGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTCGGGTCCGGTCCCGTCCCATCCCAGCCCAGGCCAGGCCAGGCGAGGCGGGGCGGGGCGGGGCGGGGCGGGGCCGGGCCTGGCCTCGCCTCGCCTCGACTCGAGTCGAGCCGAGCGGAGCGTAGCGTGGCGTGCCGTGCCGTGCCCTGCCCAGCCCACCCCACGCCACGCCACGCCACGCCGCGCCGCGCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGCCTGGCCTGGCCAGGCCACGCCACGCCACGCCACGCCACGCCTCGCCTGGCCTGGCCTGGACTGGAGTGGAGTGGAGTTGAGTTGAGTTGCGTTGCATTGCAGTGCAGGGCAGGACAGGAAAGGAACGGAACCGAACCGAACCGGACCGGGCCGGGCCGGGCGGGGCGCGGCGCCGCGCCGCGCCGGGCCGGG\",
\"label\": \"promoter\"
},
{
\"seq\": \"CGAAAGGAAAGCAAAGCAAAGCAAAGCAATGCAATCCAATCAAATCAGATCAGTTCAGTGCAGTGGAGTGGCGTGGCCTGGCCTGGCCTGGCCTGGCCTGGACTGGACTGGACCGGACCAGACCATACCATGCCATGTCATGTGATGTGTTGTGTAGTGTAGTGTAGTGTAGTATAGTATAGTATAGTATAGTATAGAATAGAGTAGAGAAGAGAGGAGAGCAGAGCAGAGCAAAGCAACGCAACACAACAGAACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCATGCCATCCCATCTCATCTAATCTATTCTATGCTATGCTATGCTATGCTTTGCTTAGCTTAACTTAATTTAATTTAATTTAATTTGATTTGGTTTGGCTTGGCATGGCAAGGCAACGCAACACAACATAACATTACATTACATTACATTACATTACATTACATGACATGTCATGTAATGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCGTCCCGGCCCGGACCGGAACGGAAAGGAAAAGAAAATAAAATCAAATCTAATCTTATCTTTTCTTTTCTTTTATTTTAA\",
\"label\": \"promoter\"
},
{
\"seq\": \"TGACTCGACTCCACTCCCCTCCCATCCCAACCCAAACCAAACCAAACCAAACCAAACCAAACCAACCCAACACAACAAAACAAAACAAAACAAAAGAAAAGGAAAGGGAAGGGGAGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGTGGAGTTGAGTTCAGTTCAGTTCATTTCATCTCATCACATCACATCACCTCACCACACCACACCACTCCACTACACTAGACTAGACTAGACTAGACTAGACTTGACTTTACTTTCCTTTCCTTTCCTTTCCTTTCCTTACCTTATCTTATATTATAATATAAAATAAAATAAAAAAAAAAAAAAAACAAAACAAAACACAACACTACACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGAAGGGAGGGGAGTGGAGTGGAGTGCAGTGCTGTGCTTTGCTTAGCTTAACTTAAGTTAAGCTAAGCAAAGCAGAGCAGAGCAGAACAGAAAAGAAAGGAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAAAAAAATAAAATAAAATACAATACTATACTATACTAA\",
\"label\": \"promoter\"
},
{
\"seq\": \"AAGCATAGCATGGCATGACATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAATGAAAGGAAAGAAAAGACAAGACTAGACTGGACTGGACTGGGCTGGGCTGGGCTGGGCTAGGCTAGGCTAGGCTAGGCTAGGCAAGGCACGGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACATCACACCACACCACACCTCACCTGACCTGTCCTGTACTGTAATGTAATGTAATCTAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACAAGACAGGACAGGACAGGCCAGGCAAGGCAGGGCAGAGCAGATCAGATGAGATGAGATGACATGACCTGACCTGACCTGACCTGACCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTG\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GGCTTTGCTTTGCTTTGTTTTGTTTTGTTTTGTTTTGTTTTCTTTTCTTTTCTGTTCTGTTCTGTGCTGTGATGTGAGGTGAGTTGAGTTGAGTTAAGTTACGTTACGTTACGGTACGGGACGGGGCGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCATGCCACGCCACCCCACCTCACCTGACCTGCCCTGCACTGCAGTGCAGGGCAGGTCAGGTAAGGTAAGGTAAAGTAAAATAAAATAAAATCAAATCTAATCTGATCTGGTCTGGACTGGACTGGACAGGACATGACATTACATTGCATTGCATTGCCTTGCCCTGCCCTGCCCTGCCCTGACCTGAACTGAAATGAAATGAAATTAAATTGAATTGAATTGACTTGACCTGACCGGACCGAACCGAACCGAACCGAACCGAACCTAACCTTACCTTGCCTTGGCTTGGATTGGATTGGATAGGATACGATACAATACAATACAAAACAAACCAAACCAAACCCAACCCGACCCGGCCCGGCCCGGCCCGGCCTGGCCTGGCCTGACCTGACCTGACATGACAGGACAGTACAGTG\",
\"label\": \"promoter\"
},
{
\"seq\": \"TCACCGCACCGTACCGTTCCGTTACGTTACGTTACTTTACTGTACTGCACTGCCCTGCCTTGCCTCGCCTCCCCTCCTCTCCTATCCTAGCCTAGTCTAGTGTAGTGGAGTGGCGTGGCGTGGCGGGGCGGAGCGGATCGGATAGGATACGATACGATACGGTACGGCACGGCGCGGCGGGGCGGCGCGGCACGGCAAGGCAATGCAATACAATAGAATAGTATAGTGTAGTGGAGTGGCGTGGCGTGGCGCGGCGCAGCGCACCGCACAGCACATCACATTACATTCCATTCAATTCAATTCAAGTCAAGGCAAGGCAAGGCAAGGCAGGGCAGGGCAGGACAGGAAAGGAAGGGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGAGAAGAGGAGAGGAGAGGAAAGGAACGGAACAGAACAGAACAGAACAGAGCAGAGCAGAGCCGAGCCAAGCCACGCCACCCCACCACACCAGACCAGCCCAGCACAGCAGAGCAGGGCAGGTCAGGTTAGGTTTGGTTTGGTTTGGTTTGGCTTGGCCTGGCCCGGCCCAGCCCAGCCCAGTCCAGTG\",
\"label\": \"promoter\"
},
{
\"seq\": \"AGAAAAGAAAACAAAACAAAACAAAACAAAACAAAACAAAAGAAAAGCAAAGCTAAGCTCAGCTCCGCTCCGCTCCGGTCCGGACCGGAGCGGAGTGGAGTAGAGTAGAGTAGGGTAGGATAGGAAAGGAAAGGAAAGGAAAGTAAAGTGAAGTGAAGTGACGTGACATGACACGACACAACACAGCACAGCACAGCGCAGCGCAGCGCCGCGCCACGCCACGCCACCCCACCTCACCTCACCTCCCCTCCCCTCCCGTCCCGGCCCGGTCCGGTACGGTAGGGTAGCGTAGCCTAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCGGGCCGGGCCGGCCCGGCCCGGCCAGGCCAAGCCAAGCCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCTTGGCTTGGCTTAGCTTAACTTAAGTTAAGCTAAGCGAAGCGGAGCGGGGCGGGCCGGGCCGGGCCTGGCCTCGCCTCTCCTCTGCTCTGGTCTGGCCTGGCCTGGCCTGGCCTGGCCTGCCCTGCCCTGCCATGCCAAGCCAAACCAAAA\",
\"label\": \"promoter\"
},
{
\"seq\": \"AAGTAGAGTAGAGTAGAGTAGAGGAGAGGCGAGGCCAGGCCTGGCCTCGCCTCCCCTCCTCTCCTGTCCTGCCCTGCTCTGCTTTGCTTCGCTTCACTTCAGTTCAGGTCAGGGCAGGGAAGGGAAGGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGAGTAGAGCAGAGCCGAGCCGAGCCGGGCCGGTCCGGTGCGGTGTGGTGTCGTGTCTTGTCTCGTCTCGTCTCGCCTCGCATCGCACCGCACCGCACCACACCAGACCAGACCAGAGCAGAGCAGAGCCGAGCCCAGCCCCGCCCCACCCCAGCCCAGACCAGATCAGATGAGATGGGATGGAATGGAATGGAACGGAACTGAACTCAACTCTACTCTGCTCTGTTCTGTCCTGTCCTGTCCCGTCCCATCCCATCCCATTCCATTCCATTCAATTCACTTCACATCACATCACATTACATTACATTAAATTAATTTAATTTAATTGAATTGAATTGAATTGAATTGAATCGAATCCAATCCAATCCAGTCCAGTCCAGTACAGTACAGTACTGTACTTTACTTTACTTTGCTTTGA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GGTGCAGTGCAATGCAAGGCAAGGCAAGGAAAGGAAAGGAATGGAATGGAATGAAATGAAATGAAGTGAAGCGAAGCCAAGCCAAGCCAAGCCAATCCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAACTCAATTCAATCCAATCAAATCATATCATCTCATCGCATCGAATCGAGTCGAGGCGAGGCGAGGCTAGGCTAGGCTACGCTACCCTACCCTACCCTACCCTGCCCTGCCCTGCCCTGCCATGCCATGCCATCCCATCTCATCTTATCTTGTCTTGTCTTGTGTTGTGGTGTGGCGTGGCCTGGCCAGGCCATGCCATGCCATGTCATGTGATGTGATGTGAGGTGAGGTGAGGGGAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACGACACGTCACGTGACGTGTCGTGTAGTGTAGTGTAGAGTAGATTAGATCAGATCAGATCAAATCAATTCAATTCAATTTAATTTCATTTCTTTTCTCTTCTCATCTCAGCTCAGCTCAGCACAGCATAGCATCGCATCACATCACATCACA\",
\"label\": \"promoter\"
},
{
\"seq\": \"AGAGACGAGACTAGACTGGACTGGACTGGCCTGGCATGGCAAGGCAAGGCAAGGCAAGGAAAGGACAGGACAGGACAGGACAGGACAGGCCAGGCTAGGCTCGGCTCGGCTCGCCTCGCCTCGCCCCGCCCTGCCCTTCCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTAGTGTAGAGTAGAGTAGAGCAGAGCCGAGCCTAGCCTCGCCTCGCCTCGCCTCGCATCGCATCGCATTGCATTGCATTGGATTGGCTTGGCCTGGCCAGGCCACGCCACCCCACCACACCAGACCAGGCCAGGACAGGAGAGGAGGGGAGGCGAGGCAAGGCAGGGCAGTGCAGTGCAGTGTAGTGTTGTGTTGTGTTGTGTTGTCTTGTCTTGTCTGGTCTGCTCTGCCCTGCCTTGCCTCGCCTCTCCTCTCCTCTCGTCTCGACTCGAATCGAACCGAACTGAACTTAACTTGACTTGGCTTGGCTTGGCTTGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAACCCAAGCCAAGGCAAGGTAAGGTGAGGTGAGGTGAGGTGAGATGAGAAGAGAAG\",
\"label\": \"promoter\"
},
{
\"seq\": \"TGGCGAGGCGACGCGACCCGACCCGACCCCACCCCACCCCAACCCAACCCAACCCAACCTAACCTGACCTGCCCTGCCCTGCCCTGCCCTGCCCTTCCCTTGCCTTGCCTTGCTTTGCTTTGCTTCGCTTCGCTTCGGTTCGGATCGGACCGGACAGGACACGACACTACACTGCACTGCACTGCACTGCAGTGCAGCGCAGCACAGCACAGCACCGCACCCCACCCAACCCAACCCAATCCAATGCAATGGAATGGCATGGCGTGGCGCGGCGCCGCGCCCCGCCCAGCCCAGCCCAGACCAGAACAGAACAGAACCGAACCCAACCCGACCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTGCCCTGCCCTGCCCTGCCGTGCCGCGCCGCGCCGCGGCGCGGGGCGGGCCGGGCAGGGCAGGGCAGTGCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCCAGCCCGGCCCGGCCCGGGCCGGGACGGGATGGGATAGGATAGGATAGCATAGCGTAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCACCCCAA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"CTGTGTTGTGTAGTGTATTGTATAGTATATTATATCATATCTTATCTGATCTGTTCTGTACTGTAATGTAAAGTAAAGTAAAGTAAAGTTAAGTTAAGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGTCCAGTCCAGTCAAGTCAAGTCAACTCAACGCAACGCAACGCTACGCTACGCTAGGCTAGGCTAGGGTAGGGAAGGGATGGGATGGGATGCGATGCAATGCACTGCACAGCACACCACACTACACTCCACTCTACTCTGCTCTGCTCTGCACTGCAATGCAACGCAACACAACACAACACTACACTCCACTCTACTCTACTCTAGTCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCCTGGCCTGGCCTTGCCTTCCCTTCTCTTCTGTTCTGTTCTGTACTGTATTGTATAGTATATTATATAATATATTATATGATATGGTATGGCATGGCATGGCAGGGCAGAGCAGAACAGAAAAGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAAAAGAAAGGAAAGTAAAGTAAAGTAAAGTAAAGTAAAT\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"CTATATTATATTATATTTTATTTGATTTGGTTTGGATTGGACTGGACAGGACAAGACAATACAATCCAATCGAATCGCATCGCCTCGCCGCGCCGTGCCGTGCCGTGACGTGATGTGATTTGATTAGATTAAATTAAATTAAACTAAACGAAACGAAACGAGACGAGTCGAGTGGAGTGTAGTGTAGTGTATTGTATGGTATGATATGAAATGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGTGGCGTCGCGTCTCGTCTGGTCTGATCTGAACTGAAGTGAAGCGAAGCTAAGCTAAGCTAGGCTAGGCTAGGGTAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTACGCTAGGCTAGACTAGATTAGATAAGATAAGATAAAATAAACTAAACAAAACACAACACTACACTGCACTGAACTGATCTGATTTGATTTGATTTCATTTCCTTTCCCTTCCCCTCCCCTCCCCTTCCCTTTCCTTTACTTTAGTTTAGGTTAGGGTAGGGAAGGGAAGGGAAAGGAAAAGAAAAAAAAAAGAAAAGAAAAGAAAAGAATAGAATG\",
\"label\": \"promoter\"
},
{
\"seq\": \"AACTGCACTGCACTGCAGTGCAGGGCAGGACAGGATAGGATGGGATGCGATGCTATGCTCTGCTCTGCTCTTCTCTTGTCTTGGCTTGGATTGGAGTGGAGTGGAGTTGAGTTCAGTTCTGTTCTGTTCTGGTCTGGTCTGGTCTGGTCTGGTCTAGTCTACTCTACTCTACTCTACTCTACTCTGCTCTGCTCTGCGCTGCGATGCGATGCGATGCGATGCGATGCTATGCTTTGCTTGGCTTGTCTTGTTTTGTTTTGTTTGGTTTGCTTTGCATTGCAATGCAAAGCAAAACAAAACAAAACCAAACCCAACCCTACCCTGCCCTGTCCTGTCCTGTCATGTCATGTCATGTCATGACATGAGATGAGATGAGAAGAGAAGAGAAGGGAAGGTAAGGTCAGGTCCGGTCCAGTCCACTCCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGCATGGCATGGCAGGGCAGGGCAGGTCAGGTTAGGTTTGGTTTCGTTTCATTTCAGTTCAGCTCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCTGTGCTATGCTAC\",
\"label\": \"promoter\"
},
{
\"seq\": \"GCTTTCCTTTCCTTTCCTTTCCTGTCCTGGCCTGGCCTGGCCTGGCCCGGCCCCGCCCCCCCCCCACCCCAACCCAAGCCAAGACAAGAGAAGAGTAGAGTGGAGTGCAGTGCAGTGCAGTGCAGGGCAGGGCAGGGAAGGGATGGGATGGGATGCGATGCCATGCCCTGCCCAGCCCAGCCCAGGCCAGGTCAGGTCAGGTCTGGTCTGGTCTGCTCTGCACTGCAATGCAACGCAACCCAACCAAACCACACCACCCCACCACACCACACCACTCCACTGCACTGGACTGGGCTGGGTTGGGTGGGGTGGGGTGGCGTGGCTTGGCTGGGCTGCGCTGCACTGCAGTGCAGCGCAGCTCAGCTGAGCTGTGCTGTGCTGTGCTGTGCCGTGCCCTGCCCAGCCCAGCCCAGGCCAGGACAGGAGAGGAGGGGAGGGGAGGGTAGGGTGGGGTGGGGTGGGGTGGGATGGGACGGGACTGGACTCGACTCCACTCCTCTCCTGTCCTGCCCTGCCCTGCCCTGCCCCGCCCCACCCCACCCCACCCCACCACACCAAACCAACCCAACTCAACTC\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"TGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCCGTGCCCTGCCCCGCCCCGCCCCGGCCCGGGCCGGGTCGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCCGAGCCGAGCCGGGCCGGGCCGGGTCGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTGGGGTGAGGTGAGGTGAGCTGAGCGGAGCGGAGCGGGGCGGGGCGGGGTGGGGTGGGGTGCGGTGCGGTGCGCTGCGCGGCGCGGCGCGGGGCGGGGCGGGGTGGGGTG\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGAAGGGAAGGGAAGTGAAGTGAAGTGCAGTGCAGTGCATTGCATCGCATCCCATCCCATCCCTTCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAAAGAAACGAAACAAAACATAACATGACATGGCATGGGATGGGTTGGGTAGGGTAAGGTAAGGTAAGCTAAGCGAAGCGTAGCGTGGCGTGTCGTGTGGTGTGCTGTGCAGTGCAGTGCAGAGCAGACCAGACGAGACGTGACGTGACGTGGCGTGGAGTGGAGTGGAGAGGAGAGGAGAGGAGAGGGGAGGGCAGGGCGGGGCGTGGCGTGGCGTGGCGTGGGGTGGGGTGGGGTGGGGTGGGGTGAGGTGAGGTGAGGTGAGGGGAGGGAAGGGACGGGACGGGACGCGACGCCACGCCCCGCCCAGCCCACCCCACCCCACCCCACCCCACCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGTTGGGTGGGGTGAGGTGAGGTGAGATGAGAAGAGAAA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"AAGTTTAGTTTTGTTTTTTTTTTCTTTTCCTTTCCATTCCACTCCACCCCACCTCACCTGACCTGCCCTGCCCTGCCATGCCACGCCACTCCACTTCACTTCACTTCACTTCACTTCACATCACAACACAATACAATGCAATGAAATGACATGACCTGACCCGACCCTACCCTCCCCTCCCCTCCACTCCAGTCCAGCCCAGCGCAGCGCAGCGCCGCGCCCCGCCCTGCCCTCCCCTCTCCTCTACTCTACTCTACTCTACTGTACTGGACTGGCCTGGCATGGCAGGGCAGAGCAGAGCAGAGAAGAGACGAGACTAGACTAGACTAGACTAGCCTAGCATAGCATAGCATCGCATCACATCAAATCAAGTCAAGCCAAGCCAAGCCAAGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGCCTGGCATGGCAAGGCAAAGCAAACCAAACCAAACCAAACCAGACCAGACCAGAGCAGAGGAGAGGCGAGGCGAGGCGTGGCGTCGCGTCCCGTCCTGTCCTTTCCTTTCCTTTACTTTAATTTAAGTTAAGGTAAGGTAAGGTCAGGTCC\",
\"label\": \"promoter\"
},
{
\"seq\": \"TTTTTTTTTTTTTTTTTGTTTTGCTTTGCGTTGCGGTGCGGGGCGGGGCGGGGCGGGGCGGGGCGCGGCGCAGCGCAGCGCAGTGCAGTGCAGTGGAGTGGCGTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGTCCTGTACTGTAGTGTAGTGTAGTCTAGTCCAGTCCCGTCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCAGGCCAAGCCAAGCCAAGACAAGACAAGACGAGACGGGACGGGACGGGCCGGGCAGGGCAGGGCAGAGCAGATCAGATCAGATCAGATCACATCACGTCACGACACGAGACGAGGCGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGAGGAGATGAGATCAGATCGGATCGAATCGAGTCGAGACGAGACGAGACTAGACTAGACTATACTATCCTATCCTATCCTATCCTGTCCTGGCCTGGCCTGGCTTGGCTAGGCTAA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"CCGCCTCGCCTTGCCTTCCCTTCCCTTCCCTTCCCTTCCCTCCCCTCTCCTCTGCTCTGTTCTGTTCTGTTTTGTTTTGTTTTTTTTTTGTTTTGGTTTGGCTTGGCATGGCATGGCATAGCATAACATAAGATAAGATAAGAAAAGAAAAGAAACGAAACAAAACAAAACAATACAATTCAATTCAATTCAATTCAGTTCAGGTCAGGTCAGGTTAGGTTTGGTTTAGTTTATTTTATCTTATCATATCAAATCAAGTCAAGGCAAGGAAAGGAGAGGAGAGGAGAGGAGAGTAGAGTCGAGTCCAGTCCAGTCCAGTCCAGGCCAGGGCAGGGTAGGGTCGGGTCAGGTCAGGTCAGATCAGAACAGAATAGAATTGAATTTAATTTTATTTTTTTTTTCTTTTCTTTTCTATTCTAATCTAACCTAACCTAACCAAACCACACCACCCCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCAGCGCAGCGCAGAGCAGAGCAGAGCAGAGCAGAGCAAAGCAAGGCAAGCCAAGCTAAGCTTAGCTTA\",
\"label\": \"promoter\"
},
{
\"seq\": \"ACAAAACAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAAAAAAAAAAGAAAAGCAAAGCGAAGCGGAGCGGGGCGGGACGGGAAGGGAAGGGAAGCGAAGCAAAGCAGAGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGGCTTGGGTTGGGTTGGGTTGGGTTAGGTTATGTTATCTTATCTTATCTCATCTCCTCTCCACTCCAGTCCAGCCCAGCACAGCACAGCACAGCACACCACACCACACCCCACCCAACCCACCCCACCCCACCACACCAGACCAGACCAGAGCAGAGGAGAGGGGAGGGCAGGGCAGGGCAGGGCAGCGCAGCACAGCAGAGCAGAGCAGACCAGACAAGACACGACACTACACTGCACTGGACTGGCCTGGCTTGGCTAGGCTAAGCTAAACTAAAGTAAAGCAAAGCTAAGCTCAGCTCTGCTCTTCTCTTATCTTAGCTTAGTTTAGTCTAGTCAAGTCATGTCATATCATAACATAAGATAAGTTAAGTCAAGTCCAGTCCTGTCCTGTCCTGACCTGAGCTGAGTTGAGTGGAGTGCAGTGCT\",
\"label\": \"promoter\"
},
{
\"seq\": \"ACTGGACTGGAATGGAAAGGAAAAGAAAATAAAATTAAATTTAATTTTATTTTATTTTAATTTAAATTAAATTAAATGAAATGAAATGAAATGAATTGAATGGAATGAAATGATATGATGTGATGTGATGTGATGTGATGTGATGTGATTTGATTCGATTCTATTCTGTTCTGTTCTGTGCTGTGGTGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCTCTCCTGTCCTGTCCTGTGCTGTGGTGTGGGGTGGGCTGGGCAGGGCAGGGCAGCGCAGCACAGCACAGCACTGCACTGCACTGGACTGGCCTGGCCTGGCCTGGCCTGGCCTGACCTGAACTGAAGTGAAGCGAAGCAAAGCACAGCACAGCACAACACAAAACAAACCAAACCAAACCTAACCTGACCTGGCCTGGACTGGAGTGGAGCGGAGCCGAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGCCCAGCACAGCAGAGCAGGGCAGGGCAGGGGAGGGGGGGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAG\",
\"label\": \"promoter\"
},
{
\"seq\": \"AGAGCTGAGCTGAGCTGTGCTGTCCTGTCTTGTCTGGTCTGCTCTGCTCTGCTGTGCTGGGCTGGGCTGGGGTGGGGGGGGGGCGGGGCAGGGCAGGGCAGGGCAGGGCAGGGCAGGGCGGGGCGCGGCGCTGCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTGTTCTGGTCTGGGCTGGGCTGGGCAGGGCACGGCACTGCACTGCACTGTACTGTACTGTAGTGTAGGGTAGGATAGGATAGGATGGGATGTGATGTTATGTTATGTTAGGTTAGCTTAGCATAGCAGAGCAGCGCAGCGCAGCGAAGCGACGCGACCCGACCCGACCCTACCCTGCCCTGGCCTGGCCTGGCCTGGCCTGGCCTCGCCTCTCCTCTACTCTACTCTACCCTACCATACCACACCACTCCACTACACTAGACTAGACTAGATTAGATGAGATGCGATGCCATGCCATGCCAGGCCAGTCCAGTACAGTAGAGTAGCGTAGCATAGCACAGCACCGCACCCCACCCTACCCTCCCCTCCCCTCCTCTCCTCTCCTCTCCTCTCCTCTCCTCTCCACTCCAG\",
\"label\": \"promoter\"
},
{
\"seq\": \"GCTTTGCTTTGTTTTGTTTTGTTATGTTACGTTACATTACAGTACAGGACAGGTCAGGTGAGGTGTGGTGTCGTGTCTTGTCTGGTCTGTTCTGTTCTGTTATGTTAAGTTAACTTAACATAACATAACATTACATTCCATTCCATTCCATTCCATTCCATGCCATGGCATGGAATGGACTGGACCGGACCAGACCAAACCAAACCAAAACAAAACAAAACAAAACAAAACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAAGCCAAACCAAACCAAACCAAACCCAACCCAACCCAACCCAAACCAAAACAAAATAAAATCAAATCAAATCAAATCAAGTCAAGGCAAGGGAAGGGAAGGGACGGGACAGGACAGGACAGGACAGGACAGGAAAGGAAGGGAAGTGAAGTAAAGTAGAGTAGAGTAGACTAGACTAGACTCGACTCCACTCCACTCCACTCCACCCCACCCCACCCAACCCATCCCATGCCATGCCATGCAATGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGAGTGGAATGGAAGGGAAGGGAAGGGAAGGGGAGGGGA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GGTGATGTGATGTGATGCGATGCTATGCTATGCTACGCTACACTACAGTACAGGACAGGGCAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCTGCCCTCCCCTCACCTCATCTCATATCATAGCATAGGATAGGATAGGACAGGACAGGACAGGACAGGACAGGTCAGGTGAGGTGCGGTGCTGTGCTCTGCTCAGCTCACCTCACCTCACCACACCAGACCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGAGCGGAGCGGAGGGGAGGAGAGGACAGGACAGGACAAGACAACACAACCCAACCCAACCCGACCCGTCCCGTCCCGTCCCGTCCGGTCCGGTCCGGGCCGGGCCGGGCTGGGCTGGGCTGGGCTGGACTGGAGTGGAGCGGAGCAGAGCAGAGCAGGGCAGGTCAGGTCAGGTCAGGTCAAGTCAAGTCAAGACAAGAGAAGAGGAGAGGCGAGGCTAGGCTCGGCTCTGCTCTGCTCTGGTCTGGGCTGGGATGGGAGGGGAGAGGAGACGAGACAAGACACGACACTACACTTCACTTCACTTCC\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GGGGCAGGGCAGGGCAGGGCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCCAAGCCCAGCCCAGCCCATCCCATCCCATCTCATCTAATCTACTCTACACTACAATACAAGACAAGGCAAGGCAAGGCCAGGCCAGGCCAGGCCAGTCCAGTGCAGTGGAGTGGCGTGGCTTGGCTTGGCTTTGCTTTTCTTTTCTTTTCCTTTCCCTTCCCCTCCCCCCCCCCACCCCAACCCAACCCAACCCAACCCAACCCAACCCAGCCCAGTCCAGTCCAGTCCAGTCCTGTCCTTTCCTTCCCTTCCCTTCCCTTCCCATCCCAACCCAAACCAAATCAAATTAAATTCAATTCCATTCCCTTCCCATCCCACCCCACACCACAGCACAGCACAGCCCAGCCTAGCCTCGCCTCCCCTCCACTCCAGTCCAGACCAGATCAGATCAGATCCGATCCCATCCCTTCCCTGCCCTGCCCTGCACTGCAATGCAACGCAACCCAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGCCCCGCTCCGCTT\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"ATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTCGTTTCCTTTCCGTTCCGTTCCGTGCCGTGGCGTGGGGTGGGATGGGAGGGGAGAGGAGAGGAGAGGAGAGGTGAGGTAAGGTAAGGTAACGTAACATAACACAACACAACACAACACAATACAATACAATAGAATAGCATAGCTTAGCTTAGCTTGGCTTGTCTTGTATTGTATTGTATCGTATCATATCAGATCAGTTCAGTCCAGTCAAGTCATGTCATTTCATTACATTACATTACCTTACCATACCACACCACTCCACTTCACTTGACTTGACTTGAGTTGAGTTGAGTGGAGTGTAGTGTGGTGTGATGTGAAGTGAAGTGAAGCGAAGCAAAGCAGAGCAGTGCAGTTCAGTTAAGTTAGGTTAGTTTAGTCTAGTCAAGTCAAGTCAAATCAAAGCAAAGTAAAGTCAAGTCTAGTCTGGTCTGGTCTGGGCTGGGATGGGAGGGGAGTGGAGTGGAGTGAAGTGAAGTGAATTGAATGGAATGAAATGAGATGAGATGAGAGGAGAGTAGAGTAGAGTAGAGTAGAGTAGAA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"AACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAAAAAAAAAAAAAACAAAACAAAACACAACACAACACAGCACAGCACAGCACAGCAAAGCAAAGCAAACCAAACCAAACCTAACCTGACCTGTCCTGTACTGTATTGTATGGTATGTTATGTTATGTTGTGTTGTGTTGTCTTGTCCTGTCCCGTCCCTTCCCTTCCCTTCCCTTCCCTTCCATTCCAGTCCAGGCCAGGTCAGGTCAGGTCCGGTCCCGTCCCCTCCCCCCCCCCTCCCCTGCCCTGCCCTGCTCTGCTGTGCTGGGCTGGGCTGGGCTGGGCAGGGCATGGCATTGCATTTCATTTGATTTGCTTTGCATTGCAGTGCAGAGCAGAACAGAACAGAACCGAACCGAACCGCACCGCACCGCAGCGCAGCGCAGCACAGCATAGCATCGCATCCCATCCCATCCCATCCCAGCCCAGACCAGATCAGATCAGATCAGATCACATCACTTCACTCCACTCGACTCGTCTCGTTTCGTTACGTTAAGTTAAATTAAAATAAAAAAAAAAAAAAAATAAAATT\",
\"label\": \"promoter\"
},
{
\"seq\": \"TCCTGACCTGATCTGATATGATAAGATAAAATAAACTAAACCAAACCCAACCCAACCCATCCCATGCCATGGCATGGGATGGGATGGGATGGGATCGGATCTGATCTCATCTCATCTCATCTCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATTGAATTAAATTATATTATTTTATTCTATTCAATTCATTTCATTTCATTACATTATATTATCTTATCATATCATATCATGTCATGACATGAGATGAGATGAGAAGAGAATAGAATAGAATAGAATAGTATAGTATAGTATAGTATGGTATGGTATGGGATGGGATGGGAAGGGAAAGGAAAGGAAAGAAAAGACAAGACCAGACCAGACCAGACCAGTCCAGTCCAGTCCAGTCCCGTCCCCTCCCCACCCCATCCCATGCCATGACATGATATGATTTGATTCGATTCAATTCAATTCAATTCAATTCAATTAAATTACATTACCTTACCTTACCTCACCTCCCCTCCCCTCCCCTCCCCCCCCCCTCCCCTGCCCTGGCCTGGGCTGGGTTGGGTCGGGTCCGGTCCCGTCCCT\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GTCATCTCATCGCATCGTATCGTATCGTAGCGTAGTGTAGTATAGTACAGTACTGTACTATACTACACTACACTACATTACATTACATTTCATTTTATTTTATTTTAATTTAAATTAAACTAAACAAAACATAACATGACATGTCATGTAATGTAATGTAAAGTAAAGTAAAGAAAAGAGAAGAGCAGAGCTGAGCTCAGCTCAGCTCAGCTCAGTTCAGTGCAGTGGAGTGGTGTGGTGTGGTGCGGTGCTGTGCTCTGCTCCGCTCCACTCCAATCCAAGCCAAGACAAGAAAAGAAGAGAAGCGAAGCAAAGCAAAGCAAGGCAAGACAAGACAAGACTAGACTTGACTTTACTTTGCTTTGGTTTGGTTTGGTATGGTAGGGTAGAGTAGAGTAGAGAAGAGACGAGACGAGACGGGACGGCACGGCCCGGCCGGGCCGCGCCGCTCCGCTTCGCTTGGCTTGCCTTGCTTTGCTCTGCTCCGCTCCCCTCCCATCCCAACCCAAACCAAATCAAATAAAATATAATATCATATCATATCATATCATGTCATGCCATGCTATGCTGTGCTGA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"ACTGCGCTGCGCTGCGCGGCGCGCCGCGCCGCGCCGCGCCGAGCCGACCCGACGCGACGGGACGGTACGGTGCGGTGGGGTGGGGTGGGCTGGGCTGGGCTGGGCTGGGCTGGCCTGGCGTGGCGGGGCGGGGCGGGACGGGACGGGACCGGACCAGACCAGACCAGGCCAGGACAGGACAGGACAGGACAGGACAGGACAGGACAGGAAAGGAACGGAACAGAACAAAACAATACAATGCAATGGAATGGGATGGGATGGGATGGGATTGGATTCGATTCCATTCCGTTCCGATCCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGCGCCGCACCGCAACGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCGGGGCGCGGCGCTGCGCTCCGCTCCGCTCCTCTCCTTTCCTTCCCTTCTCTTCTGTTCTGCTCTGCGCTGCGGTGCGGGGCGGGTCGGGTTGGGTTGGGTTGGGTTGGGTTGGGGTGGGGTGGGGTGGGGTGCGGTGCGGTGCGATGCGAGGCGAGGCGAGGCGAGGCCAGGCCGGGCCGGGCCGGA\",
\"label\": \"promoter\"
},
{
\"seq\": \"TGTGCTGTGCTGTGCTGAGCTGATCTGATGTGATGCGATGCCATGCCTTGCCTGGCCTGTCCTGTGCTGTGGTGTGGTGTGGTTTGGTTTGGTTTGGTTTGGTTTGGTTTGGTGTGGTGGGGTGGGGTGGGGTGGGGCGGGGCTGGGCTAGGCTACGCTACACTACAATACAACACAACACAACAGAACAGGACAGGACAGGAAAGGAAAGGAAATGAAATTAAATTCAATTCCATTCCTTTCCTGTCCTGCCCTGCTCTGCTTTGCTTTGCTTTGCTTTGGTTTGGATTGGAATGGAAAGGAAAGGAAAGAAAAGACAAGACAAGACAGGACAGAACAGAACAGAAAAGAAAGGAAAGCAAAGCAAAGCAGAGCAGAGCAGATCAGATAAGATAGGATAGCATAGCCTAGCCAAGCCAAGCCAAACCAAATCAAATTAAATTCAATTCTATTCTCTTCTCTTCTCTCCTCTCTTCTCTACTCTACTCTACCCTACCATACCACACCACACCACATCACATTACATTTCATTTTATTTTGTTTTGGTTTGGATTGGAATGGAAAGGAAACGAAACT\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GGTTCCGTTCCCTTCCCGTCCCGCCCCGCTCCGCTTCGCTTCGCTTCCCTTCCATTCCACTCCACCCCACCGCACCGAACCGAGCCGAGGCGAGGGGAGGGCAGGGCCGGGCCGGGCCGAGCCGACCCGACTCGACTGGACTGCACTGCGCTGCGATGCGAGGCGAGGCGAGGTGAGGTGAGGTGCGGTGCAGTGCATTGCATGGCATGCCATGCTATGCTGTGCTGGGCTGGGCTGGGATGGGAGGGGAGTGGAGTCGAGTCGAGTCGTGTCGTATCGTAGCGTAGTGTAGTATAGTACAGTACCGTACCGTACCGCACCGCACCGCACCGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCGGGGCGGGGCGGAGCGGAACGGAACGGAACAGAACAGAACAGCACAGCTCAGCTCAGCTCCGCTCCGCTCCGCTCCGCCCCGCCCCGCCCCGCCCCGCCCCGGCCCGGCCCGGCGCGGCGGGGCGGAGCGGATCGGATGGGATGGGATGGTATGGTGTGGTGTGGTGTTGTGTTTTGTTTCGTTTCCTTTCCATTCCAGTCCAGA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GCCCGGCCCGGGCCGGGACGGGAGGGGAGCGGAGCGGAGCGTAGCGTCGCGTCGCGTCGCGTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCCCTCCCCTCCCCCCCCCCGCCCCGCCCCGCCCCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCCCCGCCCCGCCCCGCCCCGCCTCGCCTCGCCTCGCCTCGGCTCGGGTCGGGGCGGGGAGGGGACGGGACTGGACTCGACTCGACTCGTCTCGTCTCGTCCCGTCCCGTCCCTTCCCTCCCCTCCCCTCCACTCCACTCCACACCACAGCACAGCACAGCCCAGCCCAGCCCCGCCCCTCCCCTCCCCTCCCCTCCCCTCCCTTCCCTCCCCTCCCCTCCCCTCCCGTCCCGTCCCGTCCCGTCGCGTCGGGTCGGATCGGAACGGAATGGAATTGAATTCAATTCGATTCGCTTCGCATCGCAGCGCAGCGCAGCCCAGCCTAGCCTCGCCTCCCCTCCGCTCCGCTCCGCCCCGCCGCGCCGTGCCGTTCCGTTCCGTTCTGTTCTT\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"CTGGCTTGGCTGGGCTGCGCTGCTCTGCTCTGCTCCGCTCCTCTCCTTTCCTTACCTTACCTTACATTACAATACAAAACAAACCAAACCAAACCTAACCTGACCTGTCCTGTGCTGTGGTGTGGAGTGGAGTGGAGTGGAGTTGAGTTGAGTTGGGTTGGATTGGACTGGACTGGACTTGACTTGACTTGCCTTGCTTTGCTGTGCTGTGCTGTTCTGTTTTGTTTTGTTTTTTTTTTCTTTTCCTTTCCTTTCCTCTCCTCTCCTCTTCTCTTGTCTTGCCTTGCCTTGCCATGCCACGCCACTCCACTACACTAGACTAGACTAGAGTAGAGGAGAGGGGAGGGTAGGGTAGGGTAGGGTAGAGTAGAATAGAATAGAATAGAATATAATATGATATGATATGAAATGAAATGAAAAGAAAAGAAAAGAAAAGAAAAGAAGAGAAGAGAAGATAAGATTAGATTAGATTAGATTAGCTTAGCATAGCATAGCATGGCATGTCATGTTATGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATCTCATCCCATCCTATCCTTTCCTTGCCTTGC\",
\"label\": \"promoter\"
},
{
\"seq\": \"CCCTGCCCTGCTCTGCTATGCTACGCTACACTACAGTACAGTACAGTTCAGTTTAGTTTTGTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTGTTTTGATTTGATTTGATATGATATGATATAATATACTATACAATACAGTACAGGACAGGTCAGGTTAGGTTTGGTTTTGTTTTGTTTTGCTTTGCCTTGCCATGCCAGGCCAGCCCAGCTCAGCTTAGCTTTGCTTTTCTTTTTTTTTTTTTTTTCTTTTCATTTCACTTCACTTCACTACACTAGACTAGACTAGATTAGATGAGATGGGATGGTATGGTGTGGTGCGGTGCTGTGCTATGCTAAGCTAATCTAATTTAATTCAATTCCATTCCCTTCCCTTCCCTTCCCTTTCCTTTACTTTAGTTTAGTTTAGTGTAGTGAAGTGACGTGACCTGACCTGACCTAACCTAGCCTAGGCTAGGCTAGGCTAGGCTGGGCTGTGCTGTTCTGTTTTGTTTTGTTTTCTTTTCATTTCATTTCATATCATACCATACGATACGATACGATACGATGCGATGTGATGTTATGTTTTGTTTA\",
\"label\": \"promoter\"
}
],
\"validation\": [
{
\"seq\": \"GTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTGAGGTGGGGTGGGGTGGGCTGGGCCGGGCCTGGCCTGGCCTGTCCTGTACTGTAGTGTAGCGTAGCATAGCAGAGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACCGCACCTCACCTGACCTGACCTGAGCTGAGGTGAGGCGAGGCAAGGCAGGGCAGGGCAGGGCAGGGCAGGGCTGGGCTGGGCTGGGCTGGCCTGGCATGGCAGGGCAGCGCAGCCCAGCCCAGCCCCGCCCCTCCCCTGCCCTGTCCTGTGCTGTGGTGTGGGGTGGGGTGGGGAGGGGAGGGGAGGGGAGGGGAGGGAAGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGCGCCGCCCCGCCCCGCCCCGCCCCACCCCACCCCACTCCACTGCACTGCACTGCACTGCAGTGCAGGGCAGGTCAGGTG\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GTGTGGTGTGGGGTGGGATGGGATGGGATCGGATCAGATCATATCATGTCATGTCATGTAATGTATTGTATCGTATCATATCAGATCAGTTCAGTGCAGTGCAGTGCAGTGCAGTGCAGCGCAGCCCAGCCTAGCCTTGCCTTGCCTTGACTTGACTTGACCTGACCTGACCTCACCTCCCCTCCTCTCCTGTCCTGGCCTGGGCTGGGCTGGGCTGGGCTCGGCTCAGCTCAACTCAAGTCAAGCCAAGCAAAGCATAGCATTGCATTCCATTCTATTCTTTTCTTCTCTTCCCTTCCCTTCCCATCCCACCCCACCCCACCTCACCTCACCTCACCTCAACTCAACTCAACCCAACCTAACCTCACCTCTCCTCTTCTCTTGTCTTGACTTGAGTTGAGTTGAGTAGAGTAGAGTAGCGTAGCTTAGCTGAGCTGAGCTGAACTGAAATGAAATGAAATTAAATTAAATTACATTACATTACAGTACAGGACAGGACAGGAAAGGAACGGAACAGAACATAACATGACATGCCATGCCATGCCATGCCACGCCACCCCACCACACCACACCACA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"CCCTGCCCTGCACTGCATTGCATGGCATGCCATGCCATGCCATGCCACGCCACACCACATCACATAACATAGCATAGCATAGCATAGCAAAGCAAGGCAAGGCAAGGTAAGGTGAGGTGCGGTGCTGTGCTGTGCTGGGCTGGGCTGGGTTGGGTCGGGTCAGGTCACGTCACTTCACTGCACTGAACTGATCTGATGTGATGCGATGCTATGCTATGCTAAGCTAACCTAACATAACATAACATCACATCTCATCTAATCTAATCTAAACTAAACTAAACAAAACAGAACAGGACAGGGCAGGGGAGGGGCGGGGCCGGGCCAGGCCAGGCCAGGCCAGGTCAGGTGAGGTGCGGTGCGGTGCGGTGCGGTGCGGTGCGGTGGGGTGGCGTGGCTTGGCTCGGCTCAGCTCACCTCACTTCACTCCACTCTACTCTTCTCTTGTCTTGACTTGAATTGAAATGAAATGAAATCAAATCCAATCCCATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGC\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"TTGGAGTGGAGCGGAGCAGAGCAAAGCAAGGCAAGGCAAGGCAAGGCTAGGCTAGGCTATGCTATGCTATGCTATGCAATGCACTGCACCGCACCACACCATACCATACCATACCATACAATACATTACATGACATGCCATGCTATGCTCTGCTCTGCTCTGCTCTGATCTGAGCTGAGTTGAGTGGAGTGGAGTGGGGTGGGCTGGGCTGGGCTTGGCTTGGCTTGACTTGATTTGATTTGATTCGATTCCATTCCTTTCCTCTCCTCCCCTCCACTCCAGTCCAGGCCAGGGCAGGGAAGGGAAGGGAAGGGAAGAGAAGAGAAGAGGAGAGGCGAGGCCAGGCCAGGCCAGGCCAGGCCAGGACAGGAAAGGAAAGGAAAGGAAAGCAAAGCAAAGCATAGCATTGCATTGCATTGAATTGATTTGATGTGATGTGATGTGATGTGATGTGAAGTGAAATGAAAAGAAAACAAAACAAAACAGAACAGCACAGCCCAGCCTAGCCTTGCCTTTCCTTTCCTTTCCTTTCCCTTCCCTTCCCTTCCCTTGCCTTGCCTTGCCTTGCCATGCCAT\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"AGCACAGCACAGCACAGGACAGGGCAGGGCAGGGCAGGGCACGGCACTGCACTGCACTGGACTGGTCTGGTGTGGTGGGGTGGAGTGGAGTGGAGGGGAGGGGAGGGAAGGGAGGGGAGCGGAGCCGAGCCCAGCCCTGCCCTGCCCTGCCCTGCGCTGCGGTGCGGGGCGGGGCGGGGCGGGGCAGGGCAGGGCAGTGCAGTCCAGTCCAGTCCTGTCCTCTCCTCACCTCAACTCAAGTCAAGGCAAGGCAAGGCCAGGCCTGGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGATCGGATGGGATGGGATGGGATGGGTTGGGTGGGGTGTGGTGTGGTGTGATGTGAGGTGAGATGAGAGGAGAGGAGAGGCGAGGCAAGGCACGGCACCGCACCGCACCGGACCGGGCCGGGGCGGGGCGGGGCTGGGCTGGGCTGAGCTGAACTGAAGTGAAGCGAAGCAAAGCAGAGCAGCGCAGCACAGCATAGCATCGCATCTCATCTGATCTGGTCTGGGCTGGGTTGGGTTGGGTTTGGTTTGGTTTGATTTGAGTTGAGGTGAGGAGAGGAA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"AGGCCAGGCCAGGCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGGTGGGGTGGGGTCGGGTCAGGTCAAGTCAAGTCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGCAAGGGAAGGGGAGGGGGGGGGGCGGGGCTGGGCTGGGCTGCGCTGCCCTGCCCTGCCCAGCCCAGCCCAGCCCAGCACAGCACAGCACAGCACAGCACAGTACAGTGCAGTGGAGTGGTGTGGTTTGGTTCGGTTCTGTTCTGTTCTGCTCTGCTCTGCTCTGCTCCGCTCCACTCCAGTCCAGACCAGAGCAGAGGAGAGGTGAGGTGAGGTGCGGTGCAGTGCAGTGCAGTGCAGTCCAGTCAAGTCAGGTCAGATCAGACCAGACTAGACTGGACTGCACTGCCCTGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTTGGGTTGGGTTGGGTTGGCTTGGCTTGGCTCGGCTCAGCTCATCTCATGTCATGCCATGCCATGCCTTGCCTGGCCTGGCCTGGGCTGGGTTGGGTCGGGTCTGGTCTGGTCTGTTCTGTCCTGTCATGTCATGTCATT\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GTGCGATGCGAGGCGAGACGAGATGAGATGAGATGAGATGACATGACGTGACGCGACGCAACGCACCGCACTGCACTTCACTTCACTTCCCTTCCTTTCCTGTCCTGCCCTGCCCTGCCTTGCCTGGCCTGACCTGAGCTGAGGTGAGGCGAGGCGAGGCGGGGCGGCGCGGCCCGGCCGGGCCGCGCCGCTCCGCTGCGCTGTGCTGTTCTGTTCTGTTCTGTTCTCTTCTCGTCTCGCCTCGCGTCGCGGCGCGGCGCGGCTCGGCTTGGCTTCGCTTCCCTTCCGTTCCGGTCCGGCCCGGCACGGCAGGGCAGGGCAGGTCAGGTGAGGTGGGGTGGCGTGGCGTGGCGCGGCGCTGCGCTGCGCTGAGCTGAGCTGAGATGAGACGAGACCAGACCAGACCACACCACGCCACGGCACGGGACGGGACGGGAAGGGAAGGGAAGCGAAGCCAAGCCAAGCCAGGCCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCTTGGCTGGGCTGTGCTGTCCTGTCGTGTCGGGTCGGTTCGGTTCGGTTAGGTTAGGTTAGCTTAGCC\",
\"label\": \"promoter\"
},
{
\"seq\": \"GTTCTTTTCTTGTCTTGGCTTGGATTGGATTGGATCGGATCAGATCACATCACATCACACCACACTACACTCCACTCGACTCGACTCGAGTCGAGGCGAGGAGAGGAAAGGAAAGGAAAGGAAAGCAAAGCTAAGCTCAGCTCCGCTCCACTCCAGTCCAGCCCAGCTCAGCTGAGCTGGGCTGGGCTGGGCTGGGCCGGGCCCGGCCCAGCCCAGCCCAGACCAGATCAGATTAGATTTGATTTGATTTGGTTTGGGTTGGGGTGGGGCGGGGCTGGGCTTGGCTTCGCTTCTCTTCTGTTCTGTTCTGTCCTGTCCTGTCCTGTCCTGTCCTGACCTGAACTGAAATGAAAGGAAAGGAAAGGCAAGGCGAGGCGCGGCGCTGCGCTGCGCTGGGCTGGCCTGGCTTGGCTCGGCTCCGCTCCTCTCCTGTCCTGGCCTGGTCTGGTGTGGTGTGGTGTGGTGTGATGTGAAGTGAATTGAATGGAATGGAATGGGATGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCCGGCCCAGCCCAGCCCAGGCCAGGGCAGGGCAGGGCTGGGCTG\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"GGCCAGGCCAGGCCAGGGCAGGGGAGGGGAGGGGACGGGACCGGACCAGACCAGACCAGGCCAGGCCAGGCTAGGCTGGGCTGGGCTGGGCTGGGATGGGAGGGGAGAGGAGAGGAGAGCAGAGCTGAGCTGAGCTGCGCTGCCCTGCCATGCCAAGCCAACCCAACCCAACCGAACCGCACCGCACCGCACCGCACCGCACCTCACCTGACCTGTCCTGTGCTGTGATGTGAAGTGAAGTGAAGGGAAGGAAAGGAAAGGAATGGAATGGAATGGAATGGTATGGTCTGGTCAGGTCAGGTCAGGTCAGGACAGGAAAGGAACGGAACCGAACCCAACCCTACCCTCCCCTCCCCTCCCCTCCCATCCCACCCCACCCCACCCCACCCTACCCTGCCCTGGCCTGGGCTGGGATGGGATGGGATGGGATGCGATGCAATGCATTGCATTGCATTCCATTCCATTCCTTTCCTGTCCTGGCCTGGCCTGGCTTGGCTTGGCTTTGCTTTTCTTTTATTTTACTTTACCTTACCATACCAGACCAGTCCAGTTCAGTTAAGTTATGTTATTTTATTC\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"ATCCCATCCCAGCCCAGCCCAGCACAGCACAGCACTGCACTTCACTTTACTTTGCTTTGGTTTGGGTTGGGATGGGAGGGGAGGGGAGGCGAGGCCAGGCCGGGCCGAGCCGAGCCGAGGCGAGGTGAGGTGAGGTGGGGTGGGGTGGGTTGGGTGGGGTGGGGTGGAGTGGATTGGATCGGATCAGATCATATCATCTCATCTCATCTGATCTGATCTGAGCTGAGGTGAGGTGAGGTCAGGTCAGGTCAGGTCAGGTCAGGACAGGAGAGGAGTGGAGTTGAGTTTAGTTTGGTTTGATTTGAATTGAAATGAAACGAAACCAAACCAAACCAGACCAGCCCAGCCCAGCCTAGCCTGGCCTGGCCTGGCCTGGCCTGGCCAGGCCAAGCCAACCCAACACAACATAACATGACATGGCATGGCATGGCATGGCAAGGCAAAGCAAAACAAAACAAAACCAAACCCAACCCCACCCCGCCCCGTCCCGTCCCGTCTCGTCTCGTCTCTTCTCTACTCTACTCTACTCTACTATACTAAACTAAACTAAAATAAAAAAAAAATAAAATAAAATAC\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"CCGCCACGCCAGGCCAGGCCAGGCCAGGCTAGGCTCGGCTCCGCTCCTCTCCTCTCCTCTCCTCTGCTCTGCTCTGCACTGCAGTGCAGCGCAGCGCAGCGCAGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGGCGCGCCGCGCAGCGCAGCGCAGCGCAGCGCAGCGCAGCGCCGCGCCCCGCCCCGCCCCCCCCCCTCCCCTCCCCTCGCCTCGCCTCGCGTCGCGGCGCGGTGCGGTACGGTAGGGTAGGGTAGGCTAGGCGAGGCGCGGCGCGGCGCGGCGCGGAGCGGAGCGGAGGGGAGGAGAGGAAAGGAAGGGAAGCGAAGCGAAGCGGAGCGGCGCGGCCCGGCCAGGCCACGCCACACCACAGCACAGGACAGGGCAGGGCAGGGCTGGGCTGGGCTGCGCTGCCCTGCCGTGCCGCGCCGCCCCGCCCCGCCCGGCCCGCCCCGCACCGCAGCGCAGAGCAGAACAGAATAGAATCGAATCGAATCGCATCGCATCGCAGCGCAGCGCAGCTCAGCTGAGCTGCGCTGCACTGCACTGCACGGCACGACACGATACGATC\",
\"label\": \"promoter\"
},
{
\"seq\": \"CGCAGTGCAGTGCAGTGGAGTGGTGTGGTCTGGTCTGGTCTTGTCTTGTCTTGGCTTGGCTTGGCATGGCAGGGCAGCGCAGCTCAGCTGAGCTGCGCTGCCCTGCCATGCCAGGCCAGGCCAGGACAGGAGAGGAGTGGAGTAGAGTAGAGTAGGGTAGGTTAGGTAAGGTAGGGTAGTGTAGTGTAGTGCAGTGCCGTGCCTTGCCTCGCCTCCCCTCCGCTCCGGTCCGGACCGGACCGGACCGGACCCGACCCTACCCTCCCCTCGCCTCGCCTCGCTTCGCTACGCTAGGCTAGGCTAGGTTAGGTGAGGTGGGGTGGCGTGGCATGGCACGGCACTGCACTACACTATACTATCCTATCATATCACATCACATCACAACACAAAACAAAGCAAAGGAAAGGAAAGGACAGGACAGGACAAGACAAAACAAAGCAAAGCAAAGCTAAGCTGAGCTGGGCTGGTCTGGTTTGGTTGGGTTGAGTTGAATTGAAGTGAAGCGAAGCTAAGCTAAGCTACGCTACACTACAGTACAGAACAGAACAGAAAAGAAATGAAATCAAATCCAATCCC\",
\"label\": \"promoter\"
},
{
\"seq\": \"TACCGGACCGGACCGGAGCGGAGAGGAGACGAGACCAGACCGGACCGCACCGCACCGCACCGCACTGCACTGCACTGAACTGAACTGAAGTGAAGAGAAGACAAGACTAGACTGGACTGTACTGTTCTGTTTTGTTTTGTTTTATTTTAGTTTAGATTAGAGTAGAGTAGAGTTGAGTTGAGTTGAGTTGACTTGACTTGACTGGACTGAACTGACCTGACATGACAGGACAGTACAGTGCAGTGGAGTGGCGTGGCATGGCAGGGCAGCGCAGCGCAGCGAAGCGATGCGATTCGATTCGATTCTATTCTCTTCTCCTCTCCTCTCCTGTCCTGTCCTGTCCTGTCTTGTCTCGTCTCCTCTCCACTCCAGTCCAGCCCAGCCCAGCCCAGCCCTGCCCTCCCCTCACCTCAGCTCAGCTCAGCACAGCAGAGCAGTGCAGTGCAGTGTAGTGTCGTGTCCTGTCCCGTCCCTTCCCTTCCCTTTCCTTTGCTTTGGTTTGGGTTGGGCTGGGCAGGGCACGGCACCGCACCCCACCCAACCCAGCCCAGCCCAGCCCAGCCCAGCCCCGCCCCA\",
\"label\": \"non-promoter\"
},
{
\"seq\": \"CAGAATAGAATCGAATCGAATCGCATCGCATCGCAACGCAAGGCAAGACAAGAAAAGAATAGAATCGAATCAAATCATATCATGTCATGCCATGCAATGCAGTGCAGAGCAGAGCAGAGCAGAGCGGAGCGAAGCGACGCGACCCGACCTGACCTGACCTGACCTGATCTGATTTGATTTGATTTAATTTACTTTACGTTACGCTACGCTACGCTTCGCTTCGCTTCACTTCACTTCACCTCACCTCACCTAACCTAGCCTAGACTAGATTAGATTAGATTGGATTGAATTGACTTGACTTGACTTGACTTTACTTTTCTTTTTTTTTTATTTTATTTTATTTTATTCTATTCTATTCTGTTCTGCTCTGCACTGCATTGCATCGCATCGCATCGTATCGTTTCGTTGCGTTGTGTTGTGTTGTGTTGTGTTGTGTTCTGTTCTGTTCTTTTCTTCTCTTCCCTTCCCTTCCCCTCCCCCCCCCCACCCCACCCCACTCCACTTCACTTCACTTCCCTTCCTTTCCTCTCCTCTCCTCTTCTCTTCTCTTCTCTTCTTTTCTTGTCTTGCCTTGCT\",
\"label\": \"non-promoter\"
}
],
\"epochs\": 1
}
}"
res <- postForm("https://biolm.ai/api/v1/finetune_run/", .opts=list(postfields = params, httpheader = headers, followlocation = TRUE), style = "httppost")
cat(res)
JSON Response#
Expand Example Response
{
"id": "129",
"pipeline": {
"id": "3",
"pipeline_slug": "finetune_DNABERT_classifier"
},
"start_time": null,
"created_at": "2023-04-01T12:41:21.734731-07:00",
"end_time": null,
"status": "scheduled",
"algorithm": null,
"hyperopt": false
}
Request Definitions#
- hyperopt:
False specifies whether or not to perform hyperparameter optimization (hyperopt). If set to false, no optimization will be performed.
- input_json:
Is a nested JSON object that contains the data for training and validation, as well as configuration details like max_train and train (below)
- max_train:
40000 and “max_validate”: 20000” set the maximum number of training and validation examples, respectively.
- train:
Is an array of objects, each containing a DNA sequence (“seq”) and a corresponding label (“label”). These are the training examples for the fine-tuning process. The sequences are strings of characters representing nucleotide bases (adenine (A), cytosine (C), guanine (G), and thymine (T)), and the labels indicate the classification category for each sequence (e.g., “non-promoter” or “promoter”
- seq:
This key is associated with a string value that represents a DNA sequence. Each character in the string corresponds to a nucleotide base. The sequence provided is what the model will analyze and learn from during the fine-tuning process.
- label:
Each seq comes with a corresponding label, which is a string that categorizes the sequence. In the context of the provided example, the labels are “non-promoter” or “promoter”. These labels are used as the target outputs for the classifier, meaning that the DNABERT model will learn to predict these labels from unseen DNA sequences after being trained on the provided examples. The classifier’s goal is to determine whether a given DNA sequence functions as a promoter (a region of DNA that initiates transcription of a particular gene) or not.
Response Definitions#
- Start_time:
This field records the time when the task started processing. Null indicates that the process has not started yet.
- created_at:
The timestamp when the task was created or submitted to the system. It is in ISO 8601 date and time format with timezone information.
- end_time:
Similar to start_time, this would record when the task finished processing. Null indicates it has not finished yet or has not started.
- status:
This indicates the current state of the task. “scheduled” means that the task has been scheduled to run but has not yet started.
- algorithm:
This indicates which algorithm or method is being used for the task. null suggests that this information is either not applicable, not decided yet, or simply not provided in the response.
- hyperopt:
Indicates whether hyperparameter optimization is enabled for the task. false means that hyperparameter optimization is not being used. Hyperparameter optimization is a process to automatically select the best hyperparameters (settings) for a machine learning model to maximize its performance.