GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:14:11 Sequence gi568815591r:64421164_64644433 : 223270 bp : 40.05% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12798 13215 418 2 1 80 -10 376 0.013 23.64 1.02 Intr + 41895 42087 193 1 1 56 93 169 0.927 11.83 1.03 Term + 48600 49029 430 0 1 12 54 774 0.819 60.19 1.04 PlyA + 49781 49786 6 1.05 2.00 Prom + 57576 57615 40 -3.95 2.01 Init + 60252 60382 131 2 2 96 75 132 0.679 12.37 2.02 Intr + 66535 66605 71 1 2 56 58 73 0.222 -0.99 2.03 Intr + 69564 69650 87 1 0 58 56 78 0.159 0.62 2.04 Intr + 70859 70988 130 0 1 27 111 126 0.577 7.63 2.05 Term + 74307 74481 175 0 1 -2 37 239 0.213 6.05 2.06 PlyA + 74698 74703 6 1.05 3.02 PlyA - 75211 75206 6 1.05 3.01 Sngl - 79702 79484 219 1 0 53 47 300 0.945 17.36 3.00 Prom - 85472 85433 40 -7.35 4.03 PlyA - 85533 85528 6 1.05 4.02 Term - 101337 99998 1340 1 2 65 41 1969 0.823 180.58 4.01 Init - 107756 107540 217 2 1 70 75 55 0.393 1.30 4.00 Prom - 109168 109129 40 -10.65 5.00 Prom + 109739 109778 40 -9.55 5.01 Init + 111758 112161 404 1 2 33 75 221 0.159 11.45 5.02 Term + 125612 125714 103 2 1 123 47 97 0.583 5.87 5.03 PlyA + 126162 126167 6 1.05 6.07 PlyA - 126305 126300 6 1.05 6.06 Term - 141814 141603 212 2 2 100 44 205 0.667 13.77 6.05 Intr - 142025 141883 143 1 2 61 30 185 0.663 9.08 6.04 Intr - 142607 142520 88 0 1 41 93 76 0.291 1.51 6.03 Intr - 148049 147949 101 1 2 67 57 86 0.312 2.23 6.02 Intr - 148528 148269 260 1 2 9 25 225 0.217 3.64 6.01 Init - 149274 148609 666 0 0 101 -3 949 0.386 80.36 6.00 Prom - 151703 151664 40 -8.65 7.03 PlyA - 152408 152403 6 1.05 7.02 Term - 153674 153445 230 0 2 75 39 207 0.576 10.31 7.01 Init - 156777 156657 121 0 1 68 74 77 0.676 4.70 7.00 Prom - 157225 157186 40 -5.55 8.00 Prom + 159900 159939 40 -5.65 8.01 Init + 161556 161675 120 2 0 52 82 106 0.066 4.64 8.02 Intr + 169936 170077 142 0 1 51 54 141 0.159 6.01 8.03 Intr + 173713 173763 51 2 0 106 63 40 0.097 1.26 8.04 Intr + 189999 190039 41 1 2 97 95 17 0.146 0.32 8.05 Intr + 195242 195325 84 1 0 42 94 71 0.468 2.20 8.06 Term + 200027 200470 444 1 0 -25 42 256 0.322 3.95 8.07 PlyA + 201617 201622 6 1.05 9.05 PlyA - 203741 203736 6 1.05 9.04 Term - 205112 204982 131 0 2 48 38 130 0.790 1.36 9.03 Intr - 206935 206808 128 0 2 67 77 84 0.689 4.60 9.02 Intr - 211520 211383 138 1 0 8 95 89 0.269 0.26 9.01 Init - 213094 212973 122 1 2 65 91 45 0.399 2.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:64421164_64644433|GENSCAN_predicted_peptide_1|346_aa MDVELTVEEINFPSVAYKNVIGARRASWRIISSIEQKEENKEGEDKLKMIWEYQQMIETE LKLLCCDILDVLDKNLIPAANTGESKVFYYKMKGDYHRYLAEFATGNDRKEAVENSLVAY KAASDIAMTELPPMHPTRLVSVPPSELKRDVGRERLFLHLWRKRFFQDLHPLSSKLHFNV AAIDQVWLLSAWYVAGLNCDVLESKLKIIHTGDKPFKGDECHKAFNQFSTLTNCKRIHTG EKPYKRKECGKGFNQFSHLTKHKKTHTGEKPYKCKECGKGFNQFSHLTKHKKTHTGKKSY KCEECGKAFNQFANLIKHKRIHTGEKSYNMKNDEKLLPSPQTLLNT >gi568815591r:64421164_64644433|GENSCAN_predicted_CDS_1|1041_bp atggatgtggaactgacagttgaagaaataaacttcccatctgttgcatataagaatgta attggagctagaagagcctcctggagaataatcagcagtattgaacagaaagaagaaaac aaggaaggagaagacaagctaaaaatgatttgggaatatcagcaaatgattgagactgag ctaaagttactctgttgtgacattctggatgtactggacaaaaacctcattccagcagct aacactggcgagtccaaagttttctattataaaatgaaaggggactaccacaggtatctg gcagaatttgccacaggaaatgacaggaaggaggctgtggagaacagcctagtggcttat aaagctgctagtgatattgcaatgacagaacttccaccaatgcatcctactcgcttagta tctgttccaccctcggaactgaagagagatgttggtagggagaggctcttccttcacctc tggcggaaaagattttttcaggatcttcatcccctgtcgtcgaagctacactttaatgta gcagctattgaccaagtgtggctactgagtgcctggtatgtggctggtctgaactgcgat gttctagaaagtaaacttaagataattcatactggagacaaacccttcaaaggtgatgaa tgtcacaaagcctttaaccagttctcaacccttaccaactgtaagagaattcatactgga gagaaaccctacaaacgtaaagaatgtgggaaaggttttaaccagttttcacaccttact aaacataagaaaactcatactggagagaaaccctacaaatgtaaagaatgtggcaaaggt tttaaccagttttcacaccttactaaacataagaaaactcatactggaaagaaatcctac aaatgtgaagaatgtggcaaagcttttaaccagtttgcaaaccttattaaacataaaaga attcataccggagagaaatcctacaatatgaagaatgatgaaaagcttttacccagtcct cagaccttactgaacacatga >gi568815591r:64421164_64644433|GENSCAN_predicted_peptide_2|197_aa MGVEKIIAQKIDRGRRAYKKHMGPTEHVTLRACCKESEPHEMIRLTLLIPVNQPVISDCC SEETRGIDLAFQGRYPLNRFSPPVSKLLGTNVSGAQAEKRGKSGIPGEKGSQTSVNFLSV QAIRCPSRYQALKGELPQARRQEQKNRKKLQEFEPKGWTVGVAEGEAAGTPQSGTAKPAP EMDKKNKLSTKIRRNQS >gi568815591r:64421164_64644433|GENSCAN_predicted_CDS_2|594_bp atgggtgttgagaagatcatagcccaaaagattgacagaggccgcagggcatataagaaa catatgggacctactgagcacgtcacacttagagcctgctgtaaggaatctgaaccccat gaaatgataaggctaaccctgcttattcctgtgaaccaaccagtgatctctgactgctgc tcagaagaaacaagagggatagacctagcttttcagggccgttaccccctgaaccggttc agtccacctgtgtcgaagctacttggcacaaatgtgtcaggggctcaagctgaaaaacgc ggaaaatcagggatccctggagaaaaggggtcccagacttccgtaaatttcctgtcggtt caggccataaggtgcccaagccggtaccaagcactgaaaggtgaactgccacaagccaga cgccaggaacagaagaaccgaaagaaactgcaagagtttgagccaaaaggatggactgtt ggagtagctgagggtgaagctgccggaactcctcaaagtggaacagcaaagccagcacca gagatggacaagaaaaacaaactatcaacgaaaattcgcagaaatcaaagttag >gi568815591r:64421164_64644433|GENSCAN_predicted_peptide_3|72_aa MERGSVLLVTLLAIVFDKVPTKPKTGGRREKRFDSESPTPRFYVEKPTAARGRQRGHNRS VFVSSQNQPHYF >gi568815591r:64421164_64644433|GENSCAN_predicted_CDS_3|219_bp atggagcgaggatctgttctgttggtaacattgctggccattgtgtttgataaggttccc acaaagccaaaaacaggaggaaggagggagaagagattcgattctgagtctcctactccc aggttctacgtggagaagccaactgctgctcgaggtcggcaacgtggccacaaccgctca gtcttcgtctcttcacaaaatcagccccattacttctga >gi568815591r:64421164_64644433|GENSCAN_predicted_peptide_4|518_aa MDSSSVLSTGDLMLSYCADLPPSGKWCFPESISCGRVERNPVGGALQLLRVYALCVQIRV WVGKDHWVGAGLVIYSHFTEDLWPEHSIKDSFQKVILRGYGKCGHENLQLRISCKSVDES KVFKEGYNELNQCLRTTQSKIFQCDKYVKVFHKFSNSNSHKKRNTGKKVFKCKECGKSFC MLSHLTQHIRIHTRENSYKCEECGKVLNWFSELIKHKGIHMGEKPYKCEECGKAFNQSST LIKHKKIHIEEKPFKCEECGKAFSLFSILSKHKIIHTGDKPYKCDECHKAFNWFATLTNH KRIHTGEKPFKCEECGKDFNQFSNLTKHKKIHTGEKPYKCEECGKAFNQFANLTRHKKIH TGEKSYKCEECGKAFIQSSNLTEHMRIHTGEKPYKCEECGKAFNGCSSLTRHKRIHTREN TYKCEECGKGFTLFSTLTNHKVIHTGEKSYKCDECGNVFNWPATLANHKRIHAREKPYKC EECGKAFNRSSHLTRHKKIHTGEKLYKPEKCDNNFDNT >gi568815591r:64421164_64644433|GENSCAN_predicted_CDS_4|1557_bp atggactcttccagtgttcttagcactggtgatttaatgctctcttattgtgctgatttg cctccttctgggaagtggtgctttccagagagcatcagctgtggtagggtggagaggaac ccagtgggtggggccttacaacttcttagagtatatgccctttgtgttcagatacgagtg tgggtagggaaggaccactgggtaggggcagggctagttatatattctcatttcactgaa gacctttggccagagcatagcataaaagattcttttcaaaaagtgatactgagaggatat ggaaaatgtggacatgagaatttacaattaagaataagttgtaaaagtgtggatgagtct aaggtgttcaaagaaggttataatgaacttaaccaatgtttgagaactacccagagcaaa atatttcaatgtgataaatacgtgaaagtctttcataaattttcaaattcaaacagtcat aagaaaagaaatactggaaagaaggttttcaaatgtaaagaatgtggcaaatcattttgc atgctttcacatctaacacaacatataagaattcacactagagagaattcttacaaatgt gaggaatgtggcaaagttcttaactggttctcagagcttattaaacataagggaattcat atgggagagaaaccctacaaatgtgaggaatgtggcaaagcctttaaccaatcctcaacc cttattaaacataagaaaattcatattgaagagaaacccttcaaatgtgaagaatgtggc aaagcctttagtttattctcaatccttagtaaacataagataattcatactggagacaaa ccttacaaatgtgatgaatgtcacaaagcctttaactggtttgcaacccttactaaccat aagagaattcatactggagagaaacccttcaaatgtgaagaatgtggcaaagactttaac cagttttcaaaccttactaaacataagaaaattcatactggagagaaaccctacaaatgt gaagaatgtggcaaagcttttaaccagtttgcaaaccttactagacataagaaaattcat actggagagaaatcctacaaatgtgaagaatgcggcaaagcttttatacagtcctcaaac cttactgaacatatgagaattcatactggagagaaaccctacaaatgtgaagaatgtggc aaagcttttaatgggtgctccagccttactcgacataagagaattcacactagagagaat acctacaaatgtgaagaatgtggcaaaggctttactttattttcaacccttactaaccat aaagtaattcatactggagagaaatcctacaaatgtgatgaatgtggcaatgtttttaac tggcctgcaactcttgctaatcataagagaattcatgctagagagaaaccctacaaatgt gaagaatgtggcaaagcttttaaccggtcctcacaccttactagacataagaaaattcat actggtgagaaactctacaaacctgaaaaatgtgacaataattttgataacacctaa >gi568815591r:64421164_64644433|GENSCAN_predicted_peptide_5|168_aa MIISIDAEKAFDKIQHLFMIKTLSKIGIQGTYLHVIKVIYDKPTANIILNGKKVESIPFE NWNKTRIPTLTTLLQHSTGSPSQSNQTREIKGIQISKEKVKLSLFAGDMIFYLENPKDSS KKLLELIKELSKVSRCPPQAQTPAISATIMAIWATLTCLYQTRTEQAL >gi568815591r:64421164_64644433|GENSCAN_predicted_CDS_5|507_bp atgatcatctcaatagatgcagaaaaagcatttgacaaaatccagcatctctttatgatt aaaactctcagcaaaattggcatacaagggacatacctccacgtaataaaagtcatctat gacaaacccacagccaacataatactgaatgggaaaaaagttgaaagcattccctttgag aactggaacaagacaaggatacccactctcacaactcttcttcaacatagcactggaagt cctagccagagcaatcagacaagagaaataaagggcatccaaatcagtaaagagaaagtc aaactgtcactgtttgctggtgatatgatcttttatcttgaaaaccccaaagactcctcc aaaaaactcctagaactgataaaagaattgagcaaagtttccaggtgccctccccaggca cagacaccagcaatttctgctacaataatggcaatatgggccacactgacctgtctctac caaacccgaacagaacaggccctgtga >gi568815591r:64421164_64644433|GENSCAN_predicted_peptide_6|489_aa MPPAAPGARLRLLAAAALAGLAVISRGLLSQRLEFNSPADNYTVCEGDNATLSCFMDEHV TRVAWLNRSNILYAGNDRRTRDPRVRLLINTPEEFSILVTEVGLGDEGLYTCSFQTRHQP YTTQVYLIVHVPARVVNISSPVMVNEGGNVNLLCLAVGRPEPTVTWRQLRDGFTSEGEIL EISDILRGQAGEYECVTHNGVNSAPDSRRVLVTVNYPPTITDWYKDDRLLSSGTAEGLKV QMERTRSMLLFANMSARHYGNYTCCAANRLGASSASMRLLCPGSLENSAPRPPGPLALLS ALGWLWWRIHSRYPFTFPIVTHSRHPIPLSLSSGCQQACVGVELHFNVAAIGHLWLLGAW NVAGLNCDVVERALGGGALNIIQSGTRGWDPSNQARSWSGQDGFRDLAGPLSLAAAGAPG IGRCTAKMPGPPGSLEMVRVRSDTPREGEGLVGTGRKWLWQDSRLQALGSKIRGPSCPWR SSVLSPLQP >gi568815591r:64421164_64644433|GENSCAN_predicted_CDS_6|1470_bp atgccccccgctgcgcccggggcccggctccggcttctcgccgccgccgccctggccggc ttggccgtcatcagccgggggctgctctcccagaggctggagttcaactctcctgccgac aactacacagtgtgtgaaggtgacaacgccaccctcagctgcttcatggacgagcatgtg acccgcgtggcctggctgaaccgctccaacatcctgtacgccggcaacgaccgcaggacc agggacccgcgggtgcggctgctcatcaacacccccgaggagttctccatcctcgtcacc gaggtggggctcggcgacgagggcctctacacctgctccttccagacccgccaccagccg tacaccactcaggtctacctcattgtccacgtccctgcccgcgttgtgaacatctcgtcg cctgtgatggtgaatgagggaggtaatgtgaacctgctttgcctggccgtggggcggcca gagcccacggtcacctggagacagctccgagacggcttcacctcggagggagagatcctg gagatctctgacatcctgcggggccaggccggggagtatgagtgcgtgactcacaacggg gttaactcggcgcccgacagccgccgcgtgctggtcacagtcaactatcctccgaccatc acggactggtataaggatgacagactactgagcagcggcacggccgagggcctgaaggtg cagatggagcgcactcgctcgatgcttctctttgccaacatgagcgcccggcattacggc aactatacgtgttgcgccgccaaccggctgggagcgtccagcgcctccatgcggctcctg tgcccaggatccctggagaactcagccccgaggcccccagggcccctggccctcctctcc gccctgggctggctgtggtggagaatccattctcgctacccgttcacgtttccgattgtg acccactcccgccaccccatacccctctctcttagctcaggctgtcaacaggcttgtgtg ggtgtggagctgcacttcaatgtggcagctattggccacctgtggctactgggtgcatgg aatgtggctggtctgaactgcgatgtggtagaaagggcattagggggtggcgccctaaac attatccaatcagggacgcggggctgggacccgtccaatcaggcacgcagctggagcgga caggacggcttccgggatttggcggggcctttgtctctagctgctgcgggagctccaggt attgggagatgcacagctaagatgccaggaccacctggaagcctagaaatggtgagagtg cggtccgacaccccgagagagggggaggggctggttggaaccggtcggaagtggctgtgg caggactccaggctccaagcactgggctccaaaatccgcggcccgagttgtccttggcgc agctcggtcctcagtccccttcagccataa >gi568815591r:64421164_64644433|GENSCAN_predicted_peptide_7|116_aa MTLKEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREHRVGGFPGGPGILAVDLPIPTGHR DTEAGPLGAEDTEQRRKDLELRLQPETKAAPNLESRPVHSSCVPDWMDPSPESLIG >gi568815591r:64421164_64644433|GENSCAN_predicted_CDS_7|351_bp atgactcttaaggagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccatttaaccctgagtggacacagcacatgtttcagagagcacagggttggg ggcttcccagggggtcctggcatcttagctgtggatctcccaatacctacaggtcacagg gacacagaagctgggcctctaggagcagaagacacagaacagagaagaaaagacctggag ctacgactgcagccagagacaaaggccgcgccaaatctcgaaagccgtcctgtccactcc agctgcgtgcctgattggatggatcccagcccagagtccctgattggataa >gi568815591r:64421164_64644433|GENSCAN_predicted_peptide_8|293_aa MCAGAPFTRPCHPPRPARRGPGSWGVGAVRTLRAGPPLLSVQSYPGGLQALLQLRPHSLT RHWSDVVAIGVHVICELYAVGCASVADAYKTGEKGESKKLERNRREMVEYHWALHPSDVR KEYHTLTYGIKSSGCTQCVIAEPSAQKLQKQAIESDSTLENLLKVATSAFYNRDQEKTQE KARKLRRRTKALVVALQACNVQDFQGSSAHCYQSGKSGYFKKQCPSIKKKPPQPSAGCGR DTREQTAPRDEVTRFRTSLTDRLAGIMGPRDPTPSSSSSNCHYSLGSQGDSGN >gi568815591r:64421164_64644433|GENSCAN_predicted_CDS_8|882_bp atgtgtgcgggcgcacccttcacgcggccctgtcacccgccgcgccctgcccgccgaggc cctggcagctggggggtgggcgctgttcggacgctgcgcgcgggaccacctctgctctcg gtgcagtcctacccaggaggcctgcaggctctcctgcagctcaggcctcactctctgacg cggcactggagtgatgttgtggcaattggtgtccatgtaatatgtgagctgtatgctgtg ggctgtgcctcagtagcagatgcttataagacaggagaaaagggagaaagcaaaaagttg gaaagaaacagaagggagatggtggaatatcactgggccttgcatccaagtgatgtcaga aaggagtatcacactctcacatatggtataaagtcttcaggttgtacacagtgtgtcatt gcagagcccagtgcacagaaactacaaaagcaggctatagaatcagatagcaccttagag aacctcctgaaggtagccacttcagccttttataatagggaccaggagaaaacccaagag aaagcgaggaaactcaggagaaggacaaaggctctagtagttgctttgcaagcctgcaat gtccaggatttccaaggttcatctgctcattgctatcagtctggcaagtcagggtacttt aagaagcagtgcccaagcatcaagaagaagccacctcaaccctctgcaggctgtggcaga gacactagagagcaaactgcccccagagatgaggtcactagattcagaaccagtctcaca gatcgtttggcaggaataatgggtcctagggacccaacccccagctccagcagctcaaac tgccattatagcctaggatcccaaggtgattctggaaattaa >gi568815591r:64421164_64644433|GENSCAN_predicted_peptide_9|172_aa MGPEIHHNISNGISPSKRRALHQIVDECRDMAQCFLQPGSRFRVEESSINYGIRSWAKQY VTTTTVDRSQEKEECHIIKVIAPRYTRHGTGKEKSANISVLDPEVYHNLSYGQSSGNRGK PHQIDNGSLDHTQQSHSEVASLSGEIPEVCCPALRRVMTRTHTRNEFQEQKV >gi568815591r:64421164_64644433|GENSCAN_predicted_CDS_9|519_bp atgggcccagagatacatcacaatatctccaatgggataagcccaagcaagagaagagca ctccaccaaatagttgatgagtgcagagatatggcacaatgcttcctgcagccaggatcc aggtttagggtagaagagagtagcataaactatggaatcaggtcttgggcaaagcaatat gtcacaacaaccactgtggacaggtcccaggaaaaagaggagtgtcacattatcaaggtg atagccccaaggtatacaaggcatggcacaggcaaggagaagagtgccaacatctcagtg ctggacccagaggtatatcacaatctttcttatgggcagagttcaggtaatagaggaaag ccacatcaaatagacaatgggtccttagaccacacgcagcagagtcatagtgaagtggct tcattgtctggggaaatacccgaagtttgttgtcctgcactgagaagagtaatgactcga acacacacacggaatgagtttcaggagcagaaagtttaa