GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:35:42 Sequence gi568815575r:115618945_115819229 : 200285 bp : 43.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3302 3465 164 0 2 61 92 178 0.992 15.19 1.02 Intr + 10254 10383 130 2 1 26 89 109 0.935 5.07 1.03 Intr + 10891 11023 133 2 1 27 91 94 0.957 3.30 1.04 Intr + 15056 15137 82 2 1 68 85 107 0.832 8.04 1.05 Intr + 17892 17915 24 2 0 103 81 23 0.365 1.32 1.06 Intr + 21464 21559 96 1 0 94 101 12 0.907 3.31 1.07 Intr + 24369 24564 196 2 1 79 88 95 0.913 7.59 1.08 Intr + 26077 26138 62 2 2 58 49 53 0.575 -3.15 1.09 Intr + 27159 27242 84 2 0 39 71 87 0.491 2.12 1.10 Intr + 27458 27591 134 1 2 95 75 86 0.998 7.44 1.11 Intr + 28606 28729 124 1 1 77 77 157 0.997 14.09 1.12 Term + 28949 29077 129 1 0 68 49 149 0.799 7.18 1.13 PlyA + 29207 29212 6 1.05 2.00 Prom + 30193 30232 40 -5.06 2.01 Init + 41464 41526 63 0 0 62 62 60 0.699 1.95 2.02 Intr + 45879 46285 407 2 2 71 108 627 0.614 56.15 2.03 Intr + 46389 46564 176 0 2 -14 20 223 0.670 4.88 2.04 Term + 46640 46884 245 0 2 -17 48 426 0.926 24.16 2.05 PlyA + 47536 47541 6 1.05 3.02 PlyA - 47848 47843 6 1.05 3.01 Sngl - 51476 50994 483 2 0 82 54 231 0.568 15.18 3.00 Prom - 66099 66060 40 -1.86 4.00 Prom + 81152 81191 40 -1.86 4.01 Init + 81930 81977 48 2 0 49 99 40 0.135 0.25 4.02 Intr + 83936 84750 815 1 2 17 17 724 0.092 48.44 4.03 Term + 84939 85176 238 0 1 -5 47 218 0.543 4.04 4.04 PlyA + 85746 85751 6 1.05 5.02 PlyA - 90185 90180 6 1.05 5.01 Sngl - 100480 99998 483 1 0 51 48 387 0.956 26.48 5.00 Prom - 101186 101147 40 -6.36 6.00 Prom + 106144 106183 40 -6.06 6.01 Init + 106201 106248 48 0 0 64 72 27 0.468 -0.35 6.02 Intr + 107855 108042 188 1 2 93 100 46 0.625 4.79 6.03 Term + 109127 109448 322 2 1 97 37 210 0.849 11.09 6.04 PlyA + 111293 111298 6 1.05 7.04 PlyA - 113326 113321 6 -0.45 7.03 Term - 113911 113590 322 0 1 97 37 200 0.714 10.09 7.02 Intr - 115190 115003 188 2 2 93 100 46 0.576 4.79 7.01 Init - 118435 118253 183 1 0 96 31 78 0.078 2.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 49524 49433 92 1 2 123 35 74 0.857 3.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:115618945_115819229|GENSCAN_predicted_peptide_1|452_aa XLNSNGFICDYELHELFKEANMPLPGYKVREIIQKLMLDGDRNKDGKISFDEFVYIFQEV KSSDIAKTFRKAINRKEGICALGGTSELSSEGTQHSYSEEEKYAFVNWINKALENDPDCR HVIPMNPNTDDLFKAVGDGIVLCKMINLSVPDTIDERAINKKKLTPFIIQPWLLYSEMDS KAYFHLLNQIAPKGQKEGEPRIDINMSGFNETDDLKRAESMLQQADKLGCRQFVTPADVV SGNPKLNLAFVANLFNKYPALTKPENQDIDWTLLEGETREERTFRNWMNSLGVNPHLYER IKVPVDWSKVNKPPYPKLGANMKKLENCNYAVELGKHPAKFSLVGIGGQDLNDGNQTLTL ALVWQLMRRYTLNVLEDLGDGQKANDDIIVNWVNRTLSEAGKSTSIQSFKDKTISSSLAV VDLIDAIQPGCINYDLVKSGNLTEDDKHNNAK >gi568815575r:115618945_115819229|GENSCAN_predicted_CDS_1|1359_bp natctcaacagcaacggattcatttgtgactatgaacttcatgagctcttcaaggaagct aatatgccattaccaggatataaagtgagagaaattattcagaaactcatgctggatggt gacaggaataaagatgggaaaataagttttgacgaatttgtttatatttttcaagaggta aaaagtagtgatattgccaagaccttccgcaaagcaatcaacaggaaagaaggtatttgt gctctgggtggaacttcagagttgtccagcgaaggaacacagcattcttactcagaggaa gaaaaatatgcttttgttaactggataaacaaagctttggaaaatgatcctgattgtaga catgttataccaatgaaccctaacaccgatgacctgttcaaagctgttggtgatggaatt gtgctttgtaaaatgattaacctttcagttcctgataccattgatgaaagagcaatcaac aagaagaaacttacacccttcatcattcagccttggctgctttactccgagatggattcc aaagcctatttccatcttctcaatcaaatcgcaccaaaaggacaaaaggaaggtgaacca cggatagatattaacatgtcaggtttcaatgaaacagatgatttgaagagagctgagagt atgcttcaacaagcagataaattaggttgcagacagtttgttacccctgctgatgttgtc agtggaaaccccaaactcaacttagctttcgtggctaacctgtttaataaatacccagca ctaactaagccagagaaccaggatattgactggactctattagaaggagaaactcgtgaa gaaagaaccttccgtaactggatgaactctcttggtgtcaatcctcacttatatgaacga attaaagttcctgttgactggagtaaggttaataaacctccatacccgaaactgggagcc aacatgaaaaagctagaaaactgcaactatgctgttgaattagggaagcatcctgctaaa ttctccctggttggcattggagggcaagacctgaatgatgggaaccaaaccctgacttta gctttagtctggcagctgatgagaagatataccctcaatgtcctggaagatcttggagat ggtcagaaagccaatgacgacatcattgtgaactgggtgaacagaacgttgagtgaagct ggaaaatcaacttccattcagagttttaaggacaagacgatcagctccagtttggcagtt gtggatttaattgatgccatccagccaggctgtataaactatgaccttgtgaagagtggc aatctaacagaagatgacaagcacaataatgccaagtaa >gi568815575r:115618945_115819229|GENSCAN_predicted_peptide_2|296_aa MPSEKRVLAAALVEVVLEKTLAQVVHCTVKTDSRCRELVPPIPDAMSSKGSVVLAYSGGL DTSCILVLLKEQGYDIIAYVANTGQKEDFEEARKKALKLGGKKVFIEEVSKEFVKEFIWL AIQSSALYEDHYLLGTSLTRPCIARKQVEIAQQERAKMPKFYNRFKVRNDLMEHTKQHGI PIPVTPKNLWNMDENLMQISNEAGILENPKNQAFPGVPMKVTNVKDGTTHQTSLELFLYL NEDVGKYSLGRIDIKENHFTGMKSPGIYETPADTIVYHAHLDIRAFTMDREVCKIK >gi568815575r:115618945_115819229|GENSCAN_predicted_CDS_2|891_bp atgcccagtgaaaaacgtgttcttgcagcagctcttgtggaagttgtccttgaaaaaact ttggcccaagtggttcactgcactgttaagacagattccagatgccgggaactcgtgcct ccaattccagatgctatgtccagcaaaggctctgtggttctggcctacagtggcggcctg gacacctcctgcatcctcgtgttactgaaggaacaaggctatgacatcattgcctacgtg gccaacactggccagaaggaagactttgaggaagccaggaagaaggcactgaagcttggg ggcaaaaaggtgttcattgaggaagtcagcaaggagtttgtgaaggagttcatctggctg gccatccagtccagcgcactgtatgaggaccactacctcctgggcacctctctcaccagg ccctgcatcgcccgaaaacaagtggaaatcgcccagcaggagagggccaagatgcccaag ttctacaacaggttcaaggtccgaaatgaccttatggaacacacaaagcaacacgggatt cccatcccagtcactcccaagaacctgtggaacatggacgagaacctcatgcagatcagc aatgaggctggaatcttggagaaccctaagaaccaagcatttccaggagtccccatgaaa gtgaccaacgtcaaggatggcaccacccaccagacctccttggagctcttcctgtacctg aacgaagacgtgggcaagtacagcttgggccgtattgacatcaaggagaaccacttcact ggaatgaagtccccaggtatctatgagaccccagcagacaccatcgtttaccatgctcat ttagacatcagggccttcaccatggaccgggaagtatgcaaaatcaaataa >gi568815575r:115618945_115819229|GENSCAN_predicted_peptide_3|160_aa MKDLFKENYKPLLKEIKEDTIKWKNIPCSWVRRINIMKVAILPKVIDRFNAIPIKLPMTF FTELEKTTLKFIWNQKRALITKSFLSQKNKAGGITLPDFKLHYKATVTKTAWYWYQNRDI DQWNRIEPSEITPHIYNYLIFDKPEKSKQWGKDSLFNKWC >gi568815575r:115618945_115819229|GENSCAN_predicted_CDS_3|483_bp atgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggataca atcaaatggaagaacattccatgctcatgggtaagaagaatcaatatcatgaaagtggcc atactgcccaaggtaattgatagattcaatgccatccccatcaagctaccaatgactttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagccctcatc accaagtcattcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaa ctacactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatata gatcaatggaacagaatagagccttcagaaataacgccgcatatctacaactatctgatc tttgacaaacctgagaaaagcaagcaatggggaaaggattccctatttaataaatggtgc tga >gi568815575r:115618945_115819229|GENSCAN_predicted_peptide_4|366_aa MSKPGMVAHAYNLSTLHRGAHVRVLSAPPHFHFGQTNRTPKFLRRFPAGKVPAFEGDDRF CVFESNAIAYYVSNEELQGSTPEAAAQVMQWVSFADSNIVSPASTWVFPTLGIMRHNKQA TENTKEEVRQILGLLDAHLKTRTFLEGERVTLTDITVVCTLLWLYKQVLEPSFCQAFPNT NRWFLTCINQPQFCTVLGKAKLCEKMAQSDAKKFVESQPKKDTPRKEKRSREEKQKPQAE QEEKKAAAPTPEEEMDECEQALAAEPKAKDPFAHLPKSTFVLDELKLNVILFGTNNISSI SGVWVFRGQELAFPLSPDWQVGYESYTWQKLDPGSEETQMLVREYFSWEGAFQHVGKPFN QGKIFK >gi568815575r:115618945_115819229|GENSCAN_predicted_CDS_4|1101_bp atgtccaagccgggcatggtggctcacgcctataatctcagcactttgcacagaggggct catgtccgtgtgctctccgcaccaccccatttccactttggccaaaccaaccgcacccct aaatttctccgcagatttcctgctggcaaggttccagcatttgaaggtgacgatagattc tgtgtgtttgagagcaatgccattgcctactatgtgagcaatgaggagctgcagggaagt actccagaggcagcagcccaagtgatgcagtgggtgagctttgctgatagcaatatagtg tccccagccagtacctgggtgttccccaccttgggcatcatgcgtcacaacaaacaggcc actgagaatacaaaggaggaagtgaggcaaattctggggctgttggatgctcacttgaag acaaggacttttctggagggcgaacgagtgacattgactgacatcacagttgtctgtacc ctattgtggctctataagcaggtcctagaaccttctttctgccaggcctttcccaatacc aaccgctggttcctcacctgcattaaccagcctcagttctgcactgtcttggggaaagca aaactatgtgagaagatggcccagtctgatgctaaaaagtttgtggaaagccagcctaaa aaggacacaccacggaaagagaagcgttcacgggaagagaagcagaagccccaggctgag caggaggagaaaaaggcggctgcccctactcctgaggaggagatggatgaatgtgagcag gcgctggccgctgagcccaaggccaaggacccctttgctcacctgcccaagagtaccttt gtgttggatgaattgaagctcaatgtcatcctctttggaaccaacaatatcagctccatt tctggagtctgggtcttccgaggccaggagcttgcctttccactgagtccagattggcag gtgggctatgagtcatacacatggcagaaactggatcctggcagcgaggagacccagatg ctggttcgagagtacttttcctgggagggggccttccagcatgtgggcaaacccttcaat cagggcaagatcttcaagtga >gi568815575r:115618945_115819229|GENSCAN_predicted_peptide_5|160_aa MGGSGGSCSRTQGGGRERTTCAHPLPWSIAAASFFCSSWCCLCARLVRTWYLFCETAAEE IPALAMADEKPKEGVKTENNDHINLKVAGQDGSVVQFKIKRHTPLSKLMKAYCERQGLSV RQIRFRFDGQPINETDTPAQLEMEDEDTIDVFQQQTGGVY >gi568815575r:115618945_115819229|GENSCAN_predicted_CDS_5|483_bp atgggaggctccggcggctcgtgctcgcgcacgcagggtggaggcagggagcgcacgacg tgcgcgcaccctctgccctggtccatcgctgccgcctccttcttctgcagctcctggtgc tgcttgtgtgctcgtttggtgcggacctggtacctcttttgtgaaacggcagctgaggag attccggcgcttgccatggccgacgaaaaacccaaggaaggagtcaagactgagaacaac gatcatattaatttgaaggtggcggggcaggatggttctgtggtgcagtttaagattaag aggcatacaccacttagtaaactaatgaaagcctattgtgaacgacagggattgtcagtg aggcagatcagattccgattcgacgggcaaccaatcaatgaaacagacacacctgcacag ttggaaatggaggatgaagatacaattgatgtgttccaacagcagacgggaggtgtctac tga >gi568815575r:115618945_115819229|GENSCAN_predicted_peptide_6|185_aa MLRHGEQKRKRARKKWDFLPTCAFKTVRAATERVRHGADRLRGGGRDAHELKYPDTPSTS TTTSNTAPTGPLSRSPKPRTQGGTPRRAASSGGHRPNGHGTQHWQSALLTPQACSVADGA SRAEDPARPSPRLLPREGAPGKLPKAPSPGSLAEASAGPAQIMAATRLPSHGFLSGNGPA SWLSS >gi568815575r:115618945_115819229|GENSCAN_predicted_CDS_6|558_bp atgttgagacatggagagcagaaaaggaagcgagcccggaagaagtgggacttcttgccc acgtgcgccttcaaaacggtaagagctgcaactgaacgtgtgagacatggtgcagatagg ctgagaggcggcgggagagatgcccatgaactcaagtacccggacacgccctccacttct accaccacgagtaacaccgcccccacgggaccgctctcgaggtcccccaagccaaggacg caaggaggaacgccccggcgcgcggccagcagcggcgggcaccggcccaatggccacgga actcagcactggcagtcggccctcctcacaccgcaggcgtgcagtgtggccgacggagcc tcccgggccgaggacccagctaggccgtcaccccggttgctcccacgggaaggggcacca ggcaaactgcccaaggccccgagcccaggctccctggcggaggcctccgctggtcccgcc cagatcatggccgccaccaggctcccgagccatggcttcctgtccgggaacggcccggcg tcctggctgtccagctag >gi568815575r:115618945_115819229|GENSCAN_predicted_peptide_7|230_aa MAREPGGGHDLGGTSGASAREPGLGALGSLPGAPSRGSNRGDGLAGSSAREAPSATLHAC GDFLPTCAFKTVRAATERVRHGADRLRGGGRDAHELKYPDTPSTSTTTSNTAPTGPLSRS PKPRTQGGTPRRRPAAAGTRANGHGTQHWQSALLTPQACSVADGASRAEDPARPSPRLLP REGAPGKLPKAPSPGSLAEASAGPAQIMAATRLPSRGFLSGNGPASWLSS >gi568815575r:115618945_115819229|GENSCAN_predicted_CDS_7|693_bp atggctcgggagcctggtggcggccatgatctgggcgggaccagcggagcctccgccagg gagcctgggctcggggccttgggcagtttgcctggtgccccttcccgtgggagcaaccgg ggtgacggcctagctgggtcctcggcccgggaggctccgtcggccacactgcacgcctgc ggtgacttcttgcccacgtgcgccttcaaaacggtaagagctgcaactgaacgtgtgaga catggtgcagataggctgagaggcggcgggagagatgcccatgaactcaagtacccggac acgccctccacttctaccaccacgagtaacaccgcccccacgggaccgctctcgaggtcc cccaagccaaggacgcaaggaggaacgccccggcgccggccagcagcggcgggcacccgc gccaatggccacggaactcagcactggcagtcggccctcctcacaccgcaggcgtgcagt gtggccgacggagcctcccgggccgaggacccagctaggccgtcaccccggttgctccca cgggaaggggcaccaggcaaactgcccaaggccccgagcccaggctccctggcggaggcc tccgctggtcccgcccagatcatggccgccaccaggctcccgagccgtggcttcctgtcc gggaacggcccggcgtcctggctgtccagctag