GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:21:10 Sequence gi568815579f:32481262_32687297 : 206036 bp : 45.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 818 979 162 1 0 78 43 166 0.975 9.04 1.02 PlyA + 2436 2441 6 1.05 2.03 PlyA - 2748 2743 6 1.05 2.02 Term - 13719 13583 137 1 2 107 41 28 0.313 -1.82 2.01 Init - 16894 16783 112 1 1 53 119 93 0.888 9.27 2.00 Prom - 35509 35470 40 -0.86 3.00 Prom + 53952 53991 40 -4.36 3.01 Init + 57645 57710 66 2 0 68 64 23 0.159 -1.03 3.02 Term + 63741 63869 129 2 0 83 45 119 0.833 5.28 3.03 PlyA + 67383 67388 6 1.05 4.00 Prom + 93521 93560 40 -2.56 4.01 Init + 100001 100066 66 1 0 103 72 89 0.995 9.87 4.02 Intr + 100934 100971 38 1 2 84 80 52 0.348 1.06 4.03 Intr + 103689 103750 62 0 2 93 52 25 0.120 -2.22 4.04 Intr + 104555 104646 92 0 2 75 121 15 0.887 3.21 4.05 Intr + 105597 105668 72 2 0 99 106 34 0.915 5.90 4.06 Term + 105992 106039 48 1 0 79 43 72 0.979 -0.90 4.07 PlyA + 106140 106145 6 1.05 5.02 PlyA - 106252 106247 6 1.05 5.01 Sngl - 108038 107010 1029 2 0 60 47 424 0.901 32.60 5.00 Prom - 109950 109911 40 -4.96 6.28 PlyA - 110117 110112 6 1.05 6.27 Term - 111282 110932 351 0 0 49 42 279 0.398 13.89 6.26 Intr - 117113 116905 209 0 2 9 88 107 0.503 1.60 6.25 Intr - 120865 120754 112 1 1 78 64 80 0.893 4.55 6.24 Intr - 123163 123002 162 1 0 89 96 241 0.998 25.17 6.23 Intr - 124744 124574 171 1 0 80 94 117 0.805 11.64 6.22 Intr - 126571 126374 198 1 0 82 52 219 0.711 17.25 6.21 Intr - 134519 134397 123 2 0 83 42 143 0.989 9.98 6.20 Intr - 136372 136328 45 1 0 78 101 40 0.858 2.91 6.19 Intr - 138118 137999 120 1 0 83 78 109 0.987 10.09 6.18 Intr - 138292 138233 60 1 0 93 107 23 0.925 3.63 6.17 Intr - 141358 141161 198 1 0 93 65 273 0.999 25.05 6.16 Intr - 144705 144613 93 0 0 131 86 175 0.999 21.76 6.15 Intr - 145566 145451 116 1 2 116 101 145 0.671 18.77 6.14 Intr - 149095 148959 137 0 2 27 99 77 0.248 2.91 6.13 Intr - 149276 149143 134 2 2 71 1 120 0.247 1.04 6.12 Intr - 150233 150141 93 2 0 105 100 -8 0.405 2.26 6.11 Intr - 158227 158095 133 0 1 77 58 113 0.764 7.85 6.10 Intr - 161938 161862 77 1 2 16 110 82 0.964 1.51 6.09 Intr - 162091 162026 66 1 0 81 92 45 0.894 3.30 6.08 Intr - 162223 162170 54 1 0 111 78 73 0.882 7.78 6.07 Intr - 162370 162311 60 1 0 66 107 35 0.811 2.13 6.06 Intr - 163218 163064 155 1 2 118 18 126 0.935 8.39 6.05 Intr - 165354 165198 157 0 1 101 66 50 0.983 3.68 6.04 Intr - 168531 168421 111 0 0 97 61 76 0.961 6.38 6.03 Intr - 177784 177653 132 1 0 97 87 66 0.941 8.24 6.02 Intr - 181912 181852 61 0 1 70 22 84 0.653 -1.26 6.01 Init - 185232 185183 50 0 2 93 87 64 0.939 7.22 6.00 Prom - 186098 186059 40 -8.26 7.00 Prom + 191200 191239 40 -7.56 7.01 Sngl + 195003 195710 708 2 0 105 49 1402 0.988 134.23 7.02 PlyA + 196890 196895 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:32481262_32687297|GENSCAN_predicted_peptide_1|53_aa MMDGPGENDPDLKPADHPRFCEEIKRNLPPYVAYFTRVFQNKTFHVYKLSRNK >gi568815579f:32481262_32687297|GENSCAN_predicted_CDS_1|162_bp atgatggatggcccaggagagaatgatcctgatttgaaacctgcagaccaccctcgcttc tgtgaagagatcaaaagaaacctgcctccctacgtggcctacttcaccagagtgttccag aacaaaaccttccacgtttacaagctgtccagaaacaagtag >gi568815579f:32481262_32687297|GENSCAN_predicted_peptide_2|82_aa MFVKINLTVVKGYMGDILVNSSHIISVCNGSLKDGNTGNVHFLISRTCEYVMLLGQRAFA NVMTIMNLEMREIIFYYLVVPI >gi568815579f:32481262_32687297|GENSCAN_predicted_CDS_2|249_bp atgtttgtcaaaataaatctgacggtggttaaaggttacatgggagacattcttgtcaat tctagccacattatcagtgtttgcaatggttccctgaaggatgggaatactggaaatgtc cacttcctaatctctaggacctgtgaatatgtcatgctccttggccaaagggcctttgca aatgtgatgacaattatgaaccttgagatgagggagattatcttctattatctggtggtt ccaatctaa >gi568815579f:32481262_32687297|GENSCAN_predicted_peptide_3|64_aa MGIKIQDEIVGTQNQTISEGNGDCKQTVTLHGPTVRMSTFSGSLDLQGVISRAALISCSE SSSD >gi568815579f:32481262_32687297|GENSCAN_predicted_CDS_3|195_bp atggggattaagattcaagatgagattgtggggacacagaaccaaaccatatcagaaggg aatggggactgcaagcagacagtcactctgcatggccccactgtaaggatgtcaaccttt agcggcagcttggaccttcaaggggtcatcagcagagccgcactgatcagctgctcggag tccagctctgactga >gi568815579f:32481262_32687297|GENSCAN_predicted_peptide_4|125_aa MADEELEALRRQRLAELQAKHGDPGDAAQQEAKHREAEMRNSILAQVLDQSARARLSNLA LVKPEKTKAVENYLIQMARYGQLSEKVSEQGLIEILKKVSQQTEKTTTVKFNRRKVMDSD EDDDY >gi568815579f:32481262_32687297|GENSCAN_predicted_CDS_4|378_bp atggcggacgaggagcttgaggcgctgaggagacagaggctggccgagctgcaggccaaa cacggggatcctggtgatgcggcccaacaggaagcaaagcacagggaagcagaaatgaga aacagtatcttagcccaagttctggatcagtcggcccgggccaggttaagtaacttagca cttgtaaagcctgaaaaaactaaagcagtagagaattaccttatacagatggcaagatat ggacaactaagtgagaaggtatcagaacaaggtttaatagaaatccttaaaaaagtaagc caacaaacagaaaagacaacaacagtgaaattcaacagaagaaaagtaatggactctgat gaagatgacgattattga >gi568815579f:32481262_32687297|GENSCAN_predicted_peptide_5|342_aa MSELPFTIASKRIKYLGIHLTRDVKDLFKENYKPLLNEIKEDTKKWKNIPCSWVGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTSKFIWNQKRACIAKSILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMTHIYNYLIFDKPEKNKQWGKDSLFN KWCWENWLAICKKLKLDPFLTPYTKINSRWIKDLNVRPKTIKILEENLGITIQDIGMGKD FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNGQPTKWEKIFTTYSSDKGLISTI YSELQQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKC >gi568815579f:32481262_32687297|GENSCAN_predicted_CDS_5|1029_bp atgagtgaactcccattcacaattgcttcaaagagaatcaaatacctaggaatccacctt acaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaagaaatggaagaacattccatgctcatgggtaggaagaatcaatattgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactacttcaaaattcatatggaaccaaaaaaga gcctgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaataatgacgcatatctacaac tatttgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtaaaaagctgaaactggatcccttcctt acaccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaacc ataaaaatcctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacgggcaaccc acaaaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccacaatc tacagtgaactccaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcg aaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaa aaatgctga >gi568815579f:32481262_32687297|GENSCAN_predicted_peptide_6|1125_aa MEQSADSNILQTAYFLNIGILGYQENENGIRKPFYKCEICSDPSEVHMALYDEDLLKNPF YLALQKCRPDLCSKVAQIHGIVLVPCKGSLSSSIQSTCQFESYILIPVEEHFQTLNGKDV FIQGNRIKLGAGFACLLSVPILFEETFYNEKEESFSILCIAHPLEKRESSEEPLAPSDPF SLKTIEDVREFLGRHSERFDRNIASFHRTFRECERKSLRHHIDSANALYTKCLQQLLRDS HLKMLAKQEAQMNLMKQAVEIYVHHEIYNLIFKYVGTMEASEDAAFNKITRSLQDLQQKD IGVKPEFRMANLSYIKNFRFSSLAKDELGYCLTSFEAAIEYIRQGSLSAKPPESEGFGDR LFLKQRMSLLSQMTSSPTDCLFKASSVVPPLAALAELMPPLGSRSPLSDVYGRVHWVGNT VPDFLVVWQAAEVPAGQSVSAGRGQSPLGEAAVLPGDLEPEWVLGNEHVQEAPGQASLID LLVSKGAMVNATDYHGATPLHLACQKGYQSVTLLLLHYKASAEVQDNNGNTPLHLACTYG HEDCVKALVYYDVESCRLDIGNEKGDTPLHIAARWGYQGVIETLLQNGASTEIQNRLKET PLKCALNSKILSVMEAYHLSFERRQKSSEAPVQSPQRSVDSISQESSTSSFSSMSASSRQ EETKKDYREVEKLLRAVADGDLEMVRYLLEWTEEDLEDAEDTVSAADPEFCHPLCQCPKC APAQKRLAKVPASGLGVNVTSQDGSSPLHVAALHGRADLIPLLLKHGANAGARNADQAVP LHLACQQGHFQEMTEILIKFARFPSLAQVVKCLLDSNAKPNKKDLSGNTPLIYACSGGHH ELVALLLQHGASINASNNKGNTALHEAVIEKHVFVVELLLLHGASVQVLNKRQRTAVDCA EQNSKIMELLQVVPSCVASLDDVAETDRKEYVTVKIRKKWRQSVTLRQNNLPAQSGSHAA EKGNSDWPERPGLTQTGPGHRRMLRRHTVEDAVVSQGPEAAGPLSTPQEERSSSPAMEQS WTENDFDELREEDFRRSNYELQEEIQTKGKEVKNFEKNSDECITRITNTEKCLKELMEPK AKAQELREECRSLRSRCDQLEERVSAIEDEMNEMKQEGKFREKIK >gi568815579f:32481262_32687297|GENSCAN_predicted_CDS_6|3378_bp atggagcagtctgcagacagcaacatccttcaaacagcttacttcctaaacattgggatc ttggggtaccaagagaatgagaatggaatccgcaagcccttttataagtgtgagatttgc tctgacccatctgaagtccatatggctctgtatgatgaagacctcctgaaaaatcctttc tatctggctctgcaaaagtgccgccctgacttgtgcagcaaagtggcccaaatccatggc attgtcttagtaccctgcaaaggaagcctgtcgagcagcatccagtctacttgtcagttt gagtcctacattttgatacctgtggaagagcattttcagaccttaaatggaaaggatgtc tttattcaagggaacaggattaaattaggagctggttttgcctgtcttctctcagtgccc attctctttgaagaaactttctacaatgaaaaagaagagagtttcagcatcctgtgtata gcccatcctttggaaaagagagagagttcagaagagcctttggcaccctcagatcccttt tccctgaaaaccattgaagatgtgagagagttcttgggaagacactccgagcgatttgac aggaacatcgcctctttccatcgaacattccgagaatgcgagagaaagagcctccgtcac cacatagactcagcgaatgctctctacaccaaatgcctccagcagcttctgagggactct cacctgaaaatgctcgccaagcaggaggcccagatgaacctgatgaagcaggcagtggag atatacgtccatcatgaaatttacaacctgatctttaaatacgtggggaccatggaggca agtgaggatgcggcctttaacaaaatcacaagaagccttcaagatcttcagcagaaagat attggtgtgaaaccggagttcaggatggcaaatttgagttacatcaaaaacttcaggttt agcagcttggcaaaggatgaactgggatactgcctgacctcattcgaagctgccattgaa tatattcggcaaggaagcctctctgctaaaccccctgagtctgagggatttggagacagg ctgttccttaagcagagaatgagcttactctctcagatgacttcgtctcccaccgactgc ctgtttaaggccagctcagtggtgcctccgctggcagccctggcagagctgatgcctccg ctgggctcgaggagcccgttgtcagatgtttatgggcgtgtgcactgggttgggaacacg gtgcctgacttcctggtggtctggcaggccgctgaggtgcccgcagggcagtccgtgagt gctgggcgagggcagagccccctgggggaggctgctgtgcttccaggcgacctggagcca gagtgggtcctggggaatgagcacgtccaggaagcaccagggcaggcatccctcatcgac ctcctggtttccaagggcgccatggtaaatgccacagactaccatggagccactccgctc cacctggcctgtcagaagggctaccagagcgtgacgctgctgctgctgcactacaaggcc agcgcggaagtgcaggacaacaatgggaatacgccactccacctggcctgcacctacggc cacgaggactgtgtgaaggctctggtttactacgacgtggagtcgtgcagacttgacatt ggcaatgagaaaggagacacccctctacacattgctgcccgctggggctaccaaggcgtc atagagacattgctgcagaacggagcgtccaccgagatccagaacagactgaaggagacg cccctcaagtgtgcattaaactcaaagattctgtctgtaatggaagcctatcacctgtcc ttcgagaggaggcagaagtcgtccgaggcccctgtgcagtccccgcagcgctccgtggac tccatcagccaagagtcctccacttccagcttctcctccatgtcagccagctcaaggcag gaggagaccaagaaggactacagagaggtagaaaaacttttgagagcagttgctgatgga gatctagaaatggtgcgttacctgttggaatggacagaggaggacctggaggatgcggag gacactgtcagtgcagcggaccccgaattctgtcacccgttgtgccagtgccccaagtgt gccccagctcagaagaggctggcgaaggttcctgccagtgggcttggtgtgaacgtgacc agccaggacggctcctccccgctgcatgtcgccgccctgcacggccgggcggacctcatc cccctcctgctgaagcacggggccaacgcaggtgccaggaacgcagaccaagccgtcccg ctccacctggcctgccagcagggccactttcaggaaatgacagaaattttgattaagttt gcacgttttccttctctggcccaggtggtgaagtgtctgttagattcgaatgcaaaaccc aataagaaggacctcagtggaaacacgcccctcatttacgcctgctccggtggccatcac gagcttgtggcactgctgctacagcacggggcctccattaacgcttctaacaataagggc aacacagcgctgcacgaggctgtgattgaaaagcacgtcttcgtggtagagctgcttctg ctccacggagcgtcagttcaggtgctgaacaagcggcagcgcacggctgtagactgtgct gaacagaattcaaaaataatggaattgcttcaggtggtaccaagctgtgttgcttcatta gatgatgtggctgaaactgaccgcaaggagtatgtcactgttaagatcaggaaaaaatgg aggcaaagtgtcacactgagacagaataacctgccagctcagagtggatctcatgctgct gagaaaggcaacagcgactggccagagaggcctggactgacacagactggccctggacac agacggatgctgcggagacacacggtagaggatgcggtcgtgtcccagggcccggaggct gctggccccctctccactccccaagaggaacgcagttcctcaccagcaatggaacaaagc tggacggagaatgactttgatgagctgagagaagaagacttcagacgatcaaactacgag ctacaggaggaaattcaaaccaaaggcaaagaagttaaaaactttgaaaaaaattcagac gaatgtataactagaataaccaatacagagaagtgcttaaaggagctgatggagccgaaa gccaaggctcaagaactacgtgaagaatgcagaagcctcaggagccgatgcgatcaactg gaagaaagggtatcagcgatagaagatgaaatgaatgaaatgaagcaagaagggaagttt agagaaaaaataaaatga >gi568815579f:32481262_32687297|GENSCAN_predicted_peptide_7|235_aa MAREECKALLDGLNKTTACYHHLVLTVGGSADSQNLRQELQKTRQKAQELAVSTCARLTA VLRDRGLAADERAEFERLWVAFSGCLDLLEADMRRALELGAAFPLHAPRRPLVRTGVAGA SSGVAARALSTRSLRLEAEGDFDVADLRELEREVLQVGEMIDNMEMKVNVPRWTVQARQA AGAELLSTVSAGPSSVVSLQERGGGCDPRKALAAILFGAVLLAAVALAVCVAKLS >gi568815579f:32481262_32687297|GENSCAN_predicted_CDS_7|708_bp atggcgagggaggagtgcaaggcgctgctggacgggctcaacaagacgactgcgtgctac caccacctggtgctgaccgtcggtggctcggcggactcgcagaacctgcggcaggagctg caaaagacgcgccagaaggcgcaggagctggcggtgtccacctgcgcccggctgactgct gtgctgcgcgaccggggcctggccgccgacgagcgcgccgagttcgagcggctctgggtg gccttctcgggctgcctggacctgctggaagcggacatgcgacgcgcgctggagctgggc gccgcgttcccgctgcacgcgccgcggcggccgctggtgcgcacaggtgtggctggcgcc tcctccggcgtggcggcgcgcgcgctgagcacccgcagcctgcggctcgaggcggagggc gacttcgacgtcgcggacctgcgggagctggagcgcgaggtccttcaggtgggcgagatg atcgacaacatggagatgaaggtcaacgtgccccgctggaccgtgcaagcccggcaggcg gcgggcgccgagctcctgtccacggtcagcgccggcccctcctcggtcgtgtccttgcag gagcgcggggggggttgcgaccccaggaaggccctggccgccatccttttcggcgccgtg ctgctggcggctgtggccctagccgtgtgcgtggcgaagctgagctga