GENSCAN 1.0	Date run: 6-Nov-116	Time: 17:08:46

Sequence gi568815593r:135847334_136054833 : 207500 bp : 43.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3092   3163   72  1  0  128   84   113 0.985  14.50
 1.02 Intr +   5230   5488  259  0  1  122  107   402 0.994  42.54
 1.03 Intr +  24128  24385  258  0  0  120  110   195 0.961  22.33
 1.04 Intr +  26688  26821  134  1  2  139   80    49 0.711   9.56
 1.05 Term +  32635  32757  123  0  0   35   47   163 0.067   5.28
 1.06 PlyA +  34116  34121    6                               1.05

 2.04 PlyA -  34382  34377    6                               1.05
 2.03 Term -  45177  45058  120  0  0   88   49    60 0.925   0.57
 2.02 Intr -  46818  46687  132  0  0   63  100    38 0.869   3.34
 2.01 Init -  48483  48370  114  0  0   74   96   231 0.796  20.71
 2.00 Prom -  56700  56661   40                              -3.26

 3.03 PlyA -  56995  56990    6                               1.05
 3.02 Term -  61084  60833  252  1  0   57   43   180 0.665   6.04
 3.01 Init -  63913  63869   45  1  0   89   65    14 0.245   0.04
 3.00 Prom -  73201  73162   40                              -3.16

 4.03 PlyA -  73723  73718    6                               1.05
 4.02 Term -  75101  74995  107  0  2   44   49   207 0.989  10.97
 4.01 Init -  75493  75475   19  1  1   97   94     5 0.918   2.23
 4.00 Prom -  78589  78550   40                              -4.06

 5.00 Prom +  81892  81931   40                              -3.26
 5.01 Init +  83205  83334  130  2  1   79   72    82 0.465   4.01
 5.02 Intr +  90164  90214   51  0  0   77   94    31 0.185   1.48
 5.03 Term +  93695  94338  644  0  2   67   39   193 0.492   6.63
 5.04 PlyA +  94990  94995    6                               1.05

 6.07 PlyA -  97607  97602    6                               1.05
 6.06 Term - 100164  99998  167  1  2  107   35   122 0.998   6.88
 6.05 Intr - 104035 103890  146  0  2   58   94   126 0.981  10.13
 6.04 Intr - 105634 105538   97  2  1  113   86    70 0.955   8.37
 6.03 Intr - 109318 109128  191  0  2   99   70    14 0.505  -0.07
 6.02 Intr - 111100 111011   90  0  0   62   91    58 0.620   2.61
 6.01 Init - 136968 135842 1127  0  2   60   53   318 0.258  19.17
 6.00 Prom - 140409 140370   40                              -3.46

 7.03 PlyA - 140704 140699    6                               1.05
 7.02 Term - 149578 149459  120  1  0   68   38    90 0.204   0.47
 7.01 Init - 154609 154415  195  1  0   58   81    89 0.635   4.13
 7.00 Prom - 155564 155525   40                              -4.66

 8.03 PlyA - 155669 155664    6                               1.05
 8.02 Term - 161212 161046  167  2  2   97   47    90 0.191   3.88
 8.01 Init - 169152 169143   10  0  1   93  103     0 0.340   2.70
 8.00 Prom - 173071 173032   40                              -1.86

 9.00 Prom + 174180 174219   40                              -4.96
 9.01 Init + 181723 181856  134  0  2   79  102   398 0.999  37.91
 9.02 Intr + 186430 186528   99  1  0  104   92   124 0.742  13.63
 9.03 Intr + 190543 190567   25  1  1   67   67    34 0.629  -2.77
 9.04 Intr + 192035 192082   48  1  0   91   63    64 0.300   3.08
 9.05 Intr + 196256 196310   55  1  1  102   65    11 0.004  -1.25
 9.06 Intr + 199002 199162  161  1  2  112   31   160 0.997  12.41
 9.07 Intr + 199518 199682  165  2  0  143   37   254 0.999  25.96
 9.08 Intr + 199941 200087  147  2  0  106   90   318 0.999  34.23
 9.09 Intr + 202106 202247  142  1  1  125   83   220 0.998  25.13
 9.10 Intr + 205574 205786  213  0  0   92   96   287 0.997  28.59
 9.11 Intr + 206610 206747  138  1  0  110   87    62 0.979   8.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 196725 196789   65  0  2   89   68    83 0.967   4.76

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:135847334_136054833|GENSCAN_predicted_peptide_1|281_aa
TRLQAGVGYGNTLSCIRVVYRRESMFGFFKGMSFPLASIAVYNSVVFGVFSNTQRFLSQH
RCGEPEASPPRTLSDLLLASMVAGVVSVGLGGPVDLIKIRLQMQTQPFRDANLGLKSRAV
APAEQPAYQGPVHCITTIVRNEGLAGLYRGASAMLLRDVPGYCLYFIPYVFLSEWITPEA
CTGPSPCAVWLAGGMAGAISWGTATPMDVVKSRLQADGVYLNKYKGVLDCISQSYQKEGL
KVFFRGITVNAVRGFPMSAAMFLGYELSLQAIRGDHAVTSP

>gi568815593r:135847334_136054833|GENSCAN_predicted_CDS_1|846_bp
actcgcctgcaggctggcgttggctacggaaacaccctcagctgcatccgcgtggtgtac
aggagggagagtatgttcggcttcttcaagggcatgtccttccccctcgccagcattgcc
gtctacaactccgtggtgtttggggtcttcagtaacacgcagcggttcctcagccagcac
cgctgcggggagccagaggccagtcctccccgcacgctgtcagacctgctcctggccagc
atggtggccggcgtggtctctgtcgggctgggagggcccgtggacctcatcaagatccgg
ttgcagatgcagacacaaccgtttcgggacgccaacctcggtttgaagtccagggcagtg
gctcctgcggagcagccagcataccaggggccagtgcactgcattacaaccattgtgagg
aatgagggcctggcggggctataccggggggccagtgccatgctgctgagggatgtccca
ggctattgcctctacttcatcccctacgtgttcctgagtgagtggatcacacctgaggcc
tgcacaggccccagcccctgtgccgtgtggctggcgggcggcatggcaggagcaatttct
tgggggacagcgactcctatggatgtcgtgaaaagtcgactccaagctgatggggtttat
ttaaacaaatataaaggtgtcctggactgtatctcccagagttaccagaaggaaggtctt
aaagtgtttttcagaggcatcactgtgaacgcggtgcggggcttccccatgagtgcggcc
atgttccttgggtacgagctgtcgctgcaggctatccgcggggaccacgcagtgacgagc
ccataa

>gi568815593r:135847334_136054833|GENSCAN_predicted_peptide_2|121_aa
MLLAMVLTSALLLCSVAGQGCPTLAGILDINFLINKMQDNCTRPCFSERLSQMTNTTMQT
RYPLIFSRVKKSVEVLKNNKCPYFSCEQPCNQTTAGNALTFLKSLLEIFQKEKMRGMRGK
I

>gi568815593r:135847334_136054833|GENSCAN_predicted_CDS_2|366_bp
atgcttctggccatggtccttacctctgccctgctcctgtgctccgtggcaggccagggg
tgtccaaccttggcggggatcctggacatcaacttcctcatcaacaagatgcaggacaac
tgcaccagaccatgcttcagtgagagactgtctcagatgaccaataccaccatgcaaaca
agatacccactgattttcagtcgggtgaaaaaatcagttgaagtactaaagaacaacaag
tgtccatatttttcctgtgaacagccatgcaaccaaaccacggcaggcaacgcgctgaca
tttctgaagagtcttctggaaattttccagaaagaaaagatgagagggatgagaggcaag
atatga

>gi568815593r:135847334_136054833|GENSCAN_predicted_peptide_3|98_aa
MWEFLHPILTTGDLKCAHFDEASCHIREVHVTKEPSAHSQGGIKILSPTSLAGGTESCQS
HVGLEADLSPNGAFRGDHTASVNTLTTALRETLNQRNT

>gi568815593r:135847334_136054833|GENSCAN_predicted_CDS_3|297_bp
atgtgggaatttctgcatcccatccttaccacaggggaccttaagtgtgcacactttgat
gaagccagctgccacattagagaagtccatgtgacaaaggagccttcagcgcatagccag
ggaggaatcaagatcctcagcccaacaagcttggcaggaggaactgaatcctgccaaagc
catgtgggcttggaagcagatctttcccccaacggagcctttagaggagaccacactgcc
tctgtcaacaccttaactacagccttgcgggagaccctgaatcagaggaacacgtaa

>gi568815593r:135847334_136054833|GENSCAN_predicted_peptide_4|41_aa
MEEGSAEITTDTPIFNNYYPDQSAAVNIKTRSSTSKKMMTH

>gi568815593r:135847334_136054833|GENSCAN_predicted_CDS_4|126_bp
atggaggaaggaagtgcagaaattaccacagacaccccaatcttcaacaactactaccct
gatcagtcagcggccgtcaacatcaagacgagatcctccaccagcaagaagatgatgact
cactga

>gi568815593r:135847334_136054833|GENSCAN_predicted_peptide_5|274_aa
MGSLFGPGRRALAPWATSGRSRAAERPLERPLPRPGPGRPRSRDLGLDFNSQAKFHECVG
GILCVADRCQGLRELALNYYILTDELFLALSSETHVNLEHLRIDVVSENPGQIKFHAVKK
HSWDALIKHSPRVNVVMHFFLYEEEFETFFKEETPVTHLYFGRSVSKVVLGRVGLNCPRL
IELVVCANDLQPLDNELICIAEHCTNLTALGLSKCEVSCSAFIRFVRLCERRLTQLSVME
EVLIPDEDYSLDEIHTEVSKYLGRVWFPDVMPLW

>gi568815593r:135847334_136054833|GENSCAN_predicted_CDS_5|825_bp
atgggctcgctctttgggcccgggaggcgtgcgctcgcgccctgggccacgtcggggcgg
tctcgggcggctgagcgccccctggagcgacccctcccgcggcccgggcccggccgcccc
cgaagccgggaccttgggcttgatttcaacagccaagccaagtttcatgaatgtgtcgga
ggaattctttgtgtagctgaccgttgtcaaggccttagagaactggcgttgaattattac
atcctaactgatgaacttttccttgcactctcaagcgagactcatgttaaccttgaacat
cttcgaattgatgttgtgagtgaaaatcctggacagattaaatttcatgctgttaaaaaa
cacagttgggatgcacttattaaacattcccctagagttaatgttgttatgcacttcttt
ctatatgaagaggaattcgagacgttcttcaaagaagaaacccctgttactcacctttat
tttggtcgttcagtcagcaaagtggttttaggacgggtaggtctcaactgtcctcgactg
attgagttagtggtgtgtgctaatgatcttcagcctcttgataatgaacttatttgtatt
gctgaacactgtacaaacctaacagccttgggcctcagcaaatgtgaagttagctgcagt
gccttcatcaggtttgtaagactgtgtgagagaaggttaacacagctctctgtaatggag
gaagttttgatccctgatgaggattatagcctagatgaaattcacactgaagtctccaaa
tacctgggaagagtatggttccctgatgtgatgcctctctggtaa

>gi568815593r:135847334_136054833|GENSCAN_predicted_peptide_6|605_aa
MSELPFTIASKRIKYLGIQLTRDMKDLFKENYKPLLKEIKEDTNKWKNIPCSWVRRINIV
KMAILPKVIYRFNAIPIKLSMTFFTELEKATLKFIWNQKRARIAKSILSQKNKAGGITLP
DFKLYYKATVTKTVWYWYQNRDIDQCNRTEPSEITPHIYNYLIFDKPEKNNQWGKDSLFN
KWCWENWLATCRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD
FMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSGKGLISRI
YNELKQIYKKKTNNPIKKWAKDTNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYH
LTPVRMAIIKKSGNNSLEDGTGFRGEDTLETVNAEGRENRRKAGHRAAGSSASRRCRNTG
RSVQRLRPQKTQAFPTHCHFCLILVVKATQLNPKSRVRKVPFAPMKHGKALAGPWANICA
GKSSNEIRTCDRHGCGQYSAQRSQRPHQGVDILCSAGSTVYAPFTGMIVGQEKPYQNKNA
INNGVRISGRGFCVKMFYIKPIKYKGPIKKGEKLGTLLPLQKVYPGIQSHVHIENCDSSD
PTAYL

>gi568815593r:135847334_136054833|GENSCAN_predicted_CDS_6|1818_bp
atgagtgaactcccattcacaattgcttcaaagagaataaaatacttaggaatccaactt
acaagggatatgaaggacctctttaaggagaactacaaaccactgctcaaggaaataaaa
gaggatacaaacaaatggaagaacattccatgctcatgggtaagaagaatcaatattgtg
aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctatca
atgactttcttcacagaattggaaaaagctactttaaagttcatatggaaccaaaaaaga
gcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacct
gacttcaaactatactacaaggctacagtaaccaaaacagtatggtactggtaccaaaac
agagatatagatcaatgcaacagaacagagccctcagaaataacgccgcatatctacaac
tatctgatctttgacaaacctgagaaaaacaatcaatggggaaaggattccctatttaat
aaatggtgctgggaaaactggctagccacatgtagaaagctgaaactggatcccttcctt
acaccttatacaaaaattaattcaagatggattaaagacttaaacgttagacctaaaacc
ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac
ttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct
acaaaatgggagaaaattttcgcaacctactcatctggcaaagggctaatatccagaatc
tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcg
aaggacacgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaa
aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccat
ctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagtttagaggacggc
actggcttcagaggggaagatacattggaaacagttaatgcagaaggcagggaaaatagg
aggaaagcaggccacagggcagcaggatcatctgcctcacggcgatgtagaaacacagga
aggagtgtgcaaagactaaggccccagaagacccaggctttcccgacccactgccacttc
tgcctcatcttagtggtcaaagctacacagctaaacccaaaatcaagggtgaggaaagtt
ccctttgcccccatgaaacatggcaaagcactggcagggccatgggctaatatatgtgct
ggcaagtcttccaatgagatccggacgtgtgaccgccatggctgtggacagtactctgct
caaagaagtcagaggcctcaccagggtgtggacatcttgtgctctgctggatctactgtg
tacgcaccattcactggaatgattgtgggccaggagaaaccttatcaaaacaagaatgct
atcaataatggtgttcgaatatctggaagaggtttttgtgtcaaaatgttctacattaag
ccaattaagtataaaggtcctattaagaagggagaaaaacttggaactctattgcccttg
cagaaagtttatcctggcatacaatcgcatgtgcacattgaaaactgtgactcgagtgac
cctactgcatacctgtaa

>gi568815593r:135847334_136054833|GENSCAN_predicted_peptide_7|104_aa
MRGKPPPHSRKHKLPSLLAFKAEEGHSPACQPAGHGVTLWGHTPCRLHTAPLSKHGPTEV
NEKQMGSRVLGRRHEVSNKYEGIIIWMKCQRTIVQAMLPAVAFD

>gi568815593r:135847334_136054833|GENSCAN_predicted_CDS_7|315_bp
atgagggggaaaccacccccacactctaggaaacacaagctccccagcctcttggccttc
aaggcagaggagggccactccccagcctgccagccagcaggccatggggtcaccctgtgg
ggccacaccccatgcaggctgcatactgccccattgtctaagcatggccccactgaagtc
aacgagaaacagatgggaagccgggtcctgggaagaaggcatgaagtttctaacaaatat
gaaggcatcatcatctggatgaagtgtcagaggaccatagtccaggcaatgctgcctgct
gtggcttttgattag

>gi568815593r:135847334_136054833|GENSCAN_predicted_peptide_8|58_aa
MPHAKEVPLPMCPNMHQMVKQARFLCLLENDKSGCIRYILYMKDGTLVLPKGLFKMPP

>gi568815593r:135847334_136054833|GENSCAN_predicted_CDS_8|177_bp
atgccccatgccaaggaagtacctctccccatgtgtccaaacatgcaccagatggtgaag
caggcgaggttcctctgcctcctggaaaacgacaagagtggatgtattaggtacatcctg
tacatgaaggatgggaccctggttttaccaaaaggtcttttcaaaatgcctccatga

>gi568815593r:135847334_136054833|GENSCAN_predicted_peptide_9|443_aa
MALFVRLLALALALALGPAATLAGPAKSPYQLVLQHSRLRGRQHGPNVCAVQKVIGTNRK
YFTNCKQWYQRKICGKSTALGTADVKGVSSLAVSVSDLGGEQMRKLRPREMMKLAEDGPA
ALPLSNLYETLGVVGSTTTQLYTDRTEKLRPEMEGPGSFTIFAPSNEAWASLPAEVLDSL
VSNVNIELLNALRYHMVGRRVLTDELKHGMTLTSMYQNSNIQIHHYPNGIVTVNCARLLK
ADHHATNGVVHLIDKVISTITNNIQQIIEIEDTFETLRAAVAASGLNTMLEGNGQYTLLA
PTNEAFEKIPSETLNRILGDPEALRDLLNNHILKSAMCAEAIVAGLSVETLEGTTLEVGC
SGDMLTINGKAIISNKDILATNGVIHYIDELLIPDSAKTLFELAAESDVSTAIDLFRQAG
LGNHLSGSERLTLLAPLNSVFKX

>gi568815593r:135847334_136054833|GENSCAN_predicted_CDS_9|1329_bp
atggcgctcttcgtgcggctgctggctctcgccctggctctggccctgggccccgccgcg
accctggcgggtcccgccaagtcgccctaccagctggtgctgcagcacagcaggctccgg
ggccgccagcacggccccaacgtgtgtgctgtgcagaaggttattggcactaataggaag
tacttcaccaactgcaagcagtggtaccaaaggaaaatctgtggcaaatcaacggcactg
gggacagctgacgtgaagggggtatcaagcctggcagttagtgtcagcgacttaggaggt
gaacaaatgaggaaactgagacccagagagatgatgaaattggctgaggatggcccagct
gccctaccactctcaaacctttacgagaccctgggagtcgttggatccaccaccactcag
ctgtacacggaccgcacggagaagctgaggcctgagatggaggggcccggcagcttcacc
atcttcgcccctagcaacgaggcctgggcctccttgccagctgaagtgctggactccctg
gtcagcaatgtcaacattgagctgctcaatgccctccgctaccatatggtgggcaggcga
gtcctgactgatgagctgaaacacggcatgaccctcacctctatgtaccagaattccaac
atccagatccaccactatcctaatgggattgtaactgtgaactgtgcccggctgctgaaa
gccgaccaccatgcaaccaacggggtggtgcacctcatcgataaggtcatctccaccatc
accaacaacatccagcagatcattgagatcgaggacacctttgagacccttcgggctgct
gtggctgcatcagggctcaacacgatgcttgaaggtaacggccagtacacgcttttggcc
ccgaccaatgaggccttcgagaagatccctagtgagactttgaaccgtatcctgggcgac
ccagaagccctgagagacctgctgaacaaccacatcttgaagtcagctatgtgtgctgaa
gccatcgttgcggggctgtctgtagagaccctggagggcacgacactggaggtgggctgc
agcggggacatgctcactatcaacgggaaggcgatcatctccaataaagacatcctagcc
accaacggggtgatccactacattgatgagctactcatcccagactcagccaagacacta
tttgaattggctgcagagtctgatgtgtccacagccattgaccttttcagacaagccggc
ctcggcaatcatctctctggaagtgagcggttgaccctcctggctcccctgaattctgta
ttcaaagnn