GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:08:16 Sequence gi568815595r:53125759_53355942 : 230184 bp : 48.75% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 236 151 86 2 2 106 97 31 0.255 4.42 1.01 Init - 4642 4580 63 1 0 64 78 132 0.485 8.96 1.00 Prom - 12343 12304 40 -2.06 2.00 Prom + 14035 14074 40 -5.86 2.01 Init + 16489 16604 116 0 2 51 77 83 0.363 1.99 2.02 Intr + 18339 18404 66 0 0 97 89 44 0.543 3.42 2.03 Intr + 24079 24308 230 1 2 49 81 101 0.085 3.01 2.04 Intr + 31013 31197 185 0 2 23 94 61 0.104 -0.29 2.05 Intr + 35578 35670 93 0 0 59 107 31 0.330 2.26 2.06 Intr + 39346 39457 112 0 1 134 91 -2 0.273 4.65 2.07 Intr + 40304 40364 61 0 1 59 65 46 0.073 -2.81 2.08 Intr + 52646 52779 134 2 2 81 96 289 0.224 29.29 2.09 Intr + 53819 54018 200 0 2 105 82 332 0.999 33.37 2.10 Intr + 55449 55509 61 2 1 85 86 83 0.988 6.11 2.11 Intr + 55686 55848 163 1 1 68 85 215 0.957 18.23 2.12 Intr + 55943 55974 32 2 2 107 88 46 0.940 4.27 2.13 Intr + 57179 57232 54 0 0 107 60 28 0.428 0.85 2.14 Intr + 57363 57448 86 1 2 72 47 162 0.999 10.04 2.15 Intr + 57694 57823 130 0 1 62 100 195 0.570 18.37 2.16 Intr + 59116 59216 101 2 2 109 80 124 0.998 13.53 2.17 Intr + 59846 59942 97 1 1 102 67 33 0.981 2.18 2.18 Intr + 60169 60269 101 2 2 52 100 201 0.987 17.53 2.19 Intr + 60409 60582 174 0 0 109 44 306 0.736 28.44 2.20 Intr + 60846 60937 92 2 2 120 75 162 0.662 16.89 2.21 Intr + 62962 63100 139 1 1 119 83 266 0.805 29.67 2.22 Intr + 63300 63488 189 2 0 107 44 341 0.877 31.38 2.23 Intr + 64115 64243 129 1 0 65 68 85 0.971 5.09 2.24 Term + 66350 66508 159 1 0 82 51 286 0.994 22.24 2.25 PlyA + 66861 66866 6 -0.45 3.03 PlyA - 66927 66922 6 1.05 3.02 Term - 67662 67659 4 2 1 142 48 0 0.014 -2.22 3.01 Init - 73617 73496 122 0 2 103 27 133 0.169 6.36 3.00 Prom - 95683 95644 40 -1.96 4.16 PlyA - 95706 95701 6 1.05 4.15 Term - 99218 99021 198 2 0 114 53 84 0.276 4.90 4.14 Intr - 100173 100025 149 1 2 158 40 185 0.840 20.65 4.13 Intr - 101120 100998 123 0 0 129 116 233 0.999 30.76 4.12 Intr - 102391 102298 94 1 1 120 100 174 0.999 21.34 4.11 Intr - 102601 102518 84 1 0 80 86 176 0.975 16.62 4.10 Intr - 103379 103249 131 0 2 73 100 116 0.999 11.71 4.09 Intr - 103678 103522 157 1 1 123 109 302 0.999 35.38 4.08 Intr - 104863 104699 165 1 0 94 78 452 0.999 44.96 4.07 Intr - 105792 105599 194 1 2 75 82 253 0.991 22.61 4.06 Intr - 107516 107398 119 1 2 94 60 248 0.985 22.71 4.05 Intr - 109416 109225 192 2 0 90 79 244 0.924 22.31 4.04 Intr - 114590 114493 98 2 2 80 92 207 0.995 19.11 4.03 Intr - 115487 115374 114 2 0 117 131 159 0.999 23.74 4.02 Intr - 116484 116367 118 2 1 75 116 179 0.929 19.87 4.01 Init - 130184 130078 107 2 2 108 92 147 0.946 16.79 4.00 Prom - 144240 144201 40 -4.86 5.08 PlyA - 145873 145868 6 1.05 5.07 Term - 155696 155454 243 2 0 51 48 141 0.656 2.30 5.06 Intr - 165098 165033 66 2 0 34 107 79 0.861 3.50 5.05 Intr - 167069 166311 759 2 0 91 95 131 0.880 5.77 5.04 Intr - 186621 186483 139 2 1 98 72 229 0.738 22.77 5.03 Intr - 193715 193649 67 0 1 63 80 51 0.175 -0.24 5.02 Intr - 212387 212333 55 2 1 70 85 52 0.079 1.65 5.01 Init - 221759 221625 135 2 0 83 92 187 0.892 18.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 148045 148137 93 0 0 85 86 80 0.806 7.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:53125759_53355942|GENSCAN_predicted_peptide_1|50_aa MGSQEVLGHAARLASSGLLLQVLFRLITFVLNAFILRFLSKEIVGVVNVS >gi568815595r:53125759_53355942|GENSCAN_predicted_CDS_1|150_bp atgggcagccaggaggtgctgggccacgcggcccggctggcctcctccggtctcctcctg caggtgttgtttcggttgatcacctttgtcttgaatgcatttattcttcgcttcctgtca aaggaaatcgttggcgtagtaaatgtaagn >gi568815595r:53125759_53355942|GENSCAN_predicted_peptide_2|967_aa MPPFPKAVINISVLTAPTAPQEHLDLYMRDLILQENPPSVPICSVEELASLVQEGPGKRP RDFQGRACQCSWVQGAPEVQANPMQLNFWPLLYGGWGLKKPRNMQSLRLLSAVSGHSDVN TGNLVFTEYLLCTRPSQVSPSQPANLEQPPFSSGPPPTLFPMGQAGFGELSSEDVGDEGL RGGSGELRHPQARERKERQSAPRRTPSARAGERRPRRCRRDPWRLPLQREVCRELARQGG RPVSPGGWCVVAAAGARTKDKQELGAPGVLPEQTLTANPTGCYAPIGLPTAGPTMAPFLR IAFNSYELGSLQAEDEANQPFCAVKMKEALSTERGKTLVQKKPTMYPEWKSTFDAHIYEG RVIQIVLMRAAEEPVSEVTVGVSVLAERCKKNNGKAEFWLDLQPQAKVLMSVQYFLEDVD CKQSMRSEDEAKFPTMNRRGAIKQAKIHYIKNHEFIATFFGQPTFCSVCKDFVWGLNKQG YKCRRSSFEGWQEEDSSAGPLRECNAAIHKKCIDKIIGRCTGTAANSRDTIFQKERFNID MPHRFKVHNYMSPTFCDHCGSLLWGLVKQGLKCEDCGMNVHHKCREKVANLCGINQKLLA EALNQVTQRASRRSDSASSEPVGIYQGFEKKTGVAGEDMQDNSGTYGKIWEGSSKCNINN FIFHKVLGKGSFGKVLLGELKGRGEYFAIKALKKDVVLIDDDVECTMVEKRVLTLAAENP FLTHLICTFQTKDHLFFVMEFLNGGDLMYHIQDKGRFELYRATDLKLDNVLLDRDGHIKI ADFGMCKENIFGESRASTFCGTPDYIAPEILQGLKYTFSVDWWSFGVLLYEMLIGQSPFH GDDEDELFESIRVDTPHYPRWITKESKDILEKLFEREPTKRLGVTGNIKIHPFFKTINWT LLEKRRLEPPFRPKVKSPRDYSNFDQEFLNEKARLSYSDKNLIDSMDQSAFAGFSFVNPK FEHLLED >gi568815595r:53125759_53355942|GENSCAN_predicted_CDS_2|2904_bp atgcctccctttccaaaggccgtgatcaacatcagtgtcctaacagcacccactgcgccc caggagcatctggacctttacatgcgtgatctcattcttcaagagaaccctccaagtgtg cctatttgctccgtggaggagttggccagcctggtccaagagggccctggaaaaagaccc agagacttccagggcagggcctgccagtgctcctgggtgcagggagccccagaagtccag gccaaccccatgcagcttaacttttggcccttgctatatggaggctgggggctgaagaag ccaagaaacatgcagtcactaaggttgctgagtgctgtctctgggcacagcgatgtcaac acaggaaacctggtcttcactgagtacctgctgtgtacccggccctcacaagtcagtcct tcccagcctgccaacctcgagcagccgcctttcagctcagggccaccccctacactgttt cccatgggccaagctgggtttggtgagctgtcctctgaggatgtgggggatgaggggctg agaggtgggagcggggagctgaggcaccctcaggccagggagcgaaaggaaaggcagtca gcgccgcgccgaaccccgtccgcgcgcgccggggagcggcgcccccgccgctgccgccgc gacccttggcgcctgcccctgcaacgggaggtctgcagggaactggccaggcaagggggc aggcccgtttctcctggtggttggtgcgttgtagcagcagcgggagccaggactaaggac aagcaggagctgggagccccaggagtgctccctgagcagaccctcacagccaaccctact ggctgttacgcacctataggtctccccactgcaggccccaccatggcgccgttcctgcgc atcgccttcaactcctatgagctgggctccctgcaggccgaggacgaggcgaaccagccc ttctgtgccgtgaagatgaaggaggcgctcagcacagagcgtgggaaaacactggtgcag aagaagccgaccatgtatcctgagtggaagtcgacgttcgatgcccacatctatgagggg cgcgtcatccagattgtgctaatgcgggcagcagaggagccagtgtctgaggtgaccgtg ggtgtgtcggtgctggccgagcgctgcaagaagaacaatggcaaggctgagttctggctg gacctgcagcctcaggccaaggtgttgatgtctgttcagtatttcctggaggacgtggat tgcaaacagtctatgcgcagtgaggacgaggccaagttcccaacgatgaaccgccgcgga gccatcaaacaggccaaaatccactacatcaagaaccatgagtttatcgccaccttcttt gggcaacccaccttctgttctgtgtgcaaagactttgtctggggcctcaacaagcaaggc tacaaatgcaggcgctcctccttcgagggctggcaggaggaagactcaagcgctgggcct ctgcgggaatgtaacgctgccatccacaagaaatgcatcgacaagatcatcggcagatgc actggcaccgcggccaacagccgggacactatattccagaaagaacgcttcaacatcgac atgccgcaccgcttcaaggttcacaactacatgagccccaccttctgtgaccactgcggc agcctgctctggggactggtgaagcagggattaaagtgtgaagactgcggcatgaatgtg caccataaatgccgggagaaggtggccaacctctgcggcatcaaccagaagcttttggct gaggccttgaaccaagtcacccagagagcctcccggagatcagactcagcctcctcagag cctgttgggatatatcagggtttcgagaagaagaccggagttgctggggaggacatgcaa gacaacagtgggacctacggcaagatctgggagggcagcagcaagtgcaacatcaacaac ttcatcttccacaaggtcctgggcaaaggcagcttcgggaaggtgctgcttggagagctg aagggcagaggagagtactttgccatcaaggccctcaagaaggatgtggtcctgatcgac gacgacgtggagtgcaccatggttgagaagcgggtgctgacacttgccgcagagaatccc tttctcacccacctcatctgcaccttccagaccaaggaccacctgttctttgtgatggag ttcctcaacgggggggacctgatgtaccacatccaggacaaaggccgctttgaactctac cgtgccacggacctcaaactggacaatgtgctgctggaccgggatggccacatcaagatt gccgactttgggatgtgcaaagagaacatattcggggagagccgggccagcaccttctgc ggcacccctgactatatcgcccctgagatcctacagggcctgaagtacacattctctgtg gactggtggtctttcggggtccttctgtacgagatgctcattggccagtcccccttccat ggtgatgatgaggatgaactcttcgagtccatccgtgtggacacgccacattatccccgc tggatcaccaaggagtccaaggacatcctggagaagctctttgaaagggaaccaaccaag aggctgggagtgaccggaaacatcaaaatccaccccttcttcaagaccataaactggact ctgctggaaaagcggaggttggagccacctttcaggcccaaagtgaagtcacccagagac tacagtaactttgaccaggagttcctgaacgagaaggcgcgcctctcctacagcgacaag aacctcatcgactccatggaccagtctgcattcgctggcttctcctttgtgaaccccaaa ttcgagcacctcctggaagattga >gi568815595r:53125759_53355942|GENSCAN_predicted_peptide_3|41_aa MPSHCPALEPGPHPRLGRLASTTHLLEDMNSKALGFGYLPP >gi568815595r:53125759_53355942|GENSCAN_predicted_CDS_3|126_bp atgcccagccactgccctgctcttgagccgggtccccaccctcggctggggcggctggcc tccaccacccacctgctggaggacatgaactctaaggccttgggctttggttatctgcca ccgtag >gi568815595r:53125759_53355942|GENSCAN_predicted_peptide_4|680_aa MESYHKPDQQKLQALKDTANRLRISSIQATTAAGSGHPTSCCSAAEIMAVLFFHTMRYKS QDPRNPHNDRFVLSKGHAAPILYAVWAEAGFLAEAELLNLRKISSDLDGHPVPKQAFTDV ATGSLGQGLGAACGMAYTGKYFDKASYRVYCLLGDGELSEGSVWEAMAFASIYKLDNLVA ILDINRLGQSDPAPLQHQMDIYQKRCEAFGWHAIIVDGHSVEELCKAFGQAKHQPTAIIA KTFKGRGITGVEDKESWHGKPLPKNMAEQIIQEIYSQIQSKKKILATPPQEDAPSVDIAN IRMPSLPSYKVGDKIATRKAYGQALAKLGHASDRIIALDGDTKNSTFSEIFKKEHPDRFI ECYIAEQNMVSIAVGCATRNRTVPFCSTFAAFFTRAFDQIRMAAISESNINLCGSHCGVS IGEDGPSQMALEDLAMFRSVPTSTVFYPSDGVATEKAVELAANTKGICFIRTSRPENAII YNNNEDFQVGQAKVVLKSKDDQVTVIGAGVTLHEALAAAELLKKEKINIRVLDPFTIKPL DRKLILDSARATKGRILTVEDHYYEGGIGEAVSSAVVGEPGITVTHLAVNRVPRSGKPAE LLKMFGIDRDAIAQAVTALTLLTTVDFSPCCSGSAGVSPAYKVNCAKPYTGQEQTSAGGG GLGAPATGGVPRKSSELLRG >gi568815595r:53125759_53355942|GENSCAN_predicted_CDS_4|2043_bp atggagagctaccacaagcctgaccagcagaagctgcaggccttgaaggacacggccaac cgcctacgtatcagctccatccaggccaccactgcggcgggctctggccaccccacgtca tgctgcagcgccgcagagatcatggctgtcctctttttccacaccatgcgctacaagtcc caggacccccggaatccgcacaatgaccgctttgtgctctccaagggccatgcagctccc atcctctacgcggtctgggctgaagctggtttcctggccgaggcggagctgctgaacctg aggaagatcagctccgacttggacgggcacccggtcccgaaacaagctttcaccgacgtg gccactggctccctgggccagggcctcggggccgcttgtgggatggcctacaccggcaaa tacttcgacaaggccagctaccgagtctattgcttgctgggagacggggagctgtcagag ggctctgtatgggaggccatggccttcgccagcatctataagctggacaaccttgtggcc attctagacatcaatcgcctgggccagagtgacccggccccactgcagcaccagatggac atctaccagaagcggtgcgaggccttcggttggcatgccatcatcgtggatggacacagc gtggaggagctgtgcaaggcctttggccaggccaagcaccagccaacagccatcattgcc aagaccttcaagggccgagggatcacgggggtagaagataaggagtcttggcatgggaag cccctccccaaaaacatggctgagcagatcatccaggagatctacagccagatccagagc aaaaagaagatcctggcaacccctccacaggaggacgcaccctcagtggacattgccaac atccgcatgcccagcctgcccagctacaaagttggggacaagatagccacccgcaaggcc tacgggcaggcactggccaagctgggccatgccagtgaccgcatcatcgccctggatggg gacaccaaaaattccaccttctcggagatcttcaaaaaggagcacccggaccgcttcatc gagtgctacattgctgagcagaacatggtgagcatcgcggtgggctgtgccacccgcaac aggacggtgcccttctgcagcacttttgcagccttcttcacgcgggcctttgaccagatt cgcatggccgccatctccgagagcaacatcaacctctgcggctcccactgcggcgtttcc atcggggaagacgggccctcccagatggccctagaagatctggctatgtttcggtcagtc cccacatcaactgtcttttacccaagtgatggcgttgctacagagaaggcagtggaacta gccgccaatacaaagggtatctgcttcatccggaccagccgcccagaaaatgccatcatc tataacaacaatgaggacttccaggtcggacaagccaaggtggtcctgaagagcaaggat gaccaggtgaccgttatcggggctggggtgaccctgcacgaggccttggccgctgccgaa ctgctgaagaaagaaaagatcaacatccgcgtgctggaccccttcaccatcaagcccctg gacagaaaactcattctcgacagcgctcgtgccaccaagggcaggatcctcaccgtggag gaccattattatgaaggtggcattggtgaggctgtgtccagtgcagtagtgggcgagcct ggcatcactgtcacccacctggcagttaaccgggtaccaagaagtgggaagccggctgag ctgctgaagatgtttggtatcgacagggatgccattgcacaagctgtgactgccctcaca ctgctgaccacagtggatttctccccctgctgctcgggctcagctggggtcagccctgct tataaggtcaactgtgcaaaaccttatactggccaagaacaaactagtgctgggggagga gggctgggtgccccggccactggtggagtccccaggaaatcctcagagctgttgcgagga tga >gi568815595r:53125759_53355942|GENSCAN_predicted_peptide_5|487_aa MEALSRAGQEMSLAALKQHDPYITSIADLTGQVALYTFCPKANQWVEKTRGSELFRNLLE IRKVSIYSIWFYDKNDCHRIAKLMADVVEEETRRSQQAARDKQSPSQANGCSDHRPIDIL EMLSRAKDEYERSAPSGHKHLTVEELFGTSLPKEQPAVVGLDSEEMERLPGDASQKEPNS FLPFPFEQLGGAPQSETLGVPSAAHHSVQPEITTPVLITPASITQSNEKHAPTYTIPLSP VLSPTLPAEAPTAQVPPSLPRNSTMMQAVKTTPRQRSPLLNQPVPELSHASLIANQSPFR APLNVTNTAGTSLPSVDLLQKLRLTPQHDQIQTQPLGKGAMVASFSPAAGQLATPESFIE PPSKTAAARVAASASLSNMVLAPLQSMQQNQDPEVFVQPKVLSSAIPKMEKALNLVEDMN RKRVLIDSMLHQKALSLYKDLKDLLKWGHQAIYCKKGMVTQISGIQKVNSSLMLYHNAYA ITSLLIM >gi568815595r:53125759_53355942|GENSCAN_predicted_CDS_5|1464_bp atggaggcgctgagtcgagctgggcaggagatgagcctagcggccctgaagcaacacgac ccctatatcaccagcatcgcagacctcacgggccaggtcgctctgtacaccttctgcccc aaggccaaccagtgggttgagaaaacaagaggctcagagctgtttaggaatctgctcgag attcgcaaagtgtcgatatatagtatctggttttatgacaagaatgactgtcaccgcata gcaaaactcatggctgatgtggtagaagaggagacacggcgatcccagcaagctgctcgg gacaaacagagtcccagccaggccaatggctgcagcgaccacaggcccatcgacatcctg gagatgctgagcagagccaaggatgagtatgagaggtctgctccatctggacacaagcat ctgacggtagaagagttatttggaacctctttgccaaaggaacaaccagcagttgtgggt ctggattcagaagaaatggagaggttgccaggagatgcctcccagaaagagcccaattca ttcctaccatttccctttgagcagttaggaggagcccctcaatcagaaaccctgggtgtc ccttctgctgcccaccattcagtccagcctgaaatcaccaccccggtgctaatcactcca gcctccatcacacagtccaatgaaaagcatgctccaacctacacaatcccgttgagccct gttctcagtcccactctgccagctgaagctcctactgcacaggttccccccagcttacct cgaaacagcaccatgatgcaggcagtgaagaccacgcctagacagaggtctccactcctg aaccagccagtccctgagctaagccatgccagtctgattgccaaccagagccccttcagg gccccattgaacgtgacgaacacagctggcacatccctcccaagcgttgatcttctccag aaactcaggttgaccccacagcatgaccaaatacagacacaaccacttgggaaaggtgca atggtagccagcttttctccggcagctggtcagctagccacacctgagagcttcatagag cctccctctaagacagcagcagcaagagtggcggcctcagcctccctgagcaacatggtg cttgctccccttcagtctatgcagcagaaccaggatcctgaagtatttgtgcagcctaag gtgttatccagtgccatcccgaagatggaaaaagcattaaatctcgtggaagacatgaac agaaaacgtgttctgattgacagtatgttgcaccaaaaagcattgagcctatacaaagac ctaaaagatctcttgaaatggggacaccaagccatttactgcaagaaagggatggttaca cagatttcaggaatacagaaggtcaacagtagcctaatgctatatcacaatgcctacgcc atcacctcacttctcatcatgtag