GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:22:11 Sequence gi568815575r:103173946_103374563 : 200618 bp : 39.11% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 23956 24009 54 0 0 85 108 45 0.120 7.53 1.02 Term + 32541 32810 270 2 0 -2 38 233 0.047 3.80 1.03 PlyA + 34552 34557 6 1.05 2.00 Prom + 34792 34831 40 -6.05 2.01 Init + 41397 41570 174 2 0 83 37 270 0.745 18.79 2.02 Intr + 41744 41819 76 1 1 38 67 69 0.722 -2.03 2.03 Intr + 42204 42453 250 1 1 102 76 309 0.524 26.67 2.04 Term + 47321 47486 166 2 1 66 49 254 0.748 15.71 2.05 PlyA + 48207 48212 6 1.05 3.07 PlyA - 52591 52586 6 1.05 3.06 Term - 67970 67654 317 0 2 68 42 191 0.883 6.62 3.05 Intr - 69303 69191 113 2 2 11 90 59 0.113 -2.50 3.04 Intr - 76141 75990 152 1 2 22 42 170 0.123 3.84 3.03 Intr - 76719 76590 130 2 1 62 103 75 0.711 6.18 3.02 Intr - 80732 80619 114 1 0 97 39 98 0.197 4.44 3.01 Init - 90612 90557 56 0 2 74 95 48 0.319 4.91 3.00 Prom - 92793 92754 40 -3.45 4.07 PlyA - 94120 94115 6 1.05 4.06 Term - 100645 99998 648 1 0 123 35 548 0.729 46.09 4.05 Intr - 101036 100974 63 2 0 77 69 72 0.811 2.20 4.04 Intr - 101214 101159 56 1 2 106 81 30 0.989 1.88 4.03 Intr - 101397 101292 106 0 1 143 -4 141 0.590 9.27 4.02 Intr - 102824 102534 291 2 0 13 45 290 0.224 13.61 4.01 Init - 107086 106760 327 1 0 65 68 121 0.267 5.17 4.00 Prom - 107140 107101 40 -1.35 5.00 Prom + 111002 111041 40 -6.15 5.01 Sngl + 111463 111984 522 0 0 42 40 266 0.471 13.00 5.02 PlyA + 112062 112067 6 -0.45 6.00 Prom + 112284 112323 40 -11.44 6.01 Init + 112640 114045 1406 1 2 68 53 343 0.133 19.03 6.02 Intr + 119274 119388 115 0 1 83 22 68 0.073 -0.77 6.03 Term + 119836 120015 180 0 0 55 47 115 0.370 0.73 6.04 PlyA + 122196 122201 6 1.05 7.00 Prom + 131802 131841 40 -3.65 7.01 Init + 131926 132049 124 0 1 78 105 59 0.696 6.98 7.02 Intr + 132509 132538 30 0 0 105 115 1 0.523 1.68 7.03 Term + 133675 133703 29 2 2 66 48 43 0.326 -4.64 7.04 PlyA + 135243 135248 6 1.05 8.04 PlyA - 135422 135417 6 1.05 8.03 Term - 136036 135645 392 2 2 84 41 397 0.876 28.76 8.02 Intr - 136488 136413 76 0 1 109 82 15 0.677 1.17 8.01 Init - 137077 136772 306 1 0 63 30 279 0.548 16.94 8.00 Prom - 145109 145070 40 -6.75 9.00 Prom + 151080 151119 40 -2.65 9.01 Sngl + 157459 157761 303 0 0 60 32 217 0.780 8.88 9.02 PlyA + 158283 158288 6 1.05 10.00 Prom + 161904 161943 40 -2.85 10.01 Init + 182226 182334 109 2 1 77 53 68 0.926 2.63 10.02 Intr + 183138 183216 79 1 1 92 96 85 0.990 7.39 10.03 Term + 183670 184054 385 1 1 114 29 271 0.889 17.18 10.04 PlyA + 184241 184246 6 1.05 11.03 PlyA - 184544 184539 6 1.05 11.02 Term - 186317 186222 96 2 0 33 48 79 0.268 -4.61 11.01 Init - 192168 192034 135 0 0 71 116 93 0.656 10.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_1|107_aa MDISHHPVIYDLNRIPFQSLVVSLGIDNHQKTWFLEGCLDLVSEVSRSEVARNRSGYTGS SKLQHSLLASIPRGYDTDISRVFTGNNGMNCHQKLLPGPSQGKVMAY >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_1|324_bp atggatattagtcaccaccctgtgatctatgacctcaaccggattccattccaaagcctg gtggtcagccttgggattgataaccaccagaagacatggttcctggaaggctgcttggat ctggttagtgaagtttccaggagtgaagtggccagaaataggagtggctacactggcagc agcaaacttcagcatagcctgctggccagtattcctagaggatatgacactgacatcagc agggttttcactggcaacaatggcatgaactgccaccagaagcttctcccaggtccttct caaggaaaagtgatggcatactga >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_2|221_aa MQPPGPAAGPRAFLERRGARPGGSRRSWGYDPSETPAYAISASFSDTPAPHFLVQKMVVC GAKCRGGAPRVKNPEEETARIGPGVMESKEELAANNLNGENAQQENEGGEQAPTQNEEES RHLGGGEGQKPGGNIRRGRVRRLVPNFRWAIPNRHIEHNEARDDVESRAVNASRCCEGPH RGQQHNTHQYPVAADECAPYYAVAADATATYEQGWIPLLPP >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_2|666_bp atgcagcccccgggccccgcggcgggcccgcgagccttccttgagcggagaggtgcccgg cccggagggagccggcggtcctggggctacgacccttcggaaacacctgcctacgccatc agcgcaagcttttccgacacccctgccccgcacttcttggtgcagaaaatggtggtctgc ggggctaagtgtcgcggcggcgcacctcgcgtcaagaatccggaggaggagactgcaagg ataggcccaggagtaatggagtccaaagaggaactagcggcaaacaatctcaacggggaa aatgcccaacaagaaaacgaaggaggggagcaggcccccacgcagaatgaagaagaatcc cgccatttgggagggggtgaaggccagaagcctggaggaaatatcaggcgggggcgagtt aggcgacttgtccctaattttcgatgggccatacctaataggcatattgagcacaatgaa gcgagagatgatgtagaaagccgcgctgtcaatgctagccgctgctgcgaaggcccgcac agaggccagcaacacaacacccatcagtaccctgtcgcagcagatgagtgtgcaccctac tatgctgtagctgcagatgctactgccacatacgaacaaggatggatcccactgctaccg ccctag >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_3|293_aa MIIYAENPKEFLKESRTLRIFGFLGATQIEVREERGKSPESLQVSFCKGAIASGLFGLPE GPRCALACLVGSARTLALCLPGAEHLMPEPAQALKHRRPQPPDIRLLGQTENVLSTTIIA VNSIIIIIIIIIIIIKGWCSVNIQWILVGDREGQTLTFIGDAICGPSRLAKTRVTGWCAL RHCSQRGGGVIAVKEEKGKTTFDMENSHQENEESVHNVEEELHLEEMEGQEARGNNLQEQ APPTQEDGDGLPHRHVNNNEGRGRKRMRRRKEMGRRRKRRRKRRRRKVSRRDF >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_3|882_bp atgattatctatgcagaaaatcctaaagaattcttgaaagagtctagaactcttaggata tttggttttttgggcgcgacacaaatcgaggtgagggaagagagaggaaaatcccctgaa tccctgcaggtcagtttttgtaaaggtgcaattgcttcaggtctcttcggactcccagaa gggcctcgatgtgccctggcctgtttggttggttctgcaaggactttagctctgtgtctt cctggtgcagagcatctgatgcctgagccagcccaggccctgaagcacaggagaccccag cccccagacataagactcttaggtcaaacagaaaatgtcctctctaccaccatcattgct gtcaacagcatcatcatcatcatcatcatcatcatcatcatcatcaagggatggtgttca gtgaatattcaatggattctggttggggacagagaagggcaaacactaaccttcatcggt gatgccatttgtggcccctcaagattggcgaagaccagagtgacagggtggtgtgcactg aggcactgttcacagaggggtggaggagtaattgcagtcaaagaggaaaagggaaaaaca actttcgacatggaaaattcccaccaggaaaatgaagaaagtgttcataacgtagaggaa gagctacatttggaagaaatggaaggccaggaagctagaggaaataatctccaggaacaa gcaccacctacccaggaagatggagatggcttgccccataggcatgtcaataacaatgaa gggagagggaggaagaggatgaggaggaggaaggagatggggaggaggagaaagaggagg agaaagaggaggagaagaaaagtgtccagaagagatttttaa >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_4|496_aa MGTGVMTKTPKAIASRAKIDKRDLIKHRSLCMAKETINRINRQPTEWEKILANYASDKGL ISSKYKGLKQICKRKNNPIKKWAKDMNRHFKRRHTCKLNITVESSVSIPKSLSSRYQIPV SVEEGRLQTEGQRRQLPYSPAGRYKPGTPEVGMVMHIGYRGSDLLSRYRPVPPPFILAQK FFCVGQEVWVPAFISLEKCRLLAEGNVFKSVVVGTNPDKKRKDLQYRCRSVGEGYCTRTL GISGYMTAVKTLRGYGDLWLFYKSYNCMQGEKEENCPGFLQERQRREHLNMEKLYKENEG KPENERNLESEGKPEDEGSTEDEGKSDEEEKPDMEGKTECEGKREDEGEPGDEGQLEDEG NQEKQGKSEGEDKPQSEGKPASQAKPESQPRAAEKRPAEDYVPRKAKRKTDRGTDDSPKD SQEDLQERHLSSEEMMRECGDVSRAQEELRKKQKMGGFHWMQRDVQDPFAPRGQRGVRGV RGGGRGQKDLEDVPYV >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_4|1491_bp atgggcacaggtgtcatgacaaagacaccaaaagcaattgcatcaagagcaaaaattgac aaacgagatttaattaaacataggagcttgtgtatggcaaaagaaactatcaacagaata aacagacaacctacagaatgggagaaaatattagcaaactatgcatctgacaaaggtcta atatcgagcaagtataagggacttaaacaaatttgcaagaggaaaaacaaccccattaaa aagtgggcaaaggacatgaacagacacttcaaaagaagacatacatgcaagctcaatatc actgtggaaagcagtgtgtcgattcccaagtccctctcctccaggtaccagatccctgtt tccgtggaggaaggcagacttcagactgaaggacagagaaggcaactgccctacagccct gcaggtcggtacaagccggggacccctgaggtagggatggtaatgcacataggctacaga ggatcggatctgctttctagataccgccctgttccacccccattcattttggcgcagaaa tttttttgtgttgggcaagaggtatgggtaccagcttttatttccctggagaagtgcagg cttctggcagaaggaaatgtcttcaagtctgtggttgtgggcactaacccagacaagaaa aggaaagacctgcagtatcggtgcaggtcagtgggagagggctactgcaccaggaccctt gggatttcgggatacatgactgctgtcaagaccctaaggggatatggtgatctatggctg ttttataagtcttacaactgcatgcaaggggaaaaagaagaaaactgcccaggatttctg caggaaaggcaaagaagggaacatctcaacatggaaaagctctacaaagaaaatgaagga aagccagagaatgaaagaaacctagaaagtgagggaaagccagaggatgagggaagtaca gaagatgaaggaaagtcagacgaggaagaaaagccggacatggaggggaagacagaatgc gagggaaagcgagaggatgagggagagccaggtgatgagggacaactggaagatgaggga aaccaggaaaagcagggcaagtctgaaggtgaggacaagccacaaagtgagggcaagcca gcctcccaggccaagccagagagccagccgcgggccgccgaaaagcgcccggctgaagat tatgtgccccggaaagcaaaaagaaaaaccgacagggggacggacgattcccccaaggac tctcaggaggacttacaagaaaggcatctgagcagtgaggagatgatgagagaatgtgga gatgtgtcaagggctcaggaggagctaaggaaaaaacagaaaatgggtggttttcattgg atgcaaagagatgtacaggatccattcgccccaaggggccaacggggtgtgaggggagtg aggggcggaggtaggggccagaaagacttagaagatgtcccatatgtttaa >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_5|173_aa MKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELE KQEQTHSKASRRQEITKIRAEQKEIETQKTLQKINESRSWFFEKINKIDRPLARLMKKRE KNQIDAIKNDKGDITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTLSQD >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_5|522_bp atgaaagcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaagaactagag aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaagatcagagca gaacagaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagaccgctagcaagattaatgaagaaaagagag aagaatcaaatagatgcaataaaaaatgataaaggggatatcaccaccaatcccacagaa atacaaactaccatcagagaatactataaacacctctatgcaaataaactagaaaatctg gaagaaatggataaattcctcgacacactctcccaagactaa >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_6|566_aa MPSLTTPIQHSVGSSGQGNQAGERNKRYSIRKEEVKLSLFADDMIVYLENPIVSAQNLLK LISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDL FKENYKPLLNEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMTFFTEL EKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNKDIDQWN RTEPSEIMSHIYNYPIFDKPDKNKKWGNNSLFNKWCWENWLAICRKLKLDPFLTPYTKIN SRWIKDLHVRPKTIKTLEENLSNTIQDIGMGKDFMSKTPKAMATKAKIDQWDLIKLKSFC TAKETTIRVNRQPTEWEKIFATHSSDKGLLSRIYNELKQIYKKKTDNPINKWAKDMNRHF SKEDIYAAKRHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRWNISEDPGLAL SCIKEASITVPSLRCAVSIIQGILSNWEDVINVMPWLYSWRKLGADKILSIKIVHNDKAI EIPHEKLNKGVSYLFEREGKLQLRGY >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_6|1701_bp atgccctctctcaccactcctattcaacatagtgttggaagttctggccagggcaatcag gcaggagaaagaaataaaaggtattcgattaggaaagaggaagtcaaattgtccctgttt gcagatgacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaag ctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagca ttcttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcaca attgcttcaaagagaataaaatacctaggaattcaacttacaagggacgtgaaggacctc ttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaag aacattccatgctcatgggtaggaagaatcaatatcatgaaaatggccatactgcccaag gtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattg gaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatc ctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaag gctacagtaaccaaaacagcatggtactggtaccaaaacaaagatatagaccaatggaac agaacagagccctcagaaataatgtcacatatctacaactatccgatctttgacaaacct gacaaaaacaagaaatggggaaacaattccctatttaataaatggtgctgggaaaactgg ctagccatatgtagaaagctgaaactagatcccttccttacaccttatactaaaattaat tcaagatggattaaagacttacatgttagacctaaaaccataaaaaccctagaagaaaac ctaagcaataccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaa gcaatggcaacaaaagccaaaattgaccaatgggatctaattaaactaaagagcttctgc acagcaaaggaaactaccatcagagtgaacaggcaacctacagaatgggagaaaattttt gcaacccactcatctgacaaagggctactgtccagaatctacaatgaactcaaacaaatt tacaagaaaaaaacagacaaccccatcaacaagtgggcgaaggacatgaacagacacttc tcaaaagaagacatttatgcagccaaaagacacatgaaaaaatgctcatcatcactggcc atcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagttagaatggca atcattaaaaagtcaggaaacaacaggtggaatatatcagaggacccaggtttggctctg tcctgtatcaaggaggcctccatcactgtgccaagcttgcgatgtgctgtgtccattatc cagggcatattgagcaactgggaagatgttatcaatgtaatgccatggctgtattcctgg agaaagctgggtgcagataagatcttgtccataaaaattgtgcacaatgacaaagctatt gagatacctcatgagaagcttaataaaggtgtatcatatctctttgaaagggaaggcaaa ctgcagctgagaggctactga >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_7|60_aa MEYYAAIKKDEFMSFAGTWMKLETIILSKLSQEQKTTWTHGDNFALHSLIDAGANTDNQE >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_7|183_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctttgcagggacatggatg aagctggaaaccatcattctcagcaaactatcacaagaacagaaaaccacatggacacat ggagataactttgccctccactctctgatagatgctggtgctaacactgacaatcaagag taa >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_8|257_aa MTTATRDRPTLSLTSLWRQNRRRSLPSRQWYLFPVSLRTCGPGTAPKVGSGGGAGKGPGG GPWGWCGGAAPACGCPIPGQQRGDAAGPRCSPGAPPRDREPCVCGAKCCGDAPHVENREE ETARIGPGVMESKEERALNNLIVENVNQENDEKDEKEQVANKGEPLALPLNVSEYCVPRG NRRRFRVRQPILQYRWDIMHRLGEPQARMREENMERIGEEVRQLMEKLREKQLSHSLRAV STDPPHHDHHDEFCLMP >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_8|774_bp atgacaacagccacacgtgatcggccaacactgagtcttacctcgttgtggcgtcagaac cgccgtcgctcgctcccttctcggcagtggtacctgttcccggtgtccctgaggacgtgc gggccaggtacggccccgaaagtaggaagcggagggggagcaggtaagggacccggaggg ggtccctggggttggtgtgggggagcagccccggcctgcggatgccccatccccgggcag cagcgcggagacgcagccggtccacgatgcagccccggggccccgccgcgggaccgcgag ccttgtgtttgcggggccaagtgttgcggcgacgcacctcacgtcgagaatcgggaggag gagactgcaaggataggcccaggagtaatggagtccaaagaggaacgagcgttaaacaat ctcatcgtggaaaatgtcaaccaggaaaatgatgaaaaagatgaaaaggagcaagttgct aataaaggggagcccttggccctacctttgaatgttagtgaatactgtgtgcctagagga aaccgtaggcggttccgcgttaggcagcccatcctgcagtatagatgggacataatgcat aggcttggagagccacaggcaaggatgagagaggagaatatggaaaggattggggaggag gtgagacagctgatggaaaagctgagggaaaagcagttgagtcatagtttgcgggcagtc agcactgatccccctcaccatgaccatcacgatgagttttgccttatgccctga >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_9|100_aa MQKPCKENEGKPKCSVPKREEKRPYGEFERQQTEGNFRQRLLQSLEEFKEDIDYRHFKDE EMTREGDEMERCLEEIRGLRKKFRALHSNHRHSRDRPYPI >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_9|303_bp atgcaaaaaccctgcaaagaaaacgaaggaaagccaaagtgcagcgtgccaaagagggag gaaaaacgcccgtatggagaatttgaacgccagcaaacagaagggaattttagacagagg ctgcttcagtctctcgaagaatttaaagaggacatagactataggcattttaaagatgaa gaaatgacaagggagggagatgagatggaaaggtgtttggaagagataaggggtctgaga aagaaatttagggctctgcattctaaccataggcattctcgggaccgtccttatcccatt taa >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_10|190_aa MEARGGTTDGGVILGVEEWKRQFSKEVEKFSSSRKRVYLQGCCFQQDPKLEKEEEETDPI SARSHCIQRRISKKEKKEGREVDRYKMKSCQKMEGKPENESEPKHEEEPKPEEKPEEEEK LEEEAKAKGTFRERLIQSLQEFKEDIHNRHLSNEDMFREVDEIDEIRRVRNKLIVMRWKV NRNHPYPYLM >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_10|573_bp atggaggctagaggagggacgactgatggaggagttatcctgggagtggaagaatggaag agacaattctcgaaagaagtagaaaagttcagtagctccagaaaaagagtttatctgcag ggctgttgtttccagcaagacccaaagctagaaaaggaggaggaagaaactgacccgatc agtgccagaagtcattgtattcaaagaagaataagcaagaaagaaaagaaggaaggaaga gaggtagacagatacaagatgaaatcctgtcaaaaaatggaaggaaaaccagaaaatgag agtgaaccaaagcatgaggaagagccaaagcctgaggaaaagccagaagaggaggagaag ctagaggaggaggccaaagcaaaaggaacttttagagaaaggctgattcaatctctccag gagtttaaagaagatatacacaacaggcatttaagcaatgaagatatgtttagagaagtg gatgaaatagatgagataaggagagtcagaaacaaacttatagtgatgcgttggaaggtt aatcgaaaccatccttacccctatttaatgtag >gi568815575r:103173946_103374563|GENSCAN_predicted_peptide_11|76_aa MRKVKNEEKILKAAREKRLFTFKGTSIRLTAGSSAKKQNKKKLMKPYQQPLGCHHLEEWS GDMKSSQNGITEENDT >gi568815575r:103173946_103374563|GENSCAN_predicted_CDS_11|231_bp atgcggaaagtcaaaaacgaggagaaaattttgaaagcagcaagagaaaaacgactcttc acatttaagggaacttcaataagattaacagctggctcctcagcaaaaaaacaaaacaaa aaaaaattgatgaagccctaccagcaaccattaggctgccatcaccttgaagaatggtct ggagacatgaagagttcccaaaatggcatcacagaggaaaatgacacatga