GENSCAN 1.0 Date run: 8-Nov-116 Time: 08:28:35 Sequence gi568815587r:123705411_123906349 : 200939 bp : 38.84% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 510 505 6 1.05 1.07 Term - 16340 16164 177 2 0 100 55 167 0.808 11.30 1.06 Intr - 21581 20710 872 0 2 107 -3 740 0.170 56.92 1.05 Intr - 22185 22066 120 1 0 41 94 104 0.876 5.85 1.04 Intr - 22912 22723 190 1 1 30 47 234 0.681 11.74 1.03 Intr - 23804 23644 161 0 2 108 72 134 0.820 12.59 1.02 Intr - 24499 24205 295 1 1 54 44 314 0.762 19.26 1.01 Init - 25478 25077 402 2 0 66 68 435 0.955 35.97 1.00 Prom - 28332 28293 40 -5.75 2.00 Prom + 37372 37411 40 -4.45 2.01 Init + 38467 38490 24 0 0 93 67 48 0.043 2.78 2.02 Term + 48524 48871 348 1 0 3 43 484 0.101 28.90 2.03 PlyA + 49028 49033 6 1.05 3.00 Prom + 64233 64272 40 -3.25 3.01 Init + 92502 92582 81 2 0 69 115 38 0.478 5.62 3.02 Intr + 95362 95413 52 0 1 83 105 37 0.325 2.46 3.03 Term + 96776 96867 92 0 2 54 42 64 0.240 -4.70 3.04 PlyA + 96997 97002 6 1.05 4.04 PlyA - 98048 98043 6 1.05 4.03 Term - 100969 99998 972 1 0 123 47 520 0.468 42.46 4.02 Intr - 122348 121930 419 0 2 42 20 177 0.013 -0.88 4.01 Init - 123617 123191 427 2 1 81 53 246 0.759 16.91 4.00 Prom - 131329 131290 40 -3.65 5.02 PlyA - 132485 132480 6 1.05 5.01 Sngl - 136138 135545 594 1 0 69 41 375 0.355 26.84 5.00 Prom - 137720 137681 40 -3.45 6.07 PlyA - 138092 138087 6 1.05 6.06 Term - 149651 149241 411 2 0 94 41 212 0.611 11.36 6.05 Intr - 149793 149706 88 2 1 19 95 53 0.892 -1.85 6.04 Intr - 152102 151991 112 0 1 100 90 58 0.884 5.72 6.03 Intr - 153380 153265 116 1 2 45 67 75 0.619 0.27 6.02 Intr - 156886 156715 172 2 1 105 7 111 0.124 2.78 6.01 Init - 163407 163275 133 0 1 92 40 109 0.736 6.85 6.00 Prom - 164028 163989 40 -3.95 7.00 Prom + 171305 171344 40 -4.55 7.01 Init + 176824 177011 188 0 2 70 16 81 0.288 -2.32 7.02 Intr + 177773 177896 124 2 1 78 92 108 0.341 9.87 7.03 Term + 180210 180269 60 2 0 97 46 64 0.377 -0.07 7.04 PlyA + 180634 180639 6 1.05 8.03 PlyA - 180961 180956 6 1.05 8.02 Term - 192934 192780 155 2 2 93 40 108 0.675 3.60 8.01 Init - 200711 200669 43 2 1 67 71 59 0.512 2.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:123705411_123906349|GENSCAN_predicted_peptide_1|738_aa MATAVEPEDQDLWEEEGILMVKLEDDFTCRPESVLQRDDPVLETSHQNFRRFRYQEAASP REALIRLRELCHQWLRPERRTKEQILELLVLEQFLTVLPGELQSWVRGQRPESGEEAVTL VEGLQKQPRRPRRWASSPKISSRDNQELPPDSMVTGSWNYSQVTVHVHGQEVLSEETVHL GVEPESPNELQDPVQSSTPEQSPEETTQSPDLGAPAEQRPHQEEELQTLQESEVPVPEDP DLPAERSSGDSEMVALLTALSQVCPSYLCTTENLFEEPLGISHTKQGQNTRLPVASRFDE ILMWFQGLVTFKDVAVCFSQDQWSDLDPTQKEFYGEYVLEEDCGIVVSLSFPIPRPDEIS QVREEEPWVPDIQEPQETQEPEILSFTYTGDRSKDEEECLEQEDLSLEDIHRPVLGEPEI HQTPDWEIVFEDNPGRLNERRFGTNISQVNSFVNLRETTPVHPLLGRHHDCSVCGKSFTC NSHLVRHLRTHTGEKPYKCMECGKSYTRSSHLARHQKVHKMNAPYKYPLNRKNLEETSPV TQAERTPSVEKPYRCDDCGKHFRWTSDLVRHQRTHTGEKPFFCTICGKSFSQKSVLTTHQ RIHLGGKPYLCGECGEDFSEHRRYLAHRKTHAAEELYLCSECGRCFTHSAAFAKHLRGHA SVRPCRCNECGKSFSRRDHLLYISPRQVQGTTNEKISGRGLQQLLGAWVHLRNLGNRSTE KQFHGKRAVGQLNNPSKQ >gi568815587r:123705411_123906349|GENSCAN_predicted_CDS_1|2217_bp atggctacagccgtggaaccagaggaccaggatctttgggaagaagagggaattctgatg gtgaaactggaagatgatttcacctgtcggccagagtctgtcttacagagggatgacccg gtgctggaaacctcccaccagaacttccgacgcttccgctaccaggaggcagcaagccct agagaagctctcatcagactccgagaactttgtcaccagtggctgagaccagagaggcgg acaaaggagcagatcctagagctgcttgtgctggaacaatttcttaccgtcctacctgga gaactacagagctgggtgcggggccaacggccagaaagtggcgaggaggcagtgacgctg gtggagggtttgcagaaacaacccaggagaccaaggcggtgggcatcttctcctaaaata agctcccgtgacaaccaagaacttcctcctgactccatggtgactggaagttggaattat tcccaggtgactgtccatgttcacggccaggaagtcctgtcagaggagacggtgcattta ggagtggagcctgagtcacctaatgagctgcaggatcctgtgcaaagctcgacccccgag cagtctcctgaggaaaccacacagagcccagatctgggggcaccggcagagcagcgtcca caccaggaagaggagctccagaccctgcaggagagcgaggtcccagtgcccgaggaccca gaccttcctgcagagaggagctctggagactcagagatggttgctcttcttactgctctg tcacaggtgtgccctagttacctctgtaccacagagaatttgtttgaagaaccactgggc ataagccatactaaacagggacaaaatacgaggctacccgtagcatcacgttttgatgaa atccttatgtggtttcagggactggtaacgttcaaggatgtggccgtatgcttttcccag gaccagtggagtgatctggacccaacacagaaagagttctatggagaatatgtcttggaa gaagactgtggaattgttgtctctctgtcatttccaatccccagacctgatgagatctcc caggttagagaggaagagccttgggtcccagatatccaagagcctcaggagactcaagag ccagaaatcctgagttttacctacacaggagataggagtaaagatgaggaagagtgtctg gagcaggaagatctgagtttggaggatatacacaggcctgttttgggagaaccagaaatt caccagactccagattgggaaatagtctttgaggacaatccaggtagacttaatgaaaga agatttggtactaatatttctcaagtgaatagttttgtgaaccttcgggaaactacaccc gtccaccccctgttagggaggcatcatgactgttctgtgtgtggaaagagcttcacttgt aactcccaccttgttagacacctgaggactcacacaggagagaaaccctataaatgtatg gaatgtggaaaaagttacacacgaagctcacatcttgccaggcaccaaaaggttcacaag atgaacgcgccttacaaatatcccctaaaccggaagaatttggaagagacctcccctgtg acacaggctgagagaactccatcagtggagaaaccctatagatgtgatgattgcggaaag cacttccgctggacttcagaccttgtcagacatcagaggacacatactggagaaaaaccc ttcttttgtactatttgtggcaaaagcttcagccagaaatctgtgttaacaacacaccaa agaatccacctgggaggcaaaccctacttgtgtggagagtgtggtgaggacttcagtgaa cacaggcggtacctggcgcaccggaagacgcacgctgctgaggaactctacctctgcagc gagtgcgggcgctgcttcacccacagcgcagcgttcgccaagcacttgagaggacacgcc tcagtgaggccctgccgatgcaacgaatgtgggaagagcttcagtcgcagggaccacctc ttgtacatctctcctagacaagtccaaggaactactaacgagaagatttcaggaagaggc ctacagcaattgcttggtgcttgggttcatttgcggaatcttggcaacaggtctacagag aagcagttccacggcaaaagagctgtggggcagttgaataatccatccaaacaatga >gi568815587r:123705411_123906349|GENSCAN_predicted_peptide_2|123_aa MEDEDTDQEFQNAGVYAGGFQTGPNITVEMTDNIIATEWQLDEQHRLTKDNGEAHHPGAQ GQLQAEFAGHDGGVVKGIADGEVAVKRHDSEDQELGGAHEEVEEGLQQAAGHADYCSCHY KGS >gi568815587r:123705411_123906349|GENSCAN_predicted_CDS_2|372_bp atggaggatgaggacactgaccaggagttccaaaatgctggtgtctatgcaggcggcttt caaactgggcccaacatcacagtagaaatgactgataacattattgccacagaatggcaa ctggatgagcagcatcgtctgacaaaagacaatggtgaagcccaccacccaggagctcag ggccagctgcaggcagagtttgctggtcatgatggtggggtggtgaaggggattgcagat ggtgaggtagcggtcaaaagacatgatagtgaggatcaagaactcggtggtgcccacgaa gaagtggaagaaggcctgcagcaggcagcaggacatgcagattactgttcttgccactac aaaggttcctag >gi568815587r:123705411_123906349|GENSCAN_predicted_peptide_3|74_aa MRKFLKGTKQGHSGSSCFRPALPDGLQLCNTVNLLIASPLIGKGDWCGQEEIRSHLLPVI LCAKERRMNENYLL >gi568815587r:123705411_123906349|GENSCAN_predicted_CDS_3|225_bp atgaggaaatttttgaagggcaccaagcaaggacattcaggcagctcatgcttcaggcct gcactccctgatggcttacagttgtgcaacacagtgaacttgctaattgcttctcctctg attggaaaaggagattggtgtggccaagaagaaattcggtcacatctcctacctgttatc ctgtgcgcaaaggaaagaagaatgaatgaaaactatcttttatag >gi568815587r:123705411_123906349|GENSCAN_predicted_peptide_4|605_aa MARELRDACTSFNSRCNQVEEKVSVIEHQINEIKQEDKVREKRVKRNEQSLQEIWDCVKR PNLCLIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSRREIPR HITVRFTKVEIKEKMLRAAREKDRSMRQKVNKDIQDLNSALHQADLIDIYRTLHPKSSEY TFFSAPHCTYSKIGHIIGSEALLSKCKRTEITTNRLSDHSAIILELRTKKLTQNRTTTWK LNNLLLNDYWVNNEMKAGIKIFFETNENKDTTYQNLWDTFKAGNPAHHIVVVMGNWSTVT EITLIAFPALLEIRISLFVVLVVTYTLTATGNITIISLIWIDHRLQTPMYFFLSNLSFLD ILYTTVITPKLLACLLGEEKTISFAGCMIQTYFYFFLGTVEFILLAVMSFDRYMAICDPL HYTVIMNSRACLLLVLGCWVGAFLSVLFPTIVVTRLPYCRKEINHFFCDIAPLLQVACIN THLIEKINFLLSALVILSSLAFTTGSYVYIISTILRIPSTQGRQKAFSTCASHITVVSIA HGSNIFVYVRPNQNSSLDYDKVAAVLITVVTPLLNPFIYSLRNEKVQEVLRETVNRIMTL IQRKT >gi568815587r:123705411_123906349|GENSCAN_predicted_CDS_4|1818_bp atggcacgagaacttcgtgacgcatgcacaagcttcaatagccgatgcaatcaagtggaa gaaaaggtatcagtgattgaacatcaaattaatgaaataaagcaagaagataaagttaga gaaaaaagagtaaaaagaaatgaacaaagcctccaagaaatatgggactgtgtgaaaaga ccaaatctatgtttgattggtgtacctgaaagtgatggggagaatggaaccaagttggaa aacactcttcaggatattatccaggagaacttccccaacctagcaaggcaggccaacatt caaattcaggaaatacagagaacaccacaaagatactcctcgagaagagaaatcccaaga cacataactgtcagattcaccaaggttgaaatcaaggaaaaaatgttaagggcagccaga gagaaagacagatcgatgagacagaaggttaataaggatatccaggacttgaactcagct ctgcaccaagcagacctaatagacatctatagaactctccaccccaaatcatcagaatat acattcttctcagcaccacattgcacctattctaaaattggccacataattggaagtgaa gcactcctcagcaaatgtaaaagaacagaaatcacaacaaaccgtctctcagaccacagt gcaatcatattagaactcaggactaagaaactcactcaaaaccgcacaactacatggaaa ctgaacaacctgctcctgaatgactactgggtgaataatgaaatgaaggcaggaataaag attttctttgaaaccaatgaaaacaaagacacaacataccagaatctctgggacacattt aaagcaggaaaccctgcccaccatatagtagttgtcatgggaaactggagcactgtgact gaaatcaccctaattgccttcccagctctcctggagattcgaatatctctcttcgtggtt cttgtggtaacttacacattaacagcaacaggaaacatcaccatcatctccctgatatgg attgatcatcgcctgcaaactccaatgtacttcttcctcagtaatttgtcctttctggat atcttatacaccactgtcattaccccaaagttgttggcctgcctcctaggagaagagaaa accatatcttttgctggttgcatgatccaaacatatttctacttctttctggggacggtg gagtttatcctcttggcggtgatgtcctttgaccgctacatggctatctgcgacccactg cactacacggtcatcatgaacagcagggcctgccttctgctggttctgggatgctgggtg ggagccttcctgtctgtgttgtttccaaccattgtagtgacaaggctaccttactgtagg aaagaaattaatcatttcttctgtgacattgcccctcttcttcaggtggcctgtataaat actcacctcattgagaagataaactttctcctctctgcccttgtcatcctgagctccctg gcattcactactgggtcctacgtgtacataatttctaccatcctgcgtatcccctccacc cagggccgtcagaaagctttttctacctgtgcttctcacatcactgttgtctccattgcc cacgggagcaacatctttgtgtatgtgagacccaatcagaactcctcactggattatgac aaggtggccgctgtcctcatcacagtggtgacccctctcctgaacccttttatctacagc ttgaggaatgagaaggtacaggaagtgttgagagagacagtgaacagaatcatgaccttg atacaaaggaaaacttga >gi568815587r:123705411_123906349|GENSCAN_predicted_peptide_5|197_aa MSFDCYVAICDPLHYTIIMNSRACLLLVLGCWVGAFLSVLCPTIVVSRLPFCYKEISHFF CDITPLLHVSCIDTHFIEMINFLLSSLILLTSLVLTTVSYIYIISTILHIPSAQGRRKAF STCASHITVISIAYISNIFRYVRPSQSHSMGFDKVTAVPTMVTPLLNPFTYSLRNEKVKA VLKEAVSKIMSSWHRRT >gi568815587r:123705411_123906349|GENSCAN_predicted_CDS_5|594_bp atgtcctttgactgctacgtggccatctgtgaccccctgcactacaccattatcatgaac agcagggcctgcctcctactagttctgggctgctgggttggagccttcctgtctgtgttg tgcccaaccattgtggtgtccagattgcctttctgttacaaggaaattagtcacttcttc tgtgacatcacccctctgctacatgtgtcctgtatagacactcatttcatcgagatgata aacttcctcttatcttccctcatcctcctgacctcactggtgctcaccactgtgtcctac atctacatcatttctaccatcctgcacatcccctcagcccaaggacgtcggaaggccttt tccacgtgcgcttcccacatcaccgtcatttccatcgcttatataagcaacatcttcagg tatgtgaggcccagccagagtcattcaatgggttttgacaaggtgacagctgtccccaca atggtgacccctcttctgaatcccttcacttatagtctaagaaatgaaaaggtaaaggca gtcttgaaagaagcagtcagcaaaattatgtcctcatggcacaggagaacttaa >gi568815587r:123705411_123906349|GENSCAN_predicted_peptide_6|343_aa MDAELRIALTVETKSVPFLINTEATHSTLPSFQESVSLASITVVGTVAFIPLAVTSFKHC MATCDPLCSTIIAKSRACLLLALGCWMGTFLAVLRLTIVVSRPVSQPVCTAVVWIGGEVM FKCKEEVAFTWIQQKAYRPIGPRGSEARIQCAQQLGELPEIPVSLTSLKALRGLRMIGAE DPPSKEKWVGVPVKEVVWLKKQSVHHLAGMAESTEPQIWWSSLLPGTPCQGEIGVLSVEQ RTHAVRRTMSGPRLKKQSGPDLARPLCCTVGAPIHLDCMRSPQPASWNGCFPLNHKDGGR PSLQEQGLVSSRLNPLPLAGQILTQWVLTHEVLWKWGLQNDAA >gi568815587r:123705411_123906349|GENSCAN_predicted_CDS_6|1032_bp atggatgccgagcttcgaatagctctcacagtggaaactaagtccgtccccttcttaatc aatacggaggctacccactccacattaccttcttttcaagagtctgtttcccttgcctcc ataactgttgtagggacagtggcgtttatccccttggcagtgacatccttcaaacactgc atggcaacctgtgaccccctgtgcagcaccatcattgcaaaaagcagggcctgcctcctg ctggctctgggatgctggatgggaaccttcctggctgtgttgcgcctgactattgtggtg tccagacccgtatcccaaccagtttgtacagcggtggtctggattggaggtgaagttatg ttcaagtgcaaagaggaagttgctttcacttggatacaacaaaaggcctataggcctata ggaccaagagggagtgaagctcgcatccaatgtgcccagcagttaggggagcttcctgag atacctgtttctctcacttccctcaaagcattgagaggacttagaatgataggagctgag gacccacccagtaaggagaagtgggtcggggtcccggttaaagaagtagtctggctaaag aagcagtctgtccaccatctggctgggatggctgagtctacagaaccacagatatggtgg tcatccctcctaccaggaactccctgccagggagagatcggagttctgtctgtggaacaa aggacccacgcagtaaggaggactatgtcagggcccaggttaaagaagcagtctggcccc gatctggcaaggccattgtgctgcaccgtgggggcccctattcatctggactgtatgcgt tctccacagccagcaagctggaatggctgtttccctctgaaccataaagatggtggccgt ccttcccttcaggaacaaggtcttgtctccagccgacttaaccctctgcccttggctggc cagattctaacccagtgggtcttaacccatgaggtactatggaagtggggcctacagaac gatgctgcttga >gi568815587r:123705411_123906349|GENSCAN_predicted_peptide_7|123_aa MFLGIYPKKLKAYVHTKTCTQVFITVLFITAQTWKQPRYPSVGEIDKPTVVQSYDGILLS PTKEFTVCARTMLRGIAVHSGNEISSIEFSDSLHSLSDLWMFRQGQPAVRTANCHGIIYW FDP >gi568815587r:123705411_123906349|GENSCAN_predicted_CDS_7|372_bp atgttccttggtatttacccaaaaaagttgaaagcttatgttcacacaaaaacctgcaca caggtgtttataacagttttattcataactgcccaaacttggaaacaaccaagatatcct tcagtaggtgaaatagataaaccaactgtggtacagtcttacgatggaatattacttagc cccacaaaggaattcacagtgtgtgcacggacaatgctacgaggcattgcagtgcattct ggtaatgaaatatcttcgatagaattctcagattccttacattcgttgtcagatttatgg atgttcaggcagggccagcctgctgtcaggactgccaactgccatggaatcatctattgg tttgacccttga >gi568815587r:123705411_123906349|GENSCAN_predicted_peptide_8|65_aa MPATGVKEFSKTVVEGCQEQLFQLRKAGDSTALETRFLAMNQTTKASYSEDCQHMASQLQ AKEYF >gi568815587r:123705411_123906349|GENSCAN_predicted_CDS_8|198_bp atgccagccactggagtaaaagaatttagcaagacagttgtagaaggatgtcaggaacag ctgtttcagctacgcaaagcaggagatagtactgccttggaaacacgtttcttggccatg aatcagactaccaaggcctcatatagtgaggactgccagcatatggcctctcaactccaa gccaaagaatatttctag