GENSCAN 1.0 Date run: 8-Nov-116 Time: 08:15:34 Sequence gi568815587r:123653583_123854518 : 200936 bp : 39.81% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 47 570 524 0 2 79 98 279 0.495 19.74 1.02 Intr + 659 897 239 1 2 97 47 146 0.404 6.89 1.03 Intr + 8679 8751 73 0 1 104 89 35 0.084 3.69 1.04 Term + 9208 9333 126 0 0 99 39 126 0.093 6.10 1.05 PlyA + 11439 11444 6 1.05 2.06 PlyA - 11456 11451 6 1.05 2.05 Term - 11723 11554 170 0 2 100 36 98 0.290 2.86 2.04 Intr - 16085 16037 49 2 1 108 83 37 0.089 2.53 2.03 Intr - 18430 18352 79 0 1 70 87 44 0.090 1.13 2.02 Intr - 24716 24625 92 2 2 53 73 116 0.064 4.47 2.01 Init - 24840 24769 72 0 0 70 58 51 0.719 1.42 2.00 Prom - 46080 46041 40 -3.05 3.04 PlyA - 46197 46192 6 1.05 3.03 Term - 48814 48708 107 2 2 55 49 142 0.953 4.59 3.02 Intr - 50471 50289 183 0 0 12 89 137 0.755 5.04 3.01 Init - 59966 59726 241 2 1 71 11 126 0.057 1.28 3.00 Prom - 60527 60488 40 -4.05 4.08 PlyA - 60940 60935 6 1.05 4.07 Term - 68168 67992 177 2 0 100 55 167 0.808 11.30 4.06 Intr - 73409 72538 872 0 2 107 -3 740 0.170 56.92 4.05 Intr - 74013 73894 120 1 0 41 94 104 0.876 5.85 4.04 Intr - 74740 74551 190 1 1 30 47 234 0.681 11.74 4.03 Intr - 75632 75472 161 0 2 108 72 134 0.820 12.59 4.02 Intr - 76327 76033 295 1 1 54 44 314 0.762 19.26 4.01 Init - 77306 76905 402 2 0 66 68 435 0.955 35.97 4.00 Prom - 80160 80121 40 -5.75 5.00 Prom + 89200 89239 40 -4.45 5.01 Init + 90295 90318 24 0 0 93 67 48 0.043 2.78 5.02 Term + 100352 100699 348 1 0 3 43 484 0.101 28.90 5.03 PlyA + 100856 100861 6 1.05 6.00 Prom + 116061 116100 40 -3.25 6.01 Init + 144330 144410 81 2 0 69 115 38 0.478 5.62 6.02 Intr + 147190 147241 52 0 1 83 105 37 0.325 2.46 6.03 Term + 148604 148695 92 0 2 54 42 64 0.240 -4.70 6.04 PlyA + 148825 148830 6 1.05 7.04 PlyA - 149876 149871 6 1.05 7.03 Term - 152797 151826 972 1 0 123 47 520 0.468 42.46 7.02 Intr - 174176 173758 419 0 2 42 20 177 0.013 -0.88 7.01 Init - 175445 175019 427 2 1 81 53 246 0.759 16.91 7.00 Prom - 183157 183118 40 -3.65 8.04 PlyA - 184313 184308 6 1.05 8.03 Term - 188057 187373 685 1 1 77 41 369 0.016 23.21 8.02 Intr - 197659 197546 114 0 0 46 64 81 0.147 0.14 8.01 Init - 198975 198911 65 0 2 55 98 80 0.545 6.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 24716 24546 171 2 0 53 44 165 0.841 5.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:123653583_123854518|GENSCAN_predicted_peptide_1|320_aa XTHRGKPARALFCHQRHQCVIIVSIVTVTCIRHGEVLVLTPVDKHERSQGKQSIEGRHLL GLAASKATQRDSLGQGLRPLRFFRGNPLGRTAPRGRLSDRRKGRAPVPGEALGAFGAGWG LWALSDPTGPPQFEGADQPLRARSRRRSAPRTVASPPTPELQLPLAPLPRSFAPRDRLRS DWNPPSPAAPASSALTASPAGRDPTPEAAGAARLGYRMRSTSVPSTPALGSLGRCSGAGK EAAAPVRPAMSEVGGCTVQAVSGSTILGSGGWWPCLPGQAQHHVEAAKAWGWHPMKPQPE LYISPFQPQLEQLEHRTPSP >gi568815587r:123653583_123854518|GENSCAN_predicted_CDS_1|963_bp natacgcacagaggcaagccagccagagctcttttctgtcaccaacgacatcaatgtgtt attattgttagcattgttactgttacctgcatccggcatggcgaggtgctggtacttacc ccagtagataagcacgagagaagccaggggaaacaatctattgaaggcaggcatcttctg gggctggcggcttccaaggctacacagagagattccctcggtcaaggactgcgccctctc agattctttcgaggaaaccctttgggacgaactgcccccagggggcgactttctgaccga aggaagggccgagcaccggtgcctggggaggccctgggagcttttggagccgggtggggg ctttgggccctaagcgaccccactggacctccccagttcgagggagccgatcagccgctc cgcgcccgctctcgccggcgctcagcaccacggacagtcgcctccccgcccaccccggaa ctccagcttccactcgcgcccctgcctcgctcgtttgcgcccagggatcggttgcgttcc gactggaatcctcccagccccgcggctcctgcttcgtccgccctaactgcttctccagct gggcgtgaccccactcccgaagccgccggggccgcccgtctgggctacagaatgcgttcc acgtcggtcccgtccacgcccgccctcggctccctggggaggtgcagcggggccgggaag gaagcggcagctcccgttcggccggcgatgtcagaggttggggggtgcacagtgcaagct gtcagtggatctaccattctggggtctggaggatggtggccttgtcttccaggacaggct caacaccatgtggaagctgccaaggcttggggctggcaccccatgaagccacagcctgag ctctacatcagcccctttcagccacagctagagcagctggaacacaggacaccaagtccc tag >gi568815587r:123653583_123854518|GENSCAN_predicted_peptide_2|153_aa MQEIVRDNHVQQLKWGHKRETCGEPGAAVGHRGPQSGGQAECRAEEGQNHLRPTRGSQQA DKNSNSQIKVKMTPSLWAEGQRTAKHPITFNPVQQNIHSLVKEALLFPLFILGLVYKAFP GHTAAGKSGFMRVVTERNQRIENEEWVALPHIS >gi568815587r:123653583_123854518|GENSCAN_predicted_CDS_2|462_bp atgcaggaaattgtgagagacaatcatgtccagcagctaaagtggggacacaaaagagaa acttgcggagagcctggagctgctgttggtcatcggggcccacagtctggaggacaagct gaatgcagagcagaggagggccagaaccacctaagacccacaagagggagccaacaggct gacaagaacagcaattcccagatcaaggtcaaaatgacaccatcgctgtgggcagaaggg cagagaactgccaaacatccaataactttcaatccagtccagcaaaacatacatagccta gttaaagaggcccttctctttcctcttttcatcctggggttggtgtacaaggcctttcct ggccatacggctgctgggaaatctggtttcatgagagtggtgactgagagaaatcaaaga atagaaaatgaagaatgggtggctctgccacatataagttaa >gi568815587r:123653583_123854518|GENSCAN_predicted_peptide_3|176_aa MGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETIIRVNRQPTEWEKIFAIYPSDKGL ISRIYKEHKHIYKKKTTPSKNNSSSNVAQGSQKFGHPCSRGSKGEADPCLFQLLVAAGVA FPGCGCIAPNPSFTFTWHPASAERQPEAATAAAPISAFRPPPRGRGQGGHVAGKAT >gi568815587r:123653583_123854518|GENSCAN_predicted_CDS_3|531_bp atgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagtg aacagacaacctacagaatgggagaaaatttttgcaatctatccatctgacaaagggcta atatccagaatctacaaggaacataagcacatttacaagaaaaaaacaaccccatcaaaa aacaattcttcttccaatgtggcccagggaagccaaaagtttggacacccctgttctagg ggctcaaagggagaagctgatccttgcctcttccagcttctggtggctgctggtgtggca ttccccggctgtggctgcatcgctccaaacccttccttcacctttacgtggcatcctgct tcagctgaacgtcagccagaggcagcaactgcagccgcccccatctcagctttccggcca cctcccaggggacgtgggcagggaggccacgtggcaggaaaagcgacctag >gi568815587r:123653583_123854518|GENSCAN_predicted_peptide_4|738_aa MATAVEPEDQDLWEEEGILMVKLEDDFTCRPESVLQRDDPVLETSHQNFRRFRYQEAASP REALIRLRELCHQWLRPERRTKEQILELLVLEQFLTVLPGELQSWVRGQRPESGEEAVTL VEGLQKQPRRPRRWASSPKISSRDNQELPPDSMVTGSWNYSQVTVHVHGQEVLSEETVHL GVEPESPNELQDPVQSSTPEQSPEETTQSPDLGAPAEQRPHQEEELQTLQESEVPVPEDP DLPAERSSGDSEMVALLTALSQVCPSYLCTTENLFEEPLGISHTKQGQNTRLPVASRFDE ILMWFQGLVTFKDVAVCFSQDQWSDLDPTQKEFYGEYVLEEDCGIVVSLSFPIPRPDEIS QVREEEPWVPDIQEPQETQEPEILSFTYTGDRSKDEEECLEQEDLSLEDIHRPVLGEPEI HQTPDWEIVFEDNPGRLNERRFGTNISQVNSFVNLRETTPVHPLLGRHHDCSVCGKSFTC NSHLVRHLRTHTGEKPYKCMECGKSYTRSSHLARHQKVHKMNAPYKYPLNRKNLEETSPV TQAERTPSVEKPYRCDDCGKHFRWTSDLVRHQRTHTGEKPFFCTICGKSFSQKSVLTTHQ RIHLGGKPYLCGECGEDFSEHRRYLAHRKTHAAEELYLCSECGRCFTHSAAFAKHLRGHA SVRPCRCNECGKSFSRRDHLLYISPRQVQGTTNEKISGRGLQQLLGAWVHLRNLGNRSTE KQFHGKRAVGQLNNPSKQ >gi568815587r:123653583_123854518|GENSCAN_predicted_CDS_4|2217_bp atggctacagccgtggaaccagaggaccaggatctttgggaagaagagggaattctgatg gtgaaactggaagatgatttcacctgtcggccagagtctgtcttacagagggatgacccg gtgctggaaacctcccaccagaacttccgacgcttccgctaccaggaggcagcaagccct agagaagctctcatcagactccgagaactttgtcaccagtggctgagaccagagaggcgg acaaaggagcagatcctagagctgcttgtgctggaacaatttcttaccgtcctacctgga gaactacagagctgggtgcggggccaacggccagaaagtggcgaggaggcagtgacgctg gtggagggtttgcagaaacaacccaggagaccaaggcggtgggcatcttctcctaaaata agctcccgtgacaaccaagaacttcctcctgactccatggtgactggaagttggaattat tcccaggtgactgtccatgttcacggccaggaagtcctgtcagaggagacggtgcattta ggagtggagcctgagtcacctaatgagctgcaggatcctgtgcaaagctcgacccccgag cagtctcctgaggaaaccacacagagcccagatctgggggcaccggcagagcagcgtcca caccaggaagaggagctccagaccctgcaggagagcgaggtcccagtgcccgaggaccca gaccttcctgcagagaggagctctggagactcagagatggttgctcttcttactgctctg tcacaggtgtgccctagttacctctgtaccacagagaatttgtttgaagaaccactgggc ataagccatactaaacagggacaaaatacgaggctacccgtagcatcacgttttgatgaa atccttatgtggtttcagggactggtaacgttcaaggatgtggccgtatgcttttcccag gaccagtggagtgatctggacccaacacagaaagagttctatggagaatatgtcttggaa gaagactgtggaattgttgtctctctgtcatttccaatccccagacctgatgagatctcc caggttagagaggaagagccttgggtcccagatatccaagagcctcaggagactcaagag ccagaaatcctgagttttacctacacaggagataggagtaaagatgaggaagagtgtctg gagcaggaagatctgagtttggaggatatacacaggcctgttttgggagaaccagaaatt caccagactccagattgggaaatagtctttgaggacaatccaggtagacttaatgaaaga agatttggtactaatatttctcaagtgaatagttttgtgaaccttcgggaaactacaccc gtccaccccctgttagggaggcatcatgactgttctgtgtgtggaaagagcttcacttgt aactcccaccttgttagacacctgaggactcacacaggagagaaaccctataaatgtatg gaatgtggaaaaagttacacacgaagctcacatcttgccaggcaccaaaaggttcacaag atgaacgcgccttacaaatatcccctaaaccggaagaatttggaagagacctcccctgtg acacaggctgagagaactccatcagtggagaaaccctatagatgtgatgattgcggaaag cacttccgctggacttcagaccttgtcagacatcagaggacacatactggagaaaaaccc ttcttttgtactatttgtggcaaaagcttcagccagaaatctgtgttaacaacacaccaa agaatccacctgggaggcaaaccctacttgtgtggagagtgtggtgaggacttcagtgaa cacaggcggtacctggcgcaccggaagacgcacgctgctgaggaactctacctctgcagc gagtgcgggcgctgcttcacccacagcgcagcgttcgccaagcacttgagaggacacgcc tcagtgaggccctgccgatgcaacgaatgtgggaagagcttcagtcgcagggaccacctc ttgtacatctctcctagacaagtccaaggaactactaacgagaagatttcaggaagaggc ctacagcaattgcttggtgcttgggttcatttgcggaatcttggcaacaggtctacagag aagcagttccacggcaaaagagctgtggggcagttgaataatccatccaaacaatga >gi568815587r:123653583_123854518|GENSCAN_predicted_peptide_5|123_aa MEDEDTDQEFQNAGVYAGGFQTGPNITVEMTDNIIATEWQLDEQHRLTKDNGEAHHPGAQ GQLQAEFAGHDGGVVKGIADGEVAVKRHDSEDQELGGAHEEVEEGLQQAAGHADYCSCHY KGS >gi568815587r:123653583_123854518|GENSCAN_predicted_CDS_5|372_bp atggaggatgaggacactgaccaggagttccaaaatgctggtgtctatgcaggcggcttt caaactgggcccaacatcacagtagaaatgactgataacattattgccacagaatggcaa ctggatgagcagcatcgtctgacaaaagacaatggtgaagcccaccacccaggagctcag ggccagctgcaggcagagtttgctggtcatgatggtggggtggtgaaggggattgcagat ggtgaggtagcggtcaaaagacatgatagtgaggatcaagaactcggtggtgcccacgaa gaagtggaagaaggcctgcagcaggcagcaggacatgcagattactgttcttgccactac aaaggttcctag >gi568815587r:123653583_123854518|GENSCAN_predicted_peptide_6|74_aa MRKFLKGTKQGHSGSSCFRPALPDGLQLCNTVNLLIASPLIGKGDWCGQEEIRSHLLPVI LCAKERRMNENYLL >gi568815587r:123653583_123854518|GENSCAN_predicted_CDS_6|225_bp atgaggaaatttttgaagggcaccaagcaaggacattcaggcagctcatgcttcaggcct gcactccctgatggcttacagttgtgcaacacagtgaacttgctaattgcttctcctctg attggaaaaggagattggtgtggccaagaagaaattcggtcacatctcctacctgttatc ctgtgcgcaaaggaaagaagaatgaatgaaaactatcttttatag >gi568815587r:123653583_123854518|GENSCAN_predicted_peptide_7|605_aa MARELRDACTSFNSRCNQVEEKVSVIEHQINEIKQEDKVREKRVKRNEQSLQEIWDCVKR PNLCLIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSRREIPR HITVRFTKVEIKEKMLRAAREKDRSMRQKVNKDIQDLNSALHQADLIDIYRTLHPKSSEY TFFSAPHCTYSKIGHIIGSEALLSKCKRTEITTNRLSDHSAIILELRTKKLTQNRTTTWK LNNLLLNDYWVNNEMKAGIKIFFETNENKDTTYQNLWDTFKAGNPAHHIVVVMGNWSTVT EITLIAFPALLEIRISLFVVLVVTYTLTATGNITIISLIWIDHRLQTPMYFFLSNLSFLD ILYTTVITPKLLACLLGEEKTISFAGCMIQTYFYFFLGTVEFILLAVMSFDRYMAICDPL HYTVIMNSRACLLLVLGCWVGAFLSVLFPTIVVTRLPYCRKEINHFFCDIAPLLQVACIN THLIEKINFLLSALVILSSLAFTTGSYVYIISTILRIPSTQGRQKAFSTCASHITVVSIA HGSNIFVYVRPNQNSSLDYDKVAAVLITVVTPLLNPFIYSLRNEKVQEVLRETVNRIMTL IQRKT >gi568815587r:123653583_123854518|GENSCAN_predicted_CDS_7|1818_bp atggcacgagaacttcgtgacgcatgcacaagcttcaatagccgatgcaatcaagtggaa gaaaaggtatcagtgattgaacatcaaattaatgaaataaagcaagaagataaagttaga gaaaaaagagtaaaaagaaatgaacaaagcctccaagaaatatgggactgtgtgaaaaga ccaaatctatgtttgattggtgtacctgaaagtgatggggagaatggaaccaagttggaa aacactcttcaggatattatccaggagaacttccccaacctagcaaggcaggccaacatt caaattcaggaaatacagagaacaccacaaagatactcctcgagaagagaaatcccaaga cacataactgtcagattcaccaaggttgaaatcaaggaaaaaatgttaagggcagccaga gagaaagacagatcgatgagacagaaggttaataaggatatccaggacttgaactcagct ctgcaccaagcagacctaatagacatctatagaactctccaccccaaatcatcagaatat acattcttctcagcaccacattgcacctattctaaaattggccacataattggaagtgaa gcactcctcagcaaatgtaaaagaacagaaatcacaacaaaccgtctctcagaccacagt gcaatcatattagaactcaggactaagaaactcactcaaaaccgcacaactacatggaaa ctgaacaacctgctcctgaatgactactgggtgaataatgaaatgaaggcaggaataaag attttctttgaaaccaatgaaaacaaagacacaacataccagaatctctgggacacattt aaagcaggaaaccctgcccaccatatagtagttgtcatgggaaactggagcactgtgact gaaatcaccctaattgccttcccagctctcctggagattcgaatatctctcttcgtggtt cttgtggtaacttacacattaacagcaacaggaaacatcaccatcatctccctgatatgg attgatcatcgcctgcaaactccaatgtacttcttcctcagtaatttgtcctttctggat atcttatacaccactgtcattaccccaaagttgttggcctgcctcctaggagaagagaaa accatatcttttgctggttgcatgatccaaacatatttctacttctttctggggacggtg gagtttatcctcttggcggtgatgtcctttgaccgctacatggctatctgcgacccactg cactacacggtcatcatgaacagcagggcctgccttctgctggttctgggatgctgggtg ggagccttcctgtctgtgttgtttccaaccattgtagtgacaaggctaccttactgtagg aaagaaattaatcatttcttctgtgacattgcccctcttcttcaggtggcctgtataaat actcacctcattgagaagataaactttctcctctctgcccttgtcatcctgagctccctg gcattcactactgggtcctacgtgtacataatttctaccatcctgcgtatcccctccacc cagggccgtcagaaagctttttctacctgtgcttctcacatcactgttgtctccattgcc cacgggagcaacatctttgtgtatgtgagacccaatcagaactcctcactggattatgac aaggtggccgctgtcctcatcacagtggtgacccctctcctgaacccttttatctacagc ttgaggaatgagaaggtacaggaagtgttgagagagacagtgaacagaatcatgaccttg atacaaaggaaaacttga >gi568815587r:123653583_123854518|GENSCAN_predicted_peptide_8|287_aa MAEGEEEASTSHGESRSKMEEGHRGWQPCVQQQPAPTTGIANVTMNRNATAPLPLVPYHR TRRPYLLLGASPKLISFVFLGTVEFILLAVMSFDCYVAICDPLHYTIIMNSRACLLLVLG CWVGAFLSVLCPTIVVSRLPFCYKEISHFFCDITPLLHVSCIDTHFIEMINFLLSSLILL TSLVLTTVSYIYIISTILHIPSAQGRRKAFSTCASHITVISIAYISNIFRYVRPSQSHSM GFDKVTAVPTMVTPLLNPFTYSLRNEKVKAVLKEAVSKIMSSWHRRT >gi568815587r:123653583_123854518|GENSCAN_predicted_CDS_8|864_bp atggcagaaggtgaagaggaagccagcacatcacatggtgagagcagaagcaagatggag gaggggcacagaggctggcagccctgtgtgcaacagcagcctgcccccaccacgggcatt gccaatgtgaccatgaataggaatgccacagctccactcccactggtaccctaccacagg acaagaagaccatatcttttgctgggtgcatcacccaaacttatttcctttgttttcttg gggacagtggagtttatcctcttggcagtgatgtcctttgactgctacgtggccatctgt gaccccctgcactacaccattatcatgaacagcagggcctgcctcctactagttctgggc tgctgggttggagccttcctgtctgtgttgtgcccaaccattgtggtgtccagattgcct ttctgttacaaggaaattagtcacttcttctgtgacatcacccctctgctacatgtgtcc tgtatagacactcatttcatcgagatgataaacttcctcttatcttccctcatcctcctg acctcactggtgctcaccactgtgtcctacatctacatcatttctaccatcctgcacatc ccctcagcccaaggacgtcggaaggccttttccacgtgcgcttcccacatcaccgtcatt tccatcgcttatataagcaacatcttcaggtatgtgaggcccagccagagtcattcaatg ggttttgacaaggtgacagctgtccccacaatggtgacccctcttctgaatcccttcact tatagtctaagaaatgaaaaggtaaaggcagtcttgaaagaagcagtcagcaaaattatg tcctcatggcacaggagaacttaa