GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:16:19 Sequence gi568815584f:52607220_52827614 : 220395 bp : 38.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2396 2507 112 2 1 84 81 80 0.061 6.36 1.02 Intr + 24941 25090 150 1 0 68 94 43 0.409 2.34 1.03 Intr + 26609 26727 119 1 2 76 71 116 0.783 7.04 1.04 Term + 27719 27896 178 2 1 80 48 60 0.687 -2.62 1.05 PlyA + 28327 28332 6 1.05 2.13 PlyA - 29650 29645 6 1.05 2.12 Term - 31060 31024 37 0 1 93 37 44 0.130 -4.37 2.11 Intr - 39068 38935 134 2 2 109 96 79 0.686 9.32 2.10 Intr - 45089 45020 70 1 1 84 86 21 0.496 -0.23 2.09 Intr - 46096 45850 247 2 1 91 38 126 0.494 3.50 2.08 Intr - 56628 56582 47 2 2 96 94 16 0.859 0.13 2.07 Intr - 59276 59156 121 0 1 46 44 149 0.902 4.93 2.06 Intr - 64484 64411 74 1 2 75 30 94 0.922 0.43 2.05 Intr - 64652 64576 77 2 2 67 20 195 0.943 7.99 2.04 Intr - 71253 71215 39 0 0 127 93 40 0.979 5.90 2.03 Intr - 75189 75106 84 0 0 112 47 54 0.766 2.80 2.02 Intr - 76688 76569 120 2 0 103 63 64 0.842 5.17 2.01 Init - 88262 88149 114 2 0 77 99 253 0.292 23.56 2.00 Prom - 94039 94000 40 -6.35 3.00 Prom + 97046 97085 40 -2.15 3.01 Init + 99959 100085 127 1 1 102 65 214 0.990 20.87 3.02 Intr + 101090 101169 80 0 2 45 96 111 0.949 5.95 3.03 Intr + 101264 101303 40 1 1 93 82 37 0.867 0.48 3.04 Intr + 101545 101597 53 2 2 79 90 78 0.813 4.81 3.05 Intr + 106662 106749 88 2 1 82 77 35 0.661 0.42 3.06 Intr + 110862 110923 62 1 2 102 98 16 0.776 1.53 3.07 Intr + 111010 111133 124 0 1 79 91 73 0.936 5.94 3.08 Intr + 113891 113971 81 0 0 69 82 50 0.687 1.19 3.09 Intr + 116746 116817 72 2 0 88 110 100 0.999 10.66 3.10 Term + 120280 120398 119 2 2 94 28 144 0.980 6.72 3.11 PlyA + 120450 120455 6 1.05 4.00 Prom + 120680 120719 40 -6.25 4.01 Init + 123256 123404 149 0 2 107 29 211 0.764 16.71 4.02 Intr + 150655 150705 51 1 0 101 110 10 0.357 1.60 4.03 Intr + 152463 152535 73 0 1 117 93 12 0.457 3.09 4.04 Intr + 161621 161714 94 1 1 111 110 28 0.443 5.92 4.05 Term + 163814 163887 74 0 2 36 38 101 0.205 -3.01 4.06 PlyA + 164775 164780 6 1.05 5.08 PlyA - 166456 166451 6 1.05 5.07 Term - 171239 171092 148 1 1 110 42 49 0.548 -1.11 5.06 Intr - 173521 173460 62 1 2 54 87 83 0.460 1.41 5.05 Intr - 174692 174565 128 0 2 83 95 145 0.997 14.18 5.04 Intr - 177445 177278 168 2 0 73 80 103 0.679 7.00 5.03 Intr - 187032 186877 156 1 0 81 37 71 0.002 0.36 5.02 Intr - 194375 194278 98 1 2 46 78 69 0.047 0.43 5.01 Intr - 212817 212767 51 2 0 104 80 60 0.026 3.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 183703 184006 304 0 1 40 39 171 0.870 4.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:52607220_52827614|GENSCAN_predicted_peptide_1|186_aa XCLSTGGRSKPRLAPCLVRARKLMPISGPDGSGEDTRKGMSLCQTVVVGSVVILLYSSRA CYNLVVVTISQDTLESPFNYGWDNLSDKAPAGMINSHSYSSRAYFFDNPRRYDSDDDLPR LGSSREGSLPNSQSLGWYGTMTGCGSSSYTVTPHLNGPMTDTAPLLFTCSNLDLNNHHSL YVTPQN >gi568815584f:52607220_52827614|GENSCAN_predicted_CDS_1|561_bp nnttgtctcagcactggaggacgctctaagcctaggcttgctccatgtcttgtgagggct cggaagttgatgccaatttctggcccagatggatctggggaagacacaaggaagggtatg tctctgtgccagactgtcgtcgtgggctctgtagtcattcttctgtactcttccagagct tgttataatttggtggtggtcaccatatctcaggatacattagaaagtccatttaattat ggctgggataatctttcagataaggcacctgctggcatgataaatagtcacagttatagt tccagagcttactttttcgacaatccaagacgatatgatagtgatgatgacctgccaaga ctgggaagttcaagagaaggaagtttaccaaattcgcaaagtttgggctggtatggcacc atgactgggtgtggcagcagcagttacacagtcactccccacctgaatggacctatgaca gatactgctcctttgctctttacttgtagtaatttagatttgaacaatcatcatagctta tatgtgacaccacaaaactga >gi568815584f:52607220_52827614|GENSCAN_predicted_peptide_2|387_aa MGRGWGFLFGLLGAVWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFNNYR LFPRLQKLLESDYFRYYKVNLKRPCPFWNDISQCGRRDCAVKPCQSDEVPDGIKSASYKY SEEANNLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVD LLLNPERYTGYKGPDAWKIWNVIYEENCFKPQTIKRPLNPLASGQETWLEKKWGHNITEF QQRFDGILTEGEGPRRLKNLYFLYLIELRALSKVLPFFERPDFQLFTGNKIQDEENKMLL LEILHEIKSFPLHFDENSFFAGDKKEAHKLKTQGLGTALKILFSEKLIANMPESGPSYEF HLTRQEIVSLFNAFGRDMDEIGNNHSQ >gi568815584f:52607220_52827614|GENSCAN_predicted_CDS_2|1164_bp atgggccgcggctggggattcttgtttggcctcctgggcgccgtgtggctgctcagctcg ggccacggagaggagcagcccccggagacagcggcacagaggtgcttctgccaggttagt ggttacttggatgattgtacctgtgatgttgaaaccattgatagatttaataactacagg cttttcccaagactacaaaaacttcttgaaagtgactactttaggtattacaaggtaaac ctgaagaggccgtgtcctttctggaatgacatcagccagtgtggaagaagggactgtgct gtcaaaccatgtcaatctgatgaagttcctgatggaattaaatctgcgagctacaagtat tctgaagaagccaataatctcattgaagaatgtgaacaagctgaacgacttggagcagtg gatgaatctctgagtgaggaaacacagaaggctgttcttcagtggaccaagcatgatgat tcttcagataacttctgtgaagctgatgacattcagtcccctgaagctgaatatgtagat ttgcttcttaatcctgagcgctacactggttacaagggaccagatgcttggaaaatatgg aatgtcatctacgaagaaaactgttttaagccacagacaattaaaagacctttaaatcct ttggcttctggtcaagagacctggttagaaaagaaatggggacacaacattacagaattt caacagcgatttgatggaattttgactgaaggagaaggtccaagaaggcttaagaacttg tattttctctacttaatagaactaagggctttatccaaagtgttaccattcttcgagcgc ccagattttcaactctttactggaaataaaattcaggatgaggaaaacaaaatgttactt ctggaaatacttcatgaaatcaagtcatttcctttgcattttgatgagaattcatttttt gctggggataaaaaagaagcacacaaactaaagactcagggtttgggcactgctctgaag atcttattttctgagaaattgatagcaaatatgccagaaagtggacctagttatgaattc catctaaccagacaagaaatagtatcattattcaacgcatttggaagggacatggatgaa attggaaataatcattctcagtaa >gi568815584f:52607220_52827614|GENSCAN_predicted_peptide_3|281_aa MAIPGIPYERRLLIMADPRDKALQDYRKKLLEHKEIDGRLKELREQLKELTKQYEKSEND LKALQSVGQIVGEVLKQLTEEKFIVKATNGPRYVVGCRRQVIELPLTNPELFQRVGIIPP KGCLLYGPPGTGKTLLARAVASQLDCNFLKVVSSSIVDKYIGESARLIREMFNYARDHQP CIIFMDEIDAIDIDLPNEQARLDILKIHAGPITKHGEIDYEAIVKLSDGFNGADLRNVCT EAGMFAIRADHDFVVQEDFMKAVRKVADSKKLESKLDYKPV >gi568815584f:52607220_52827614|GENSCAN_predicted_CDS_3|846_bp atggccattcccggcatcccctatgagagacggcttctcatcatggcggaccctagagat aaggcgcttcaggactaccgcaagaagttgcttgaacacaaggagatcgacggccgtctt aaggagttaagggaacaattaaaagaacttaccaagcagtatgaaaagtctgaaaatgat ctgaaggccctacagagtgttgggcagatcgtgggtgaagtgcttaaacagttaactgaa gaaaaattcattgttaaagctaccaatggaccaagatatgttgtgggttgtcgtcgacag gtgatagaattacctcttacaaacccagagttatttcagcgtgtaggaataatacctcca aaaggctgtttgttatatggaccaccaggtacgggaaaaacactcttggcacgagccgtt gctagccagctggactgcaatttcttaaaggttgtatctagttctattgtagacaagtac attggtgaaagtgctcgtttgatcagagaaatgtttaattatgctagagatcatcaacca tgcatcatttttatggatgaaatagatgctattgatattgatttgccaaatgaacaagca agattagacatactgaaaatccatgcaggtcccattacaaagcatggtgaaatagattat gaagcaattgtgaagctttcggatggctttaatggagcagatctgagaaatgtttgtact gaagcaggtatgttcgcaattcgtgctgatcatgattttgtagtacaggaagacttcatg aaagcagtcagaaaagtggctgattctaagaagctggagtctaaattggactacaaacct gtgtaa >gi568815584f:52607220_52827614|GENSCAN_predicted_peptide_4|146_aa MEDVKLEFPSLPQCKEDAEVSRSRGCHAQASPCGSGRGATPVPNRLSRHLAAFVIAYIME TFGMKYRDAFAYVQERRFCINPNAGFVHQLQEYEAIYLAKLTIQMMSPLQIERSLSVHSG TTGSLKRTHEEEDDFGTMQVATAQNG >gi568815584f:52607220_52827614|GENSCAN_predicted_CDS_4|441_bp atggaggacgtgaagctggagttcccttcccttccacagtgcaaggaagacgccgaggtg agtcgctcccgtggctgccacgcacaggcctctccctgtggctccggccgaggggcgacc ccagtccccaaccgtcttagccgccaccttgcagcctttgttattgcatacattatggaa acatttggaatgaagtacagagatgcttttgcttatgttcaagaaagaagattttgtatt aatcctaatgctggatttgtccatcaacttcaggaatatgaagccatctacctagcaaaa ttaacaatacagatgatgtcaccactccagatagaaaggtcattatctgttcattctggt accacaggcagtttgaagagaacacatgaagaagaggatgattttggaaccatgcaagtg gcgactgcacagaatggctga >gi568815584f:52607220_52827614|GENSCAN_predicted_peptide_5|270_aa XSQDFENPDSNKEIDMASCVTNTDKFVCDKSSSLLLEQLVTTSLQCRVVLMLGLQVLTMV PGQGILKLLIQANVPESLLNSITDIYVYPRNSLICANENIVVDLTRKMKPDETPMFDPSL LKEVDWSQNTATFSPAISPTHPGEGLVLRPLCTADLNRESFEHMKKSGDYYVTVVEDVTL GQIVATATLIIEHKFIHSCAKRGRVEDVVVSDECRGKQLGKLLLSTLTLLSKKLNCYKIT LECLPQNVGFYKKFGYTVSEENYMCRRFLK >gi568815584f:52607220_52827614|GENSCAN_predicted_CDS_5|813_bp nngtcacaagattttgagaacccagattcaaataaagaaatagatatggccagttgtgtg actaacactgacaaatttgtctgtgacaagagttcttccttactacttgagcagcttgtc accacttcccttcagtgcagggtggttctaatgctgggattacaggtgttaaccatggta cccggccaaggtattcttaaacttctcattcaggccaacgttcctgaatcacttctcaac tcaatcactgacatctatgtttatccacgtaatagccttatttgtgccaatgagaatatt gttgttgaccttactagaaaaatgaaacctgatgaaactcctatgtttgacccaagtcta ctcaaagaagtggactggagtcagaatacagctacattttctccagccatttccccaaca catcctggagaaggcttggttttgaggcctctttgtactgctgacttaaatagagaatct tttgagcatatgaagaaatctggggattattatgttacagttgtagaagatgtgactcta ggacagattgttgctacggcaactctgattatagaacataaattcatccattcctgtgct aagagaggaagagtagaagatgttgttgttagtgatgaatgcagaggaaagcagcttggc aaattgttattatcaacccttactttgctaagcaagaaactgaactgttacaagattacc cttgaatgtctaccacaaaatgttggtttctataaaaagtttggatatactgtatctgaa gaaaactacatgtgtcggaggtttctaaagtaa