GENSCAN 1.0    Date run: 4-Nov-116    Time: 18:56:54

Sequence gi568815594f:40143387_40343959 : 200573 bp : 43.15% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1246   1414  169  0  1   41   97   189 0.531  14.62
 1.02 Intr +   9394   9517  124  2  1  103  109    50 0.953   8.24
 1.03 Intr +  40473  40565   93  0  0   96   81    35 0.047   2.68
 1.04 Term +  55245  55383  139  0  1  102   45    51 0.027  -0.36
 1.05 PlyA +  57748  57753    6                               1.05

 2.00 Prom +  79098  79137   40                              -3.36
 2.01 Init +  79581  79653   73  2  1   30   81    85 0.975   3.23
 2.02 Term +  79970  80079  110  0  2   46   33   134 0.950   2.37
 2.03 PlyA +  80316  80321    6                               1.05

 3.00 Prom +  89287  89326   40                              -3.46
 3.01 Sngl + 100001 100576  576  1  0   78   40   815 0.990  69.87
 3.02 PlyA + 101728 101733    6                               1.05

 4.03 PlyA - 104372 104367    6                               1.05
 4.02 Term - 112581 112514   68  1  2   70   42    85 0.483   0.20
 4.01 Init - 118082 118016   67  2  1    8   98   119 0.481   4.15
 4.00 Prom - 138260 138221   40                              -1.56

 5.02 PlyA - 138714 138709    6                               1.05
 5.01 Sngl - 151701 151240  462  0  0   98   42   429 0.524  35.46
 5.00 Prom - 153748 153709   40                              -4.46

 6.00 Prom + 155393 155432   40                              -4.26
 6.01 Init + 163943 164104  162  1  0   68   76   126 0.656   9.13
 6.02 Intr + 164891 164942   52  1  1   92   99    36 0.662   3.58
 6.03 Intr + 166940 167105  166  0  1   92   50    31 0.474  -1.28
 6.04 Term + 169079 169250  172  2  1   91   43    74 0.481   0.50
 6.05 PlyA + 170781 170786    6                               1.05

 7.05 PlyA - 171557 171552    6                               1.05
 7.04 Term - 185775 185648  128  1  2   46   36   142 0.565   3.24
 7.03 Intr - 186439 186364   76  1  1   53  110    66 0.919   4.39
 7.02 Intr - 187724 187599  126  2  0   75   89   112 0.991  10.88
 7.01 Init - 187921 187871   51  1  0   58   44    65 0.938  -1.72
 7.00 Prom - 189336 189297   40                              -6.06

 8.00 Prom + 191010 191049   40                              -6.56
 8.01 Init + 192082 192145   64  0  1   63   78     4 0.884  -1.65
 8.02 Intr + 192441 192586  146  1  2   27  101   165 0.978  11.70
 8.03 Intr + 193824 193978  155  2  2   39  108   146 0.919  10.57
 8.04 Term + 197177 197264   88  2  1   55   46    92 0.622  -1.17
 8.05 PlyA + 197716 197721    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:40143387_40343959|GENSCAN_predicted_peptide_1|174_aa
GTLHEQKMKEANHLAAIEIFEKVNASLLPQNVLDLHGLHVDEALEHLMRVLEKKTEEFKQ
NGGKPYLSVITGRGNHSQGGVARIKPAVIKYLISHSFRAAATFKASNSPRLHPKHVRKTG
APARVYFNWTSFMQRKVRVARGTSKTSTSGFFILRRPFNYPGIKVAFLLGERVG

>gi568815594f:40143387_40343959|GENSCAN_predicted_CDS_1|525_bp
ggtactcttcatgagcagaagatgaaagaagccaatcaccttgctgccatagagatcttt
gagaaagtcaatgcttcgctgctgccacagaatgttttagacctccatgggctgcatgtg
gatgaagctctagaacatttgatgagagttttagagaagaagactgaagaatttaagcag
aacggtgggaagccctatttgtctgtgattacggggagaggaaaccacagccagggagga
gttgctcgcatcaaaccagctgtcattaagtacctcataagccatagcttcagggctgct
gcaaccttcaaggcatctaattctccacggctacacccaaagcatgtaaggaagacgggt
gctcctgcccgtgtctactttaactggacatcatttatgcagagaaaggttcgtgtggct
cggggtaccagtaagacctccacctctggtttcttcattttaaggaggcccttcaattat
ccaggaattaaagtggccttcctcttgggagaacgagttggttga

>gi568815594f:40143387_40343959|GENSCAN_predicted_peptide_2|60_aa
MLVEEVTADVVEGARELELEEEAEEIATATPTCSNHYPDQLAAINIEAKRLGLAEGSDDC

>gi568815594f:40143387_40343959|GENSCAN_predicted_CDS_2|183_bp
atgctagtggaggaagtaactgcagatgtggtagaaggagcaagagagctagaattagaa
gaggaggctgaagaaatagccacagccactccaacctgcagcaaccactaccctgatcaa
ttggcagccatcaacattgaggcaaaaagattaggacttgctgaaggctcagatgattgt
tag

>gi568815594f:40143387_40343959|GENSCAN_predicted_peptide_3|191_aa
MLSSIKCVLVGDSAVGKTSLLVRFTSETFPEAYKPTVYENTGVDVFMDGIQISLGLWDTA
GNDAFRSIRPLSYQQADVVLMCYSVANHNSFLNLKNKWIGEIRSNLPCTPVLVVATQTDQ
REMGPHRASCVNAMEGKKLAQDVRAKGYLECSALSNRGVQQVFECAVRTAVNQARRRNRR
RLFSINECKIF

>gi568815594f:40143387_40343959|GENSCAN_predicted_CDS_3|576_bp
atgctgagttccatcaagtgcgtgttggtgggcgactctgctgtggggaaaacctctctg
ttggtgcgcttcacctccgagaccttcccggaggcctacaagcccacagtgtacgagaac
acaggggtggacgtcttcatggatggcatccagatcagcctgggcctctgggacacagcc
ggcaatgacgccttcagaagcatccggcccctgtcctaccagcaggcagacgtggtgctg
atgtgctactctgtggccaaccataactcattcctgaacttgaagaacaagtggattggt
gaaattaggagcaacttgccctgtacccctgtgctggtggtggccacccagactgaccag
cgggagatggggccccacagggcctcctgcgtcaatgccatggaagggaagaaactggcc
caggatgtcagagccaagggctacctggagtgctcagcccttagcaatcggggagtacag
caggtgtttgagtgcgccgtccgaactgccgtcaaccaggccaggagacgaaacagaagg
aggctcttctccatcaatgagtgcaagatcttctaa

>gi568815594f:40143387_40343959|GENSCAN_predicted_peptide_4|44_aa
MGAAAAALLLVVLSGSPLATLSGSSRASDESIIEWKKGSLEQEI

>gi568815594f:40143387_40343959|GENSCAN_predicted_CDS_4|135_bp
atgggagctgctgctgctgcactcttgctggtggtgctaagcggcagtcccctagccacg
ctgtcagggtcatccagagcatcagatgaaagcatcatcgaatggaaaaaagggagtttg
gagcaagagatctaa

>gi568815594f:40143387_40343959|GENSCAN_predicted_peptide_5|153_aa
MEMEAEMEMEVEMEMEAEMEMEAEMEMEVEMEMEAEMEMEAQMEMEMEAEMEMEMEAEME
MEMEAEMEMKMEAEMEMEMEVEMEAEMEMEMEVEMEAEIEVEMEAQMEMEIEAEMEMEAE
MEMEMEAEMEMEMEAEMEMEMEAEMEMEIEAEM

>gi568815594f:40143387_40343959|GENSCAN_predicted_CDS_5|462_bp
atggagatggaggcagagatggagatggaggtggagatggagatggaggcagagatggag
atggaggcagagatggagatggaggtggagatggaaatggaggcagagatggagatggag
gcacagatggagatggagatggaggcagaaatggagatggagatggaggcagagatggag
atggagatggaggcagagatggagatgaagatggaggcagagatggagatggagatggag
gtggagatggaggcagagatggagatggagatggaggtggagatggaggcagagatagag
gtggagatggaggcacagatggagatggagatagaggcagagatggagatggaggcagaa
atggagatggagatggaggcagagatggagatggagatggaggcagagatggagatggag
atggaggcagagatggagatggagattgaggcagagatgtag

>gi568815594f:40143387_40343959|GENSCAN_predicted_peptide_6|183_aa
MPPRPLWFYRLMFVLELEQDEVGNEAYRVTRDGGSSRKDRVIWFLEGILEISSYMGQLKL
REKWLDQGPTAETTQDCYGGSLITRFLPSCSILFNTWLLPRGPKWLLQLQAWFSVSVTGQ
EERNREGMTFSFRQTKPESKEKGSRKLRDLILKQGHRSQCEGATPGQIGHNLSTKVIKDS
NGM

>gi568815594f:40143387_40343959|GENSCAN_predicted_CDS_6|552_bp
atgcccccaaggccactgtggttctacagacttatgtttgtgctcgagttagagcaggat
gaggttggaaatgaagcttacagagttacaagggacggcggtagttccagaaaggacagg
gtcatctggtttttggaaggcatcttggagatttcatcgtacatggggcagttgaagctc
agagagaagtggcttgaccaaggcccaacggcagagacaacccaagactgttatggcggc
tccctgatcaccaggtttcttccatcttgttccattctcttcaacacatggcttctacct
cgtggcccaaaatggctgctccagctccaagcttggttcagcgtttcagtcactggtcag
gaggaaaggaacagagaagggatgactttcagcttcaggcagaccaagcctgaatcaaag
gagaagggatccagaaagttacgtgatctgattttaaagcaaggacacaggagccagtgt
gaaggggctacccctgggcaaatcggacacaatttgagcaccaaagtaattaaggacagc
aatgggatgtga

>gi568815594f:40143387_40343959|GENSCAN_predicted_peptide_7|126_aa
MRRRMAGPGLTPTAAQPGDRDSVVDEELEFPDIGDGGNCGYSQARGWQGEQQGNGSEVQP
LLTAICTNHTCLLAANGAPEFPSPVDSLQWILTLFGIKAKILTGNYKAVRDVSPDFPSPT
LSPSLM

>gi568815594f:40143387_40343959|GENSCAN_predicted_CDS_7|381_bp
atgcgcagacgcatggcagggccagggctgactcccactgcagctcagccaggtgaccgt
gatagtgtcgtggatgaggaattagagtttccagatataggtgatggtgggaactgtggc
tacagccaggccaggggctggcagggtgagcagcaggggaatgggtcagaagttcaacca
ttgctgactgccatctgcaccaaccacacttgcctcctagcagccaacggtgcccctgaa
ttcccttctccagtggattctctccagtggattctcaccttatttggaataaaagccaaa
atccttactggcaactacaaggccgtacgtgatgtctcccccgactttccttctccaaca
ctctccccctcgctcatgtag

>gi568815594f:40143387_40343959|GENSCAN_predicted_peptide_8|150_aa
MNWSHSCISFCWIYFAASRLRAAETADGKYAQKLFNDLFEDYSNALRPVEDTDKVLNVTL
QITLSQIKDMDERNQILTAYLWIRQIWHDAYLTWDRDQYDGLDSIRIPSDLVWRPDIVLY
NKYNIIASQKQVQNIIGNTMGFLSCGKKEK

>gi568815594f:40143387_40343959|GENSCAN_predicted_CDS_8|453_bp
atgaactggtcccattcctgcatctccttttgctggatctactttgctgcttccagactg
agagctgcagagacggcagatggaaaatatgctcagaagttgtttaatgacctttttgaa
gattattctaatgctcttcgtccagtggaagatacagataaagtcctgaatgtgaccctg
cagattacgctctctcagattaaggatatggatgaaagaaaccaaattctgactgcttat
ttgtggatccgccaaatctggcacgatgcctatctcacgtgggaccgagatcagtacgat
ggcctagactccatcaggatccccagtgacctcgtgtggaggccagacatcgtcttatat
aacaaatacaatatcattgcttctcagaagcaggtacagaacatcataggcaacacaatg
ggcttcctgagttgtggcaagaaggagaaatag