GENSCAN 1.0	Date run: 5-Nov-116	Time: 00:26:09

Sequence gi568815585r:22596026_22796574 : 200549 bp : 38.22% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1291   1460  170  0  2   91   37    81 0.130   2.36
 1.02 Term +   8327   8441  115  2  1   81   48    86 0.351   0.96
 1.03 PlyA +   8452   8457    6                               1.05

 2.00 Prom +   9258   9297   40                              -3.05
 2.01 Init +  10989  11048   60  2  0   74   47    73 0.253   3.10
 2.02 Term +  40292  40489  198  1  0   70   40   186 0.006   8.42
 2.03 PlyA +  43249  43254    6                               1.05

 3.07 PlyA -  44051  44046    6                               1.05
 3.06 Term -  46339  45982  358  0  1   87   45   238 0.060  12.40
 3.05 Intr -  48699  48551  149  0  2    7   36   225 0.181   7.31
 3.04 Intr -  81933  81802  132  0  0   90   66    76 0.122   5.52
 3.03 Intr -  82999  82953   47  2  2   64   53    16 0.129  -7.09
 3.02 Intr -  83604  83288  317  2  2    9   49   550 0.299  37.98
 3.01 Init -  83841  83684  158  0  2   62   32   183 0.358   7.87
 3.00 Prom -  88718  88679   40                              -4.25

 4.00 Prom +  93507  93546   40                              -3.55
 4.01 Init + 100548 100648  101  2  2  104   51   173 0.004  12.98
 4.02 Intr + 104647 104701   55  1  1   95   69    33 0.004   0.16
 4.03 Intr + 108623 108883  261  1  0   36   92   217 0.740  13.66
 4.04 Intr + 109300 109587  288  0  0   23   75   181 0.634   6.82
 4.05 Intr + 109956 110072  117  2  0   67   24    89 0.235   0.14
 4.06 Intr + 118007 118158  152  1  2   98  -12   171 0.033   6.14
 4.07 Intr + 129624 129817  194  0  2   45  111    68 0.010   3.01
 4.08 Intr + 148862 149134  273  0  0   65   58   179 0.035   9.19
 4.09 Intr + 162948 163045   98  1  2   42   72   108 0.125   3.41
 4.10 Intr + 174654 175272  619  2  1   84   49   393 0.220  25.96
 4.11 Term + 197508 197824  317  1  2   47   44   227 0.767   8.32
 4.12 PlyA + 199074 199079    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 140630 140518  113  2  2   87   83    92 0.826   8.45
S.002 Term + 182076 182239  164  1  2   62   38   126 0.924   2.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:22596026_22796574|GENSCAN_predicted_peptide_1|94_aa
MAKKHMTKCSSSLTMTEMHIKTTMRHRLTPVRMAVIEKTKNNRLLVSTRVKGNSYTLIPT
LWRIRHLRPYLGHGHWALTRHQTCQHLDLGLSGL

>gi568815585r:22596026_22796574|GENSCAN_predicted_CDS_1|285_bp
atggccaaaaagcatatgacaaaatgctcatcatcactaaccatgacagaaatgcacatt
aaaaccactatgaggcatcgtcttactccagtcagaatggctgttattgaaaagacaaaa
aacaacagattgttagtgagcacacgggtaaaagggaactcttatacactcatccctacc
ctctggaggatacggcacttaagaccctatcttggacatggacactgggctctcaccaga
caccaaacctgccagcaccttgatcttggactctccggcctctag

>gi568815585r:22596026_22796574|GENSCAN_predicted_peptide_2|85_aa
MAKIQNIDNTKMLVKMWSKMSGPFAAGLLEFAGGPLQTLFAWVSPVETQLEMQKSPTFWV
DLALELHTGAVLIRSSFLVSHIVIF

>gi568815585r:22596026_22796574|GENSCAN_predicted_CDS_2|258_bp
atggccaaaatccagaacattgacaacaccaaaatgctggtgaagatgtggagcaaaatg
tcaggcccctttgctgcaggtctcctcgagtttgctggaggtccgctccagaccctgttt
gcctgggtatcaccagtggagactcagttggaaatgcagaaatcacccaccttctgggtt
gatctcgctttggagctgcacaccggagctgttcttattcggtcatctttcctagtctcc
cacatagtaattttctag

>gi568815585r:22596026_22796574|GENSCAN_predicted_peptide_3|386_aa
MVIGMVVALMVLMVVVIVVVMVINVVVLMVLVRMVVMVMVVMVTVVVVKVMGCFGVDGNG
GDGEGSGGSESGGGDGGGNDNDGDDHGGSDSDGGDGDDFSDGSKSGGDGGNGDDVGGEEV
GGADGNGGGEGGGGGEGSVCGGGGGGNTGDDSGDGIGGVPEAQSLRQDGKCRFKIKIEVG
TEIWRLAVINRDFILQEGGKVFTQCKTKFRVGGRNIKIADTQAAGRREEQIGRGTHGWLD
VERNTVTGTCMPAGHRFAEQQRGVWLRQAPGTAAAATPTAAGSAIVHSAGDSVGLFQVPA
PAHTGVDVATLLGAGCPPCLGLHVPGQGACTCIYLVPARGGAPAGGLPLDTCEDLHLPSE
CPVHKSVQYQNSKQKTPGMVEGTWKG

>gi568815585r:22596026_22796574|GENSCAN_predicted_CDS_3|1161_bp
atggtgatagggatggtggtggcgttgatggtgctgatggtggtggtgatagtggtggta
atggtgattaatgttgtggtgctgatggtgctggtgaggatggtggtgatggtgatggta
gtaatggtgacggtggtggtggtgaaggtgatggggtgttttggtgttgatggtaatggt
ggtgatggtgaaggtagtggtggcagtgaaagtggtggaggtgatggtggtggtaatgat
aatgatggtgatgatcatggtggtagtgacagtgatggtggtgatggtgatgatttcagt
gatggtagtaaaagtggtggtgatggtggtaatggtgatgatgttggtggtgaagaagtt
ggtggtgcagatggtaatggtggtggtgaaggtggtggtggtggtgagggtagtgtttgt
ggcggcggtggtggtggtaatactggtgatgatagcggtgatggtattggtggggtgccc
gaagcccagagtttgaggcaggatggaaaatgcagatttaagattaagattgaggtgggg
acagagatctggagattggcggtaataaatagagactttattctacaagaaggagggaag
gtgtttactcaatgcaagacgaagttcagagtgggtggtaggaatataaagatagcagac
acacaagctgctggacgtcgagaggagcagataggcagaggaacacacgggtggctggat
gtcgagaggaatacagtaacaggcacctgcatgccggcaggccaccgatttgcagaacaa
caacgtggagtttggctgcggcaggccccagggaccgctgctgcagctacacccacggca
gctggcagtgccattgtccattcagcaggagactcagtggggctgttccaggtgcccgcc
ccagcgcacactggcgtggatgttgccacactcttaggggctggctgtccaccatgcttg
gggctgcatgtccctgggcagggggcatgcacgtgcatttatctggtgccagcgaggggt
ggggcaccagctgggggcctgcccctggacacctgtgaggatctgcacttacccagtgag
tgccctgtgcataagagtgtgcaatatcaaaatagcaagcagaaaacaccaggaatggtt
gaggggacatggaagggctaa

>gi568815585r:22596026_22796574|GENSCAN_predicted_peptide_4|824_aa
MAATKQRRRWRRRWLRGAGAAAGALGQSEGAVKRTVSIKFNQGDTNELTVFQLLDSQKRD
PPGTQEATHGQQDHSRPSRPMVGTLLTICGDVAFLQKLCHINVKLVLDVIEGFGIILLRH
KSDGQVFSAKATCTGHWVEEVLLHEELGRGHAGLYRLDEDCHLFKFQHVQEIEKLAVLLS
VLELEVVLLRCVQRALGLVIHKHIHGTLHGLPAHRPDLLAQGGTEHDSLLLLEYQIWLGT
GCPYPGWPEECTHAGPQELQLQQKVPETRAVDGNALLTPFWGEVTRKHRVVFAQPLRFPG
TLQGSFRASLSPASQQQLTGFSSAGCVLACCLLPAACCLLPAWCVYPVKMTKPWISPACS
TSADFTCRSLGLVYNFTKLPPGDYQIFTAGSLCESAQLDPCLIATTEGFLYPEVLAPEES
DHTWAWRMSTRFYRVEVADGRRQNGMEWEGFPLESGRLVARALLCPSQTAGCSASARRWP
GVKVDEKKSENISDEKTFLQQEIKVLAVAMSKSKKQMGAEISPDETSPKPWNTNIQLTRD
NRQSGGGSLDLTSSRSEKRSQSNEEAKHGRLRGSKTLLHDTIMVHMSLRRRIPSMLLRWP
EISTARGAYTGLRRGLWTTGLALPTRPSRQRLAAATGPGSQACRRSVLSPQLARTCRTSF
HLECRGLNKEDAAAPENGDASNWRAPRGVTSFAQGVSRSEPAGSVTALSRSCHPQLSEWE
GQVKKPTHRRTLHGDWSSAAKESVEARKEPATDPSAAPPEEAQPWGLLDLTLLASRTVSQ
YISVVLAAQFAVLCCADSRKLDNPHAVQSKRYILTFSNTPSPKS

>gi568815585r:22596026_22796574|GENSCAN_predicted_CDS_4|2475_bp
atggcggcgactaagcagaggcggcggtggcggcggcggtggctgcgcggcgctggagcg
gcggcgggggccctggggcagtccgagggtgcggtgaagagaactgtttcaatcaagttc
aatcaaggtgacactaatgagctaacagtcttccagcttttggacagtcaaaagagagac
ccacctggcacacaggaagccacacacgggcagcaggaccactccaggccgtcgaggccc
atggtcgggacgctgctcaccatatgtggagatgttgctttcctccagaagctctgtcac
attaacgtcaaactggtcttagatgtcattgagggctttggcatcattctcctcagacac
aaaagtgatggccaggtctttagtgccaaagcaacctgcacaggccactgggtggaggaa
gttctgctccatgaggagctgggccggggacatgcagggctgtaccggcttgatgaagat
tgccacctgtttaaattccagcatgtccaggagatcgaaaagcttgcagttcttctcagt
gtccttgagcttgaggtagtcctgctgcggtgtgtgcagcgtgcgcttggtctcgtcatc
cacaaacacatccatgggaccctgcatggactccctgcacacaggccagatctccttgct
cagggtggcactgaacatgattcactgcttctgctggaatatcagatttggcttggcact
ggatgtccatacccagggtggcctgaagaatgcactcatgctggacctcaggagctccag
ttgcagcagaaagtcccagaaaccagagctgtggacggaaacgcacttctgactccattc
tggggtgaagtgacacggaagcatcgagtggtttttgcacagcctctgcggtttccagga
acactccagggcagcttcagggcctcactgtctccggcatcgcagcagcaattaactggg
ttctccagtgctgggtgtgtgcttgcctgctgcctgctgcctgctgcctgctgcctgctg
cctgcttggtgtgtgtatcctgtaaagatgacaaagccgtggatttcacctgcctgcagt
acatctgcagattttacctgcagatcattggggctagtttacaacttcactaaactgccc
ccaggggactatcaaatctttaccgcaggcagtttgtgtgagtcagctcaactggatccc
tgccttattgcaactacagagggctttctgtatcccgaggttcttgccccagaagaatcg
gatcacacgtgggcttggagaatgagtacaaggttttatcgagtggaagtagcagatggc
agaagacagaacggaatggagtgggaaggttttcccctagagtcgggccgtttagtggcc
cgggctctcctctgccccagccaaactgcaggttgttctgcttctgcccgtcggtggcct
ggtgttaaagtagatgagaagaaatctgagaatatttcggatgaaaagacttttttacag
caggaaataaaggtattagctgttgccatgagtaagtcgaaaaagcaaatgggagcagag
atatcaccagatgagacatcaccaaaaccatggaacactaacatccaactaacaagggac
aacagacagagcggtggaggaagtttagacctaacttcatctaggagtgaaaaacgaagc
cagagtaatgaagaggcaaagcatggaagacttcggggcagtaaaactcttctacatgat
accataatggtgcacatgtcattaagacgcaggatcccttccatgctgcttcgctggcca
gaaatctccacagccagaggcgcctataccgggcttcgccggggcctctggaccactggg
ctcgctctgcccacgcggccaagcaggcaacgcttggctgctgctactgggcctggatcc
caagcctgccgccgctctgtgctcagcccgcagttggcccggacatgccgcaccagcttc
caccttgagtgccggggtctgaacaaggaggacgcagcggcacctgaaaacggagatgct
agcaactggcgagccccaaggggtgttacatcttttgctcagggagtctcgaggtctgag
cccgcaggaagtgttacagctctctcacgttcttgccacccacagctcagcgaatgggaa
ggccaggtgaagaagccgacacacaggagaacgctgcatggagactggagttctgctgcc
aaggaatcggtggaagccaggaaagagcctgcaacagatccttccgccgcacctccagaa
gaagcacagccctggggactgctggatcttacacttctggcttccagaacagtgagtcaa
tacatttctgttgttttagcggctcagtttgcggtcctttgttgtgccgactccaggaaa
ctggacaatccccatgccgtgcaaagcaaacgttatattctaaccttcagtaatacaccc
tctcccaagtcctga