GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:47:25 Sequence gi568815575f:16551007_16754506 : 203500 bp : 43.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 204 199 6 -3.24 1.04 Term - 324 266 59 1 2 129 48 55 0.088 3.45 1.03 Intr - 551 435 117 0 0 70 50 68 0.063 1.64 1.02 Intr - 2606 2585 22 2 1 97 98 -6 0.045 -1.58 1.01 Init - 24715 24434 282 1 0 88 69 208 0.784 14.80 1.00 Prom - 45672 45633 40 -2.46 2.09 PlyA - 45956 45951 6 -0.45 2.08 Term - 47088 46004 1085 1 2 72 47 301 0.021 16.67 2.07 Intr - 50544 50346 199 0 1 110 53 23 0.113 -0.08 2.06 Intr - 56514 56221 294 0 0 87 54 178 0.925 11.51 2.05 Intr - 58630 58535 96 1 0 39 91 55 0.522 1.11 2.04 Intr - 65330 65239 92 0 2 82 60 18 0.453 -1.89 2.03 Intr - 66240 66144 97 0 1 125 108 61 0.978 11.38 2.02 Intr - 69326 69271 56 0 2 121 80 19 0.465 3.10 2.01 Init - 70212 70122 91 0 1 69 26 96 0.370 0.40 2.00 Prom - 74902 74863 40 -2.66 3.00 Prom + 79845 79884 40 -4.96 3.01 Init + 90972 91026 55 2 1 103 36 52 0.460 2.95 3.02 Intr + 95304 95483 180 1 0 42 105 80 0.532 4.94 3.03 Intr + 99175 99197 23 2 2 98 92 0 0.174 -1.34 3.04 Intr + 99263 99294 32 1 2 94 113 22 0.199 2.23 3.05 Intr + 99897 99963 67 0 1 35 111 69 0.200 2.81 3.06 Intr + 100752 100880 129 2 0 26 91 51 0.507 0.09 3.07 Term + 103399 103503 105 0 0 85 49 71 0.915 1.31 3.08 PlyA + 103642 103647 6 1.05 4.17 PlyA - 104671 104666 6 1.05 4.16 Term - 115242 115144 99 0 0 83 39 66 0.662 -0.67 4.15 Intr - 116551 116508 44 2 2 94 119 8 0.741 2.46 4.14 Intr - 120442 120366 77 0 2 78 96 61 0.665 5.06 4.13 Intr - 127444 127356 89 1 2 60 61 83 0.612 1.57 4.12 Intr - 132220 132088 133 0 1 115 101 164 0.985 21.05 4.11 Intr - 138595 138444 152 1 2 47 115 20 0.826 -0.44 4.10 Intr - 140614 140534 81 1 0 34 77 95 0.851 2.83 4.09 Intr - 142218 142135 84 0 0 77 105 12 0.770 1.82 4.08 Intr - 142481 142365 117 2 0 82 80 74 0.900 6.66 4.07 Intr - 145413 145283 131 1 2 73 87 10 0.944 -0.19 4.06 Intr - 148087 147917 171 2 0 82 62 117 0.977 8.41 4.05 Intr - 151910 151731 180 0 0 20 80 217 0.259 13.94 4.04 Intr - 160587 160507 81 1 0 76 111 -1 0.337 0.61 4.03 Intr - 161605 161329 277 1 1 23 93 272 0.916 18.29 4.02 Intr - 168827 168523 305 0 2 87 26 312 0.481 21.21 4.01 Init - 200127 200055 73 0 1 50 52 102 0.043 4.03 4.00 Prom - 201898 201859 40 -3.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 46972 46004 969 1 0 60 47 263 0.951 16.13 S.002 Sngl + 47340 47891 552 2 0 24 37 309 0.859 15.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:16551007_16754506|GENSCAN_predicted_peptide_1|159_aa MAGCRSRALPHGKAAKARREIQHSAGGPALLGDPAHPPQPLARVLSPSLPGARRAGQLLR LPGRQAHAHPELQLARKRRTQPRFPLAPLPPHLPILYNEFYMRFGDAAPIYYLDFLTEMV LGAPAGFYPYLGQRMHGDGGGQIQFYLEKIGAKPETAKL >gi568815575f:16551007_16754506|GENSCAN_predicted_CDS_1|480_bp atggcgggctgcaggtcccgagccctgccccacgggaaggcagctaaggcccggcgagaa atccagcacagcgctggtgggccggcactgctgggggacccagcgcaccctccgcagccg ctggcccgggtgctaagcccctcattgcccggggcccgcagggccggccagctgctccga ctgccgggccgccaagcccacgcccacccggaactccagctggcccgcaaacgccgcacg cagccccggttcccgctcgcgcctctccctccacacctccctatactatacaatgagttt tacatgagatttggggatgcagcccctatttattacctggattttcttacagaaatggtt ctaggagcaccagcaggcttttacccttacctgggacagaggatgcatggagatggagga ggacaaatccagttttatctagagaagattggggctaaaccagaaactgcaaaactgtga >gi568815575f:16551007_16754506|GENSCAN_predicted_peptide_2|669_aa MKSLGSSLIAVPIATLVMPRVLCLAAVPQLRKLYGDVPFIEERHRHRFEVNPNLIKQFEQ NDLSFVGQDVDGDRMEIIELAKLFLRANRTEIKASELRKNTEVVVSNQAGTPADEAFPSV SGAVTCSNWEPECLLATGLQTVFQCTDFQTVLSELESGLLREQAMGALGSFLLQLSSSTL SGSCAMVLQERFHLKKSFLCSETAQLMPLIGHPNVVPCQMWAVTLCSRFTIPVILDSHTE RKDKMHTNPYGTACSSLNSPEGFACCVLIHIPDLLSERGIGIYILLEWLFTDSLGEQGES EVTRWRIGAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRI KYLGIHLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFN AIPIKLPMTFFTELEKTTLKFIWNQKRALIAKSILSQKNKAGGITLPDFKLYCKATVTKT TWYWYQNRDIDQWNRTEPSEITPHICNYLIFDKPEKNKEWGKDSLFNKWCWENWLAICKR LKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMCKTPKAMATKA KIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKQT TPSKSGRRT >gi568815575f:16551007_16754506|GENSCAN_predicted_CDS_2|2010_bp atgaagtcactaggcagctcgttgatcgccgtgcccatcgccacactggtcatgcccagg gtgctgtgcttagctgcagtgcctcagctacggaaactttatggtgatgttccttttata gaagaaagacacagacatcggttcgaggtaaaccctaacctgatcaaacaatttgagcag aatgacttaagttttgtaggtcaggatgttgatggagacaggatggaaatcattgaactg gcaaagttgtttttaagagcaaacagaacagaaataaaggcatcagaattgaggaagaac accgaagttgttgttagtaaccaggctggaacacccgccgatgaagccttcccctccgta tctggggctgttacttgcagcaactgggaacctgaatgcctacttgcaacagggttgcaa actgtcttccagtgcactgactttcaaactgtgctctcggagctggagtctgggctgctg cgggagcaggccatgggggctctgggctccttcctgcttcagctgagcagctccacttta tctggctcatgtgccatggttctgcaagagagatttcatctgaagaagagcttcctgtgt tctgagactgcacagcttatgccactgattggacatcctaatgtggtcccatgtcagatg tgggctgtgaccctttgcagccgttttacaattcccgttattttagattctcacactgaa aggaaggataaaatgcacacaaatccatatggaacagcttgcagttccctgaactcccca gagggttttgcctgttgtgtcttgatacacattcctgacctgcttagtgagaggggaata gggatctatatcctattagaatggttgttcacggactcgttaggtgaacaaggtgaatcg gaggtgacaaggtggcgaatcggagcccaaaatctccttaagctgataagcaacttcagc aaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataac agacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaata aaatacctaggaatccaccttacaagggacgtgaaggacctcttcaaggagaactacaaa ccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatgg gtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaat gccatcccaatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaag ttcatatggaaccaaaaaagagccctcattgccaagtcaatcctaagccaaaagaacaaa gctggaggcatcacactacctgacttcaaactatactgcaaggctacagtaaccaaaaca acatggtactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaa ataacaccgcatatctgcaactatctgatctttgacaaacctgagaaaaacaaggaatgg ggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgtaaaagg ctgaaactggatcccttccttacaccttataccaaaatcaattcaagatggattaaagac ttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcag gacataggcatgggcaaggacttcatgtgtaaaacaccgaaagcaatggcaacaaaagcc aaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactacc attagagtgaacaggcaacccacaaaatgggagaaaattttcgcaacctactcatctgac aaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaacaaaca accccatcaaaaagtgggcgaaggacatga >gi568815575f:16551007_16754506|GENSCAN_predicted_peptide_3|196_aa METNTEKELEEAKEWYRGSPWMSLQTGGTNSIHFIKLQQGFKEAVGVGCLGQHTAHSKDT HSRSTGYGQETSGNSTAEAVSLLGNQNGVKDNGLERRKWPNGTKILQTTLSTKLYFSFLS CQNQNAFKATEYREELIVLTPIYSTVFPFAKEKAESGHLPRMGPNTLDDLFQELDKNGDG EVSFEEFQVLVKKISQ >gi568815575f:16551007_16754506|GENSCAN_predicted_CDS_3|591_bp atggagacaaacacagagaaagagctagaagaagctaaagagtggtacagagggagtcct tggatgtctctacagactggggggaccaacagcatccacttcataaagttgcagcaagga tttaaggaggccgtgggagttgggtgcctaggtcagcacactgcccatagtaaagatacg cactcaagaagcacaggctatggtcaggagacctctgggaactcaactgctgaagctgtt tcactattgggcaaccagaatggtgtgaaagataatggtctggagagaaggaaatggccc aatggcactaaaatcctgcagacaaccctcagcacaaaactgtacttcagctttttgagt tgtcaaaatcaaaatgcattcaaggcaacagagtacagagaagagctgatcgtcctcact ccaatatacagtactgtctttccttttgctaaagaaaaagctgagtcaggacacttgcca aggatgggtccaaacaccctagatgatctctttcaagaactggacaagaatggagatgga gaagttagttttgaagaattccaagtattagtaaaaaagatatcccagtga >gi568815575f:16551007_16754506|GENSCAN_predicted_peptide_4|697_aa MGGKREESLTEKDFERVEYDQSVLDSATVSDGCSGGASPLGCPPPATGCCKPNQLLKPRN IPLAGRRRGPDPQRPQRALVAFDSTLRDSQGDVRKSEEHRSRRSPASMTTPPRPRKRHFR PKAPPLALVLDHCTNPGANPKLTSVSELGQQQGGEGKVGEEGAAHPEARVGGGKIPGAAL DSPGSDPFPSPTGERQASRGVEPAVPTGHPPRCYHHRAGPALNPGSDCYHSLALTVLELH RTRSSATILPMKYILVTGGVISGIGKGIIASSIGTILKSCGLRVTAIKIDPYINIDAGTF SPYEHGEVFVLNDGGEVDLDLGNYERFLDINLYKDNNITTGKIYQHVINKERRGDYLGKT VQGAHWLDFKFHGNRETDKQNPFSVITLMFIFAYWILVVNHPLWLILGGTIGDIEGMPFV EAFRQFQFKAKRENFCNIHVSLVPQLSATGEQKTKPTQNSVRALRGLGLSPDLIVCRSST PIEMAVKEKISMFCHVNPEQVICIHDVSSTYRVPVLLEEQSIVKYFKERLHLPIGDSASN LLFKWRNMADRYERLQKICSIALVGKYTKLRDCYASVFKALEHSALAINHKLNLMYIDSI DLEKITETEDPVKFHEAWQKLCKAEKMLPETGPNPDPKRGFLDLAQERIQDADSTEFRPN APVPLSEFCKMMVGLQSACLPATHAHPVVSLLLGTYF >gi568815575f:16551007_16754506|GENSCAN_predicted_CDS_4|2094_bp atggggggaaaacgagaagaaagcctgacagaaaaagactttgaaagagtagaatacgat cagtcagtgctagactcagccaccgtctcggacggctgctcgggtggagcatctccattg ggctgcccaccgcctgccaccggctgctgcaagcccaaccaactgctcaagccccggaac atcccgctcgcgggccgccgccgcggtcccgatccccagagaccgcagcgagcactggtt gcctttgactccactctgcgcgactcccagggagatgtccggaagtcggaggaacaccgc agccgccgcagccccgcctctatgacgactccgccgcggccgcgcaagcgtcacttccgt ccaaaggctccgcccctggcgcttgttttggaccattgcacaaacccgggtgcaaacccc aagctcaccagcgtgagtgagctgggccagcagcagggaggagaggggaaggtgggcgag gagggcgccgcgcaccccgaggcccgtgtgggcggtgggaagatcccgggggcggctttg gacagccccggcagcgaccccttccccagcccgacaggtgagcgccaggccagccgcggg gtggagcccgccgtgcccaccggccaccctccccggtgctaccaccaccgcgcaggcccc gcactcaaccccggttctgattgttatcatagcttagctttgactgttctagaacttcat agaacaagatcatcagccactattctgccaatgaagtacatcctggtcacgggtggggtc atctcaggcattggtaaagggatcattgccagcagcattggaacgattctaaaatcatgt ggactccgagttactgccataaaaatcgacccctatattaacatcgatgctggcactttt tcaccttatgaacacggtgaagtcttcgtcttaaatgatggtggagaagttgatttagac cttggaaattatgaaagatttttggatattaatctttataaagacaacaatatcaccacg gggaagatatatcagcatgtgatcaataaagagaggcgtggtgattacctggggaaaaca gtgcaaggagctcattggctggattttaaatttcatggaaacagggaaactgacaaacag aaccccttctccgttataaccctgatgtttatatttgcttattggatactagtggtcaac catcctctttggttgattctgggaggcaccattggagacatcgaaggaatgccgtttgtg gaggcgtttagacaattccagtttaaggcgaaaagagagaatttctgtaatatccacgtt agccttgtcccacagctcagtgctaccggagaacaaaaaaccaaacccacccaaaacagc gtccgcgcactgaggggtttaggcctgtctccagatctgattgtctgccgaagttcaacg cccattgagatggccgtgaaggagaagatttctatgttttgtcacgtgaaccctgaacag gtcatatgtatccatgatgtttcttccacataccgagttcctgtgcttttagaggaacaa agcattgtgaaatattttaaggagagattgcacctgcccatcggtgattctgcaagtaat ttgctttttaagtggagaaatatggctgacaggtatgaaaggttacagaaaatatgctcc atagccctggttggcaaatacaccaagctcagagactgctacgcctctgtgttcaaagcc ctggaacactcagccctggccatcaaccacaagttgaatctgatgtacatagactccatt gatctggagaagatcactgaaaccgaggaccctgtgaaatttcatgaagcttggcagaag ctatgcaaagctgaaaagatgttaccagaaacgggtcccaatccagaccccaagagaggg ttcttggatcttgctcaagaaagaattcaggatgctgattccacagagtttaggccaaat gccccagttcctctgagcgagttctgcaagatgatggttggccttcagtcagcctgtttg ccagccacccatgcacacccagtcgtgtccctgctgctgggcacctacttctaa