GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:14:04 Sequence gi568815589f:114223134_114329904 : 106771 bp : 51.71% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4379 4456 78 1 0 72 44 50 0.331 0.09 1.02 Intr + 7946 7999 54 1 0 146 121 44 0.984 13.16 1.03 Intr + 8689 8725 37 0 1 88 54 36 0.498 -1.48 1.04 Intr + 8882 8982 101 0 2 95 91 15 0.473 2.83 1.05 Intr + 12466 12519 54 0 0 75 86 23 0.314 0.46 1.06 Intr + 13848 13901 54 2 0 94 119 -18 0.423 1.56 1.07 Intr + 14529 14582 54 2 0 147 87 41 0.993 9.56 1.08 Intr + 17087 17140 54 1 0 134 113 67 0.473 13.46 1.09 Intr + 17301 17354 54 2 0 130 78 21 0.456 4.96 1.10 Intr + 19054 19098 45 0 0 93 105 19 0.752 3.19 1.11 Intr + 22733 22777 45 1 0 121 92 30 0.606 5.79 1.12 Intr + 26856 26960 105 2 0 59 12 134 0.668 3.81 1.13 Intr + 27482 27535 54 1 0 47 115 20 0.419 0.26 1.14 Intr + 29460 29513 54 2 0 109 91 1 0.550 2.16 1.15 Intr + 29746 29799 54 0 0 109 115 26 0.789 7.06 1.16 Intr + 35408 35461 54 1 0 102 114 9 0.824 4.56 1.17 Intr + 38920 39120 201 0 0 90 68 19 0.387 0.10 1.18 Intr + 41222 41275 54 1 0 104 81 -3 0.529 0.26 1.19 Intr + 41791 41835 45 0 0 131 91 -2 0.891 3.49 1.20 Intr + 42289 42342 54 0 0 97 111 10 0.917 3.86 1.21 Intr + 43432 43485 54 0 0 129 113 13 0.968 7.56 1.22 Intr + 46108 46161 54 0 0 57 114 25 0.742 1.66 1.23 Intr + 47595 47648 54 2 0 98 115 29 0.965 6.26 1.24 Intr + 52528 52635 108 0 0 79 100 83 0.737 9.58 1.25 Term + 52774 52914 141 0 0 46 37 64 0.359 -4.66 1.26 PlyA + 53472 53477 6 -0.45 2.02 PlyA - 54736 54731 6 -0.45 2.01 Sngl - 55446 54997 450 0 0 81 38 541 0.999 44.97 2.00 Prom - 56371 56332 40 -0.51 3.00 Prom + 56377 56416 40 -1.81 3.01 Init + 58604 58921 318 1 0 77 25 79 0.505 -3.57 3.02 Intr + 59144 59197 54 1 0 128 100 70 0.996 11.86 3.03 Intr + 59324 59431 108 1 0 129 89 83 0.992 13.48 3.04 Intr + 60576 60629 54 2 0 85 69 27 0.528 0.16 3.05 Intr + 61591 61644 54 0 0 133 98 55 0.666 10.66 3.06 Intr + 65322 65378 57 2 0 96 102 88 0.991 10.57 3.07 Intr + 65569 65622 54 0 0 93 95 42 0.840 5.06 3.08 Intr + 65781 65834 54 2 0 126 86 5 0.639 3.76 3.09 Intr + 66039 66162 124 2 1 118 94 34 0.577 7.66 3.10 Intr + 66647 66801 155 0 2 56 23 47 0.472 -4.70 3.11 Intr + 67091 67198 108 1 0 98 99 45 0.846 7.58 3.12 Intr + 67677 67784 108 2 0 88 77 94 0.997 9.28 3.13 Intr + 68970 69077 108 2 0 78 117 31 0.889 5.98 3.14 Intr + 76937 76990 54 1 0 125 86 23 0.659 5.46 3.15 Intr + 77492 77554 63 1 0 100 101 51 0.992 7.01 3.16 Intr + 77939 77992 54 1 0 81 96 32 0.712 2.96 3.17 Intr + 78151 78186 36 0 0 104 106 39 0.899 6.24 3.18 Intr + 78312 78329 18 2 0 137 96 4 0.793 3.39 3.19 Intr + 78549 78584 36 2 0 134 94 -19 0.647 2.24 3.20 Intr + 78949 78975 27 0 0 131 111 7 0.774 6.10 3.21 Intr + 81475 81612 138 0 0 62 49 71 0.596 1.77 3.22 Intr + 83387 83555 169 1 1 90 81 199 0.963 19.43 3.23 Intr + 84536 84645 110 0 2 92 75 102 0.978 9.80 3.24 Intr + 86127 86345 219 2 0 128 88 284 0.999 31.43 3.25 Term + 87416 87562 147 1 0 109 43 211 0.999 16.91 3.26 PlyA + 89350 89355 6 1.05 4.00 Prom + 96761 96800 40 -0.21 4.01 Init + 100001 100114 114 1 0 68 100 190 0.999 16.37 4.02 Intr + 100530 100672 143 2 2 92 97 178 0.997 18.76 4.03 Intr + 100885 100955 71 1 2 76 89 112 0.996 9.42 4.04 Intr + 101657 101764 108 0 0 94 66 82 0.993 7.36 4.05 Intr + 101916 102019 104 1 2 111 92 193 0.888 22.39 4.06 Term + 103159 103224 66 0 0 118 43 77 0.993 4.43 4.07 PlyA + 103318 103323 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:114223134_114329904|GENSCAN_predicted_peptide_1|571_aa MVLTSWGYCEQLVRVQATSIVPGMAKGDLGVLGPIGYPGPKGMKGLMGSVGEPGLKASSW SLWVGLRQMVLGARKPQQIVQEQLLQQGGQGEQGVPGVSGDPGFQGDKGSQGLPGFPGAR GKPGPLGKVGDKGSIGFPGPPGPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPRGQLGPE GDEGPMGPPGAPGLEGEPGDPGRPGPVGEQGDGKDWGHLEEPPGTQRQVVTQRSSAEQQE LQRVSGFMGFIGLVGEPGIVGEKGDRGMMGPPGVPGPKGSMGHPGMPGGMGTPGEPGPQG PPGSRGPPGMRGAKGRRGGVCLSERQSPEPEIIPQEGSVVCHQLSPCPSSHDLTRPHPMW LQLRQACAHESRVLSLMTWCDSVHGPRGPDGPAGEQGSRGLKGPPGPQGRPGRPGQQGIP GPSGPPGTKGLPGEPGPQGPQGPIGPPGEMGPKGDLGPLGTPGEQGLIGQRGEPGLEGDS GPMGPDGLKGDRGDPGPDGEHGEKGQEGLMGEDGPPGPPGVTGVRFAREPICSMLHGSGE QPRLLILLAALPCANLVPADLISSIPHIQTH >gi568815589f:114223134_114329904|GENSCAN_predicted_CDS_1|1716_bp atggtgcttacctcctggggttactgtgagcagctggtgagggtccaagccaccagcatc gtccctggcatggcaaagggtgacttaggagtgttgggtccgattggctacccgggaccc aagggcatgaagggactgatgggcagcgtgggggagcccggactgaaagcttcttcctgg tcactctgggtggggctcaggcagatggttctgggagcccgaaaacctcagcagatagtg caggagcagctgttgcagcaaggagggcagggtgaacaaggggttccaggtgtgtcagga gatcccggattccaaggagacaaggggagccaggggttgccagggttccccggtgcacgg gggaagccagggcctctgggcaaagtcggagacaaaggatccattgggtttcccgggccc cctggacccgagggattcccaggagacatcggcccccctggcgacaatggcccagaaggc atgaagggtaagcctggagcccgaggcctgccgggaccccgtgggcagctggggcccgag ggagatgagggacccatggggccgccaggggcccctggcttggagggggaaccaggggat cctggtcggccggggcctgtgggagagcagggagacggcaaggactggggccacctagaa gaaccccctggaacccagaggcaagttgtcacccagcggagttcagcagagcagcaggag ctgcagcgtgtgtctggatttatgggattcattggtctggtcggggagccaggaatcgtg ggagaaaagggtgatcgtggcatgatgggacccccaggcgtgcctggacccaaggggtcg atgggtcatcctggaatgccaggtggtatggggacccctggagagcctggaccccagggt cctccaggatctcgaggcccaccaggcatgaggggagcaaagggacgtcggggaggtgta tgtctcagcgagagacagagcccagagccagagatcatccctcaggagggttcagttgtc tgccatcagctcagcccctgtcccagctcccatgacctgacacggccccaccccatgtgg ctccagctccgccaggcctgtgcccatgaatcacgggtgttaagcctaatgacgtggtgt gacagtgtccacggcccccgaggaccggacggaccagctggggagcaagggtccaggggc ctgaagggccctccaggaccccagggcagaccgggccggcctggacagcagggcatcccg ggtccctcaggccccccaggcaccaagggcctcccaggagaaccgggccctcagggaccc caggggccaattgggcctccaggagagatgggacccaagggtgaccttggacccctgggc actcctggggagcagggcctcattgggcaacggggagagccaggccttgagggtgacagt ggccccatgggacctgatgggctgaagggggacaggggagacccagggcctgatggagaa catggcgagaaaggccaggaagggctgatgggtgaggacgggccccccggcccccctggc gtcactggtgtccggtttgccagagaaccaatctgctcaatgctccatggctctggagaa caacctcggttactcatcctcctggctgccctgccctgtgcaaacctggtccctgccgac ctcatctcatccatcccccatatccagacacactaa >gi568815589f:114223134_114329904|GENSCAN_predicted_peptide_2|149_aa MTTIITPLPSLLPSLPPIITTTTTIISTTTTIITTTITTTIISAPSSSLSLPLSLSPPPS PPAPSSPSPPLHYHHRHQHQHHHHHLITVTTTTTTTINTPIIFIIITTTNTIISTTTITI IIIVITTTTIAAAPTHTIHSPSPPATTSL >gi568815589f:114223134_114329904|GENSCAN_predicted_CDS_2|450_bp atgaccaccatcatcactcccctgccatcactactaccatcactaccacccatcatcacc accactaccactatcattagtaccaccaccaccattatcaccaccaccatcaccaccacc atcatcagtgccccatcatcatcattatcattaccactatcactatcaccaccaccatcg ccaccagcaccatcatcaccctcacctccactccactatcaccatcgccaccagcaccag caccaccatcaccacctcatcacagtcaccactaccaccactaccaccatcaacaccccc atcatcttcatcatcatcaccaccaccaacaccatcatcagtactaccaccatcaccatc attattatcgtcattaccaccactaccattgctgctgcccccacccacaccatccactca ccatcaccaccagccaccaccagcctctag >gi568815589f:114223134_114329904|GENSCAN_predicted_peptide_3|808_aa MEPEQLGLIQGLQGAAGTLGKSHCIFGNLAASTGSGSHWGLMFKVVGRVEQHLALERAQC RELLPVGHPWMPNSNHDNSKASNCRSRTRHPPLHPGLTVSTFLHRQGPEGKSGKQGEKGR TGAKGAKGYQGQLGEMGVPGDPGPPGTPGPKGSRGSLGPTGAPGRMGAQGEPGLAGYDGH KGIVGPLGPPGPKGEKGEQGEDGKAEGPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQG LRGKPGQQGQPAETGHPPRERILGLPCVISPWFAGASGTPGVAGTQRIERRRVSKRSSGT VASGSSQKIVNASEGTPRPRAGSVTAPGGSKASSGPRLPGQSAWGLQGLPGPRGVVGRQG LEGIAGPDGLPGRDGQAGQQGEQGDDGDPGPMGPAGKRGNPGVAGLPGAQGPPGFKGESG LPGQLGPPGKRGTEGRTGLPGNQGEPGSKGQPGDSGEMGFPGMAGLFGPKGPPGDIGFKG IQGPRGPPGLMGKEGIVGPLGILGPSGLPGPKGDKGSRGDWGLQGPRGPPGPRGRPGPPG PPGGPIQLQQDDLGAAFQTWMDTSGALRPEVSPGALPMWDPFLGETQIDNGPHVSYSYPD RLVLDQGGEIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQKMVDGTYWVDPNLG CSSDTIEVSCNFTHGGQTCLKPITASKVEFAISRVQMNFLHLLSSEVTQHITIHCLNMTV WQEGTGQTPAKQAVRFRAWNGQIFEAGGQFRPEVSMDGCKVQDGRWHQTLFTFRTQDPQQ LPIISVDNLPPASSGKQYRLEVGPACFL >gi568815589f:114223134_114329904|GENSCAN_predicted_CDS_3|2427_bp atggagccagaacagctgggattgattcagggcctgcaaggcgcagctgggaccctgggc aaaagtcactgtatctttgggaacttggctgcctcaactggaagcgggtctcattggggc ctaatgttcaaggtcgttgggagggtggaacagcacttggcgctggagagagctcagtgc agagagttgctccctgtaggacacccctggatgcctaacagtaaccatgacaacagcaaa gccagcaattgcagaagccgcacccgccatccacccctgcacccaggcctcactgtcagc acgttcttacatcgccagggtcctgaaggaaaatcagggaagcaaggcgagaagggccgc actggagccaagggtgccaagggctatcaaggacagctgggtgagatgggcgtccctgga gaccctggaccccctggcactccaggccctaaagggtcccggggcagcctgggaccaacg ggtgctccgggacgcatgggggcccaaggagaaccgggactggctggttatgatggacac aaaggcattgtgggaccccttggacctcctggaccaaaaggcgaaaagggggagcagggc gaggacggcaaggctgaggggccccctgggccacctggagatcggggccctgtgggtgat cgaggagaccgcggggaaccgggagaccctgggtaccctggacaggagggtgtgcaaggc ctccgtggaaagccaggccagcagggccaacccgcggagactggccaccctccccgggag agaatcctggggctgccttgcgtcatctcaccttggtttgcaggggcatccgggaccccg ggggtggccgggacccaaaggatcgaaaggcgcagagtctctaaaagatccagtggaact gtggcctctggctcctcccagaaaattgtcaatgccagtgaggggacaccaaggcccaga gcaggctcagtgacagccccaggaggttccaaggccagctctggccccaggcttccaggc cagagtgcttggggcctgcaggggctgccagggccccggggcgtggtggggagacagggc ctcgagggcatcgctggaccagatgggcttcctggcagggacgggcaagcaggacagcag ggggagcagggagacgatggggaccctggccccatgggccctgctgggaagagaggaaat ccaggtgtggccggcttacctggagcacagggacccccaggattcaagggtgagagtggg ttacccggacagctgggtccccctggcaagcgaggaacagagggcagaacggggctccct ggaaaccagggggagcctgggtccaaaggccagccgggcgactctggcgagatgggcttc ccaggaatggcaggtctcttcggacccaagggcccgcctggagacattggcttcaaaggc atccagggccctcgggggccacctggcttgatgggaaaggaaggcatcgtcgggcccctc ggaatcctgggaccttcgggactcccgggtccgaagggtgacaaaggcagccgtggggac tggggattgcaaggtccgaggggtcctcccggccccagagggcggcccggccccccgggt cctccagggggtcctatccaattgcaacaagatgatcttggggcagctttccagacgtgg atggacaccagtggagcactcaggccagaggtatctccaggggctctccccatgtgggat cccttcctgggagaaacgcaaatagataatggcccacatgtgagttacagctatccagac cggctggtgctggaccagggaggagagatctttaaaaccttacactacctcagcaacctc atccagagcattaagacgcccctgggcaccaaagagaaccccgcccgggtctgcagggac ctcatggactgtgagcagaagatggtggatggtacctactgggtggatccaaaccttggc tgctcctctgacaccatcgaggtctcctgcaacttcactcatggtggacagacgtgtctc aagcccatcacggcctccaaggtcgagtttgccatcagccgggtccagatgaatttcctg cacctgctaagctccgaggtgacccagcacatcaccatccactgccttaacatgaccgtg tggcaggagggcactgggcagaccccagccaagcaggccgtacgcttccgggcctggaat ggacagatttttgaagctgggggtcagttccggcccgaggtgtccatggatggctgcaag gtccaagatggccgctggcatcagacactcttcaccttccggacccaagacccccaacag ctgcccatcatcagtgtggacaacctccctcctgcctcatcagggaagcagtaccgcctg gaagttggacctgcgtgcttcctctga >gi568815589f:114223134_114329904|GENSCAN_predicted_peptide_4|201_aa MALSWVLTVLSLLPLLEAQIPLCANLVPVPITNATLDRITGKWFYIASAFRNEEYNKSVQ EIQATFFYFTPNKTEDTIFLREYQTRQDQCIYNTTYLNVQRENGTISRYVGGQEHFAHLL ILRDTKTYMLAFDVNDEKNWGLSVYADKPETTKEQLGEFYEALDCLRIPKSDVVYTDWKK DKCEPLEKQHEKERKQEEGES >gi568815589f:114223134_114329904|GENSCAN_predicted_CDS_4|606_bp atggcgctgtcctgggttcttacagtcctgagcctcctacctctgctggaagcccagatc ccattgtgtgccaacctagtaccggtgcccatcaccaacgccaccctggaccggatcact ggcaagtggttttatatcgcatcggcctttcgaaacgaggagtacaataagtcggttcag gagatccaagcaaccttcttttacttcacccccaacaagacagaggacacgatctttctc agagagtaccagacccgacaggaccagtgcatctataacaccacctacctgaatgtccag cgggaaaatgggaccatctccagatacgtgggaggccaagagcatttcgctcacttgctg atcctcagggacaccaagacctacatgcttgcttttgacgtgaacgatgagaagaactgg gggctgtctgtctatgctgacaagccagagacgaccaaggagcaactgggagagttctac gaagctctcgactgcttgcgcattcccaagtcagatgtcgtgtacaccgattggaaaaag gataagtgtgagccactggagaagcagcacgagaaggagaggaaacaggaggagggggaa tcctag