GENSCAN 1.0 Date run: 16-Jul-119 Time: 15:41:56 Sequence gi568815589f:36481682_36777334 : 295653 bp : 44.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 5879 5627 253 2 1 95 96 222 0.984 19.20 1.00 Prom - 28370 28331 40 -3.66 2.00 Prom + 32674 32713 40 -2.66 2.01 Init + 49009 49181 173 0 2 42 57 109 0.340 2.11 2.02 Term + 50843 50984 142 2 1 119 49 31 0.194 -0.30 2.03 PlyA + 55290 55295 6 1.05 3.00 Prom + 60020 60059 40 -2.46 3.01 Init + 67218 67269 52 2 1 80 86 58 0.612 6.12 3.02 Intr + 90900 91072 173 1 2 23 97 17 0.011 -4.24 3.03 Intr + 91327 91517 191 0 2 117 94 72 0.663 9.08 3.04 Term + 92469 92589 121 0 1 76 36 53 0.584 -3.15 3.05 PlyA + 92927 92932 6 1.05 4.00 Prom + 95357 95396 40 -5.26 4.01 Init + 100001 100058 58 1 1 47 91 56 0.924 3.27 4.02 Intr + 101946 102031 86 1 2 80 98 45 0.959 4.24 4.03 Intr + 107855 107971 117 1 0 65 88 42 0.835 2.56 4.04 Intr + 112989 113090 102 2 0 43 100 101 0.553 7.17 4.05 Intr + 115541 115609 69 1 0 76 115 22 0.883 3.08 4.06 Intr + 125894 125992 99 1 0 125 101 9 0.935 6.21 4.07 Intr + 161316 161402 87 2 0 66 99 29 0.328 1.97 4.08 Intr + 170065 170196 132 0 0 97 115 58 0.880 10.24 4.09 Intr + 175560 175682 123 2 0 16 79 85 0.221 1.18 4.10 Intr + 183669 183900 232 2 1 62 63 70 0.275 -0.75 4.11 Intr + 187629 187725 97 1 1 59 115 53 0.961 4.17 4.12 Intr + 189317 189485 169 2 1 116 121 25 0.986 8.55 4.13 Intr + 193153 193256 104 0 2 101 78 37 0.965 2.97 4.14 Intr + 195479 195650 172 2 1 105 70 74 0.568 7.25 4.15 Intr + 231463 231523 61 0 1 114 103 17 0.235 4.11 4.16 Intr + 232512 232819 308 1 2 17 100 195 0.245 9.77 4.17 Intr + 247430 247544 115 1 1 67 55 90 0.065 3.62 4.18 Term + 249496 249509 14 2 2 106 43 5 0.094 -3.84 4.19 PlyA + 251815 251820 6 1.05 5.09 PlyA - 252831 252826 6 1.05 5.08 Term - 258975 258838 138 0 0 32 51 168 0.257 5.46 5.07 Intr - 270637 270532 106 0 1 45 83 116 0.461 7.02 5.06 Intr - 275147 275008 140 2 2 20 78 66 0.224 -1.94 5.05 Intr - 281523 281506 18 0 0 86 113 13 0.332 0.41 5.04 Intr - 284377 284234 144 1 0 79 27 130 0.664 6.48 5.03 Intr - 290717 290655 63 2 0 92 98 35 0.876 3.81 5.02 Intr - 291736 291630 107 2 2 117 81 61 0.999 8.23 5.01 Init - 295023 294939 85 0 1 65 80 60 0.242 3.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 13287 13207 81 0 0 5 54 148 0.856 0.79 S.002 Intr - 224310 224222 89 2 2 58 75 126 0.802 7.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:36481682_36777334|GENSCAN_predicted_peptide_1|85_aa MSGSAAAAVAAAAASAAAAAAHFQPQSSSPRPGPAELGPPRRPRPPPPPPRPEPAATDRL SPPGHGPLRAGAQGGGACGRAPGAX >gi568815589f:36481682_36777334|GENSCAN_predicted_CDS_1|255_bp atgagcggaagcgcggccgccgcggtcgctgccgccgccgcctcagcagccgccgccgcc gcgcactttcagcctcagtcgtcgtctcccaggccgggccccgccgagcttgggcccccg cgccgcccgcgccccccaccgccgccgccccggccggagcccgccgccacggaccgcctc agccctccgggccacgggcctctccgggcgggagcccagggtggcggcgcctgcggccga gcgccgggagcagnn >gi568815589f:36481682_36777334|GENSCAN_predicted_peptide_2|104_aa MWIEPQQHLEGENLPEARASTGKTEPRDEKEKVGSGDIALDSVKPYLESAQPLEFSHTCL PSWGRNSGAPAVVRVLLSACAHSHLSQSTETFCSEGSMMINDEL >gi568815589f:36481682_36777334|GENSCAN_predicted_CDS_2|315_bp atgtggattgagccgcagcagcatcttgagggggagaatctgcctgaggctcgagccagc acaggaaaaactgaacccagagatgaaaaagagaaagtgggttctggtgacatagctcta gactcagtcaagccttatttggagtcagcccaacccctggagttttcacatacgtgtttg ccctcgtggggaagaaactctggagcaccagctgtggtcagagtgctcctcagtgcctgt gcccattctcatctcagccaaagtacagagaccttctgtagtgaaggatcaatgatgatc aatgatgagttgtag >gi568815589f:36481682_36777334|GENSCAN_predicted_peptide_3|178_aa MDPTKKKSLIYLKKNSEGRERKKVGFCGDLAYPRHLENHHVPSGARPRPQVTPRSPVDLP AIPFTAPQRPRAVAPVSSPACPPQSQLMAGVWAVRGAAAQTQFAPGSRETGGFHRSPALY QPCSLDKIFDLSDPRFSSPVCPFCVTTVDQLDGQINPNECLIVGAPRSLCKGAGQDAN >gi568815589f:36481682_36777334|GENSCAN_predicted_CDS_3|537_bp atggatccaaccaagaagaaatccttgatttacctgaaaaagaattcagaaggaagagaa cgtaagaaagttgggttctgtggagacttggcatatccacgccacctggaaaatcatcac gtgccctccggggcaagaccaaggcctcaagtcacccctcgcagcccagtggacctcccg gccatccctttcacggctccacagcgaccgcgagccgttgccccggtcagttctcccgcc tgcccgccgcagtcgcagttgatggctggggtctgggctgtgcggggcgcagcggcccaa acccagtttgctcctggctctcgggagactggaggatttcatcggagccccgcgctttac cagccctgttccctggataagatatttgacctttccgacccgcggttttcttctccggtc tgtcctttctgtgtaactactgtggaccaactagatgggcagatcaatcccaatgagtgc ctcattgttggtgctcccagaagcctttgcaagggagcagggcaggatgctaattaa >gi568815589f:36481682_36777334|GENSCAN_predicted_peptide_4|714_aa MKDYDELLKYYELHETIGTGGFAKVKLACHILTGEMVAIKIMDKNTLGSDLPRIKTEIEA LKNLRHQHICQLYHVLETANKIFMVLEDRLSEEETRVVFRQIVSAVAYVHSQGYAHRDLK PENLLFDEYHKLKLIDFGLCAKPKADVWSMGILLYVLMCGFLPFDDDNVMALYKKIMFIH LDDDCVTELSVHHRNNRQTMEDLISLWQYDHLTATYLLLLAKKARGKPVRLRLSSFSCGQ ASATPFTDIKSNNWSLEDVTASDKNYVAGLIDYDWCEDDLSTGAATPRTSQFTKYWTESN GVESKSLTPALCRTPANKLKNKENVYTPKSAVKNEEYFMFPEPKTPVNKNQHKREILTTP NRYTTPSKARNQCLKETPIKIPVNSTGTDKLMTGVISPERRCRSVELDLNQAHMEETPKR KGAKVFGSLERGLDKVITVLTRSKRKGSARDGPRRLKLHYNVTTTRLVNPDQLLNEIMSI LPKKHVDFVQKGYTLKCQTQSDFGKVTMQFELEVCQLQKPDVVGIRRQRLKGDAWVYKRL VEDILSSCKLLVTTTPPSVSMTLTTLDASSEKVGIQRKVPAGELIMSSLSECDKPAAEPL RKAGDGSPGAAAFHNGELAEGPGSLPAWQLKSGAPGLPATAPMGVFPHGTPLVGGFCLEL WGRAHLSEKGTKLPNASFVPDSHREIPRKFAGLPASQLPTVKAGGYKSMRRKKL >gi568815589f:36481682_36777334|GENSCAN_predicted_CDS_4|2145_bp atgaaagattatgatgaacttctcaaatattatgaattacatgaaactattgggacaggt ggctttgcaaaggtcaaacttgcctgccatatccttactggagagatggtagctataaaa atcatggataaaaacacactagggagtgatttgccccggatcaaaacggagattgaggcc ttgaagaacctgagacatcagcatatatgtcaactctaccatgtgctagagacagccaac aaaatattcatggttcttgaggatcgcctgtcagaagaggagacccgggttgtcttccgt cagatagtatctgctgttgcttatgtgcacagccagggctatgctcacagggacctcaag ccagaaaatttgctgtttgatgaatatcataaattaaagctgattgactttggtctctgt gcaaaacccaaggcagatgtttggagcatgggcatactgttatatgttcttatgtgtgga tttctaccatttgatgatgataatgtaatggctttatacaagaagattatgtttattcac ctcgatgatgattgcgtaacagaactttctgtacatcacagaaacaacaggcaaacaatg gaggatttaatttcactgtggcagtatgatcacctcacggctacctatcttctgcttcta gccaagaaggctcggggaaaaccagttcgtttaaggctttcttctttctcctgtggacaa gccagtgctaccccattcacagacatcaagtcaaataattggagtctggaagatgtgacc gcaagtgataaaaattatgtggcgggattaatagactatgattggtgtgaagatgattta tcaacaggtgctgctactccccgaacatcacagtttaccaagtactggacagaatcaaat ggggtggaatctaaatcattaactccagccttatgcagaacacctgcaaataaattaaag aacaaagaaaatgtatatactcctaagtctgctgtaaagaatgaagagtactttatgttt cctgagccaaagactccagttaataagaaccagcataagagagaaatactcactacgcca aatcgttacactacaccctcaaaagctagaaaccagtgcctgaaagaaactccaattaaa ataccagtaaattcaacaggaacagacaagttaatgacaggtgtcattagccctgagagg cggtgccgctcagtggaattggatctcaaccaagcacatatggaggagactccaaaaaga aagggagccaaagtgtttgggagccttgaaagggggttggataaggttatcactgtgctc accaggagcaaaaggaagggttctgccagagacgggcccagaagactaaagcttcactat aacgtgactacaactagattagtgaatccagatcaactgttgaatgaaataatgtctatt cttccaaagaagcatgttgactttgtacaaaagggttatacactgaagtgtcaaacacag tcagattttgggaaagtgacaatgcaatttgaattagaagtgtgccagcttcaaaaaccc gatgtggtgggtatcaggaggcagcggcttaagggcgatgcctgggtttacaaaagatta gtggaagacatcctatctagctgcaagctcctggtaaccaccaccccaccttctgtctct atgactttgacaactctagatgcctcatctgaaaaagtgggtattcagcgcaaagtgcca gctggggaattgattatgtcttctctcagcgagtgtgataaacctgctgccgagccactg cggaaggccggagacggctccccgggggcggcggcatttcataacggagaattggcagag gggccaggttcactccctgcgtggcagctgaagtctggggcaccaggtctgcccgccacg gcgcccatgggagtcttccctcacggcacccccctggttggtggcttctgcttagagctg tggggacgtgcgcatctgtcagagaaggggacgaagttgccaaatgccagctttgttcct gactcccacagggagattccccggaaatttgctggactcccagcaagtcagttgcccact gtgaaagcaggaggctacaagagtatgagaagaaagaagctatag >gi568815589f:36481682_36777334|GENSCAN_predicted_peptide_5|266_aa MVLQAGVADGWVGYATAMVLSGEAGMRSGPKISSLSATKWLIQLGSVGSGAAPAVTFTRR GKGMWIVLYHRVKKTQKYISQMPWQSLRGTTTRYPWAPVKLRQPSGPQRTLSASCLPARA PYAQEVSLLAEEQQPPPGKFLSPYDEIAPNKYSRITQGDEPKVSSSWFVPSKQYFLSEGL VPEEERRGEDAKSPQISVEPPGHRAPGPTPSAAKEAVTGEYGETITYRCVLLWAGRYRQL STCTLSFGPQQTLEELAAVIPRRGGQ >gi568815589f:36481682_36777334|GENSCAN_predicted_CDS_5|801_bp atggtcctgcaggcgggagtagcggatggttgggtgggctacgctacagccatggtactc agtggagaggctggaatgcggtcaggacccaagatcagctccttgtcagccaccaaatgg ttaatacaactgggttcagtgggctccggggcagcccctgcagtgaccttcacaaggaga ggcaaaggaatgtggattgtcctgtaccatagagtcaagaagacccaaaagtacatttct cagatgccctggcagagtctccggggaaccacaacacgttacccatgggcacctgtgaag ctgaggcaaccttctggaccccagcgcacgctctcggccagctgtcttcctgctcgagcc ccttatgcacaggaagtctcactcctcgccgaagagcagcagccgccaccagggaagttt ctatccccatacgatgagatagctccaaacaagtacagcaggattacccaaggtgatgaa ccgaaagtgtctagctcatggtttgtgcctagcaaacaatacttcctctctgagggtctg gttcctgaagaagagaggaggggagaagatgctaaaagcccccagataagtgtggagccg ccaggacacagagctcctgggcctacacctagtgctgccaaggaagcagtcactggagaa tacggtgaaacaatcacctaccgctgcgtgctgctgtgggctggacgttaccgccagctc tcaacatgcaccctctcatttggtcctcagcagaccctggaggaacttgctgctgtcatc cccagaaggggaggccagtga