GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:49:05 Sequence gi568815596f:188383429_188694008 : 310580 bp : 35.03% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 535 530 6 1.05 1.01 Sngl - 2517 2110 408 0 0 43 32 223 0.716 8.24 1.00 Prom - 11157 11118 40 -5.65 2.04 PlyA - 11544 11539 6 1.05 2.03 Term - 13715 13341 375 2 0 1 54 245 0.137 6.15 2.02 Intr - 29114 28920 195 2 0 72 116 105 0.571 10.39 2.01 Init - 52247 52179 69 2 0 36 115 33 0.069 1.90 2.00 Prom - 62427 62388 40 -4.95 3.00 Prom + 63744 63783 40 -6.85 3.01 Sngl + 72296 73042 747 1 0 49 48 268 0.684 14.83 3.02 PlyA + 74185 74190 6 1.05 4.03 PlyA - 74430 74425 6 1.05 4.02 Term - 80658 80546 113 1 2 44 38 113 0.070 -0.36 4.01 Init - 110607 110523 85 0 1 72 113 50 0.801 6.93 4.00 Prom - 122918 122879 40 -3.45 5.03 PlyA - 123866 123861 6 1.05 5.02 Term - 134239 134166 74 2 2 51 47 125 0.610 1.79 5.01 Init - 137279 137264 16 2 1 76 100 12 0.568 1.66 5.00 Prom - 141175 141136 40 -3.95 6.00 Prom + 147519 147558 40 -3.55 6.01 Init + 148977 149027 51 2 0 77 75 80 0.144 6.81 6.02 Intr + 157753 157890 138 0 0 93 75 19 0.008 0.84 6.03 Intr + 185811 185927 117 2 0 54 70 59 0.200 0.44 6.04 Intr + 186600 186692 93 2 0 53 81 84 0.318 3.44 6.05 Intr + 200837 200975 139 1 1 92 57 120 0.787 8.42 6.06 Intr + 204427 204521 95 2 2 105 62 134 0.719 11.26 6.07 Term + 216718 216729 12 0 0 98 43 1 0.026 -6.17 6.08 PlyA + 217105 217110 6 1.05 7.00 Prom + 219787 219826 40 -5.95 7.01 Init + 222226 222328 103 0 1 60 68 116 0.685 5.25 7.02 Intr + 222881 223133 253 0 1 58 97 189 0.202 12.37 7.03 Term + 239199 239334 136 0 1 46 47 91 0.031 -2.69 7.04 PlyA + 239448 239453 6 1.05 8.05 PlyA - 239777 239772 6 1.05 8.04 Term - 243923 243837 87 2 0 116 42 62 0.240 1.08 8.03 Intr - 268938 268886 53 1 2 98 66 75 0.203 4.01 8.02 Intr - 270095 269902 194 1 2 45 67 119 0.623 3.71 8.01 Init - 280972 280845 128 1 2 55 97 93 0.635 6.58 8.00 Prom - 305120 305081 40 -0.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 154623 154318 306 0 0 44 41 214 0.943 7.92 S.002 Init + 286850 286974 125 1 2 71 86 118 0.825 9.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:188383429_188694008|GENSCAN_predicted_peptide_1|135_aa MWESLELPKDLLNGFDQNADNDMDNDIQAEVVSDRDEELVGNWSQGDSCYVLGKRLAAFC SCPRDLWNFELERGDFGYLEEGMSKQQSIQDVTWVLLKTFSFIREAEHTSSENLQSDNAI EKKIPFSEEKFKLAA >gi568815596f:188383429_188694008|GENSCAN_predicted_CDS_1|408_bp atgtgggaaagtttggaactccctaaagacttgttgaatggctttgaccaaaatgctgat aatgatatggacaatgacatccaggctgaggtggtctcagacagagatgaggaacttgtt gggaactggagccaaggtgactcttgttatgttttaggaaagagactggcggcattttgc tcctgccctagagatttgtggaactttgaacttgagagaggtgatttcgggtatctggag gaaggaatgtctaagcagcaaagcattcaagatgtgacttgggtgctgttaaagacattc agttttataagggaagcagagcatacaagttcagaaaatttgcagtctgacaatgcaata gaaaagaaaatcccgttttctgaggagaaattcaagctagctgcataa >gi568815596f:188383429_188694008|GENSCAN_predicted_peptide_2|212_aa MRIQPRKLLYFSLVIPYAKDTGMVLWEGNRGSSRGHNNDSIKLEVKIATSYFGLFLPLSQ QAKKRVTVLAGVTEADVKMKSHYYFTMKKQGRNLTRKLKEITDALKEVVGYSLHLLSQAE NQVLRMWEWDKLPSGYTLLLQNLAIQATGEGLTLTSAETNLGSRGQYKSGSSSRKRRVHS QSPARMEGNHSSSYFTGNFVEACQLTQVTVAG >gi568815596f:188383429_188694008|GENSCAN_predicted_CDS_2|639_bp atgagaatacagcccagaaaactcctttatttcagcctggtgataccctacgccaaggac acaggtatggtgttatgggagggaaaccgtgggagttccagaggacacaacaatgattcc attaaactggaagttaagattgccaccagctactttgggctcttcctacctctgagtcaa caggctaagaagagagttacagtgttagctggggtcactgaagcggatgtgaagatgaaa tcacactattacttcacaatgaagaagcaaggtagaaacttaacaagaaaactgaaagaa atcacagatgctttgaaagaggtggtaggctacagcctgcatcttctaagccaggcagaa aaccaagtcctacgaatgtgggagtgggataaactgccttcagggtatacactcctactg cagaacctggcaatccaggctacaggggaaggccttaccctaaccagtgctgagactaat ttagggagccgtgggcaatataaaagtgggagcagcagcaggaagagacgtgtgcattcc cagtctccagcacggatggagggaaaccattcctcatcctacttcacaggaaatttcgtg gaagcctgccagctaactcaggtgactgtggcaggttga >gi568815596f:188383429_188694008|GENSCAN_predicted_peptide_3|248_aa MWKRIWNWVTGRGWNSLEGSEEDRKMWESLELPRHLLNGFDQNADSHMDNKVQAELVSGE EEELVRNWNKGDSYYVLAKRLVAFCSCPRDLWNFELERDDLGYLVEEISKQQSIQEVTWV LLKAFSFKWEIELKRLENLQPDKAIEKKIPFSEEKFKLAAVICISNLHKNVNHQDNGENV SRACQRPLQQPLPSRAGRPRKKNRFSGLGLRSPCCVQPRNLVPSIPAAPPSLKWTNVELG LWLQRVQT >gi568815596f:188383429_188694008|GENSCAN_predicted_CDS_3|747_bp atgtggaagcgaatttggaactgggtaacaggcagaggttggaacagtttggagggctca gaagaagacaggaaaatgtgggaaagtttggaacttcctagacacttgttgaatggcttt gaccaaaatgctgatagccatatggacaataaagtccaggctgagttggtctcaggtgaa gaggaggaacttgttaggaactggaacaaaggtgactcttattatgttttagcaaagaga ctggtggcattttgctcctgccctagagatttgtggaactttgaacttgagagagatgat ttagggtatctggtggaagaaatttctaaacagcaaagcattcaagaggtaacttgggtg ctgttaaaggcattcagtttcaaatgggaaatagagcttaaacgtttggaaaatttgcag cctgacaaggcaatagaaaagaaaatcccattttctgaggagaaattcaagctagctgca gtaatttgcataagtaatttgcataagaatgttaatcaccaagacaatggggaaaatgtc tccagggcatgtcagagacctttgcaacagcctctcccatcacgtgccggaaggcctagg aagaaaaatcggttttctgggctgggcctgaggtccccgtgctgtgtgcagcctaggaac ttggtgccttccatcccagccgctccgccatcgctgaaatggacgaatgtagagcttgga ctgtggcttcagagggtgcaaacttaa >gi568815596f:188383429_188694008|GENSCAN_predicted_peptide_4|65_aa MNTLSRDWLRRAHDPYQTYKGQQDSIQEDTLTSTSIKMIQEKMTSPNELNKTSGNNPGET EICDL >gi568815596f:188383429_188694008|GENSCAN_predicted_CDS_4|198_bp atgaacacattatccagagactggctaagaagggcacatgatccatatcaaacctacaag ggtcagcaagattctattcaggaagacacattaacatctacaagtatcaagatgatccag gaaaagatgacctcaccaaatgaactaaataagacatcagggaacaatcctggagaaaca gagatatgtgacctttga >gi568815596f:188383429_188694008|GENSCAN_predicted_peptide_5|29_aa MRIVEDGSPDQSADDKPCSSPQNRQLAFE >gi568815596f:188383429_188694008|GENSCAN_predicted_CDS_5|90_bp atgaggatagtggaagatgggtcacctgatcagtcagcagatgacaagccctgctcaagt ccacagaaccgccagttggcttttgagtga >gi568815596f:188383429_188694008|GENSCAN_predicted_peptide_6|214_aa MATTNYRYKMATTYVYREVQHNCQLHRISFCADDKTDKRIFTFICKDSESNKHLCYVFDS EKCAEEITLTIGQAFDLAYRKFLESGGKDVETRKQIAGLQKRIQDLETENMELKNKVQDL ENQLRITQVSAPPAGSMTPKSPSTDIFDMIPFSPISHQSSMPTRNGTQPPPVPSRSTEIK RDLFGAEPFDPFNCGAADFPPDIQSKLDEMQVHI >gi568815596f:188383429_188694008|GENSCAN_predicted_CDS_6|645_bp atggctaccactaactaccgctacaagatggctaccacctacgtttatagggaagttcaa cacaattgccagcttcatagaatatctttttgtgcagatgataaaactgacaagaggata ttcactttcatatgcaaagattctgagtcaaataaacatttgtgctatgtatttgacagc gaaaagtgtgctgaagagatcactttaacaattggccaagcatttgacctggcatacagg aaatttctagaatcaggaggaaaagatgttgaaacaagaaaacagatcgcagggttacaa aaaagaatccaagacttagaaacagaaaatatggaacttaaaaataaagtacaagatttg gaaaaccaactgagaataactcaagtatcagcacctccagcaggcagtatgacacctaag tcgccctccactgacatctttgatatgattccattttctccaatatcacaccagtcttcg atgcctactcgcaatggcacacagccacctccagtacctagtagatctactgagattaaa cgggacctgtttggagcagaaccttttgacccatttaactgtggagcagcagatttccct ccagatattcaatcaaaattagatgagatgcaggtacacatttaa >gi568815596f:188383429_188694008|GENSCAN_predicted_peptide_7|163_aa MKPHTLAVSVTALTVAHLEFVPSDVRMCLEFLPSDSEAQLASPSGSRTRAAGGAACQSGT VPPHSSALGWSMGLGAVEQGVALVGEVRAAQEPMEGVGGSGMAGCRSRALPRGKAAKARG SVNGSPLDAKRDLWLTARKWGCQSYNHKVINSANDLNELEKGR >gi568815596f:188383429_188694008|GENSCAN_predicted_CDS_7|492_bp atgaagccgcataccctcgcggtgagtgttacagctcttacagtggcgcatctggagttt gttccttctgatgttcggatgtgtttggagtttcttccttctgactcagaagcccagctg gcttcacctagtggatcccgcaccagggctgcaggtggagctgcctgccagtccggcacc gtgcccccgcactcctcagcccttgggtggtcgatgggactgggcgcagtggagcagggg gtggcgctcgtcggggaggttcgggcggcacaggagcccatggagggggtgggaggctca ggcatggcgggctgcaggtcccgagccctgccccgcgggaaggcagctaaggcccgaggg tctgtaaatggaagtcctctagatgctaagagagacctctggctgacagcaagaaaatgg ggatgccagtcatacaatcacaaggtaataaattctgccaatgacctgaatgaacttgaa aaaggacgctga >gi568815596f:188383429_188694008|GENSCAN_predicted_peptide_8|153_aa MTAHYIGINEEVNRRDAMDGKYRHMHNGRVSQTSSCESLTGNSNTDAAGVHYPKQINAGT ENQIPHVLTCKGELNMSIHGHKRGRVDTRAYLRVGGGKKWGWKNYLSEGSKDLLKKGKCG GLDKEDYAVEGIEYQTETGGRGIKSFEQGDNLG >gi568815596f:188383429_188694008|GENSCAN_predicted_CDS_8|462_bp atgactgcgcattatattggcataaacgaagaggtcaatagaagggatgccatggacggt aaatacaggcacatgcataatgggagagtgtcccagacttcatcctgtgagtcactgacg ggtaacagcaacacagatgcagctggagttcattatcctaagcaaattaatgcaggaaca gaaaaccaaatacctcatgttctcacttgtaagggagagctaaacatgagtatacatgga cacaagaggggaagagtagacaccagggcctacttaagggttgggggtgggaagaagtgg ggatggaaaaactacctgtcagaaggaagtaaagaccttctgaaaaaagggaagtgtgga ggactggataaggaggattatgctgtagaaggcattgagtatcagacagagactggagga aggggtattaaaagctttgagcaaggggataacttaggttaa