GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:02:55

Sequence gi568815594r:121038622_121264125 : 225504 bp : 37.77% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -   1433   1309  125  1  2   78  115   119 0.714  12.91
 1.01 Init -   7216   7029  188  1  2   73   51   185 0.930  10.08
 1.00 Prom -  15457  15418   40                              -6.15

 2.00 Prom +  15639  15678   40                              -7.95
 2.01 Init +  16715  16763   49  1  1   67  119    29 0.887   5.08
 2.02 Intr +  17462  17715  254  0  2   57   61   150 0.656   5.43
 2.03 Intr +  24116  24243  128  1  2   83   40   142 0.420   7.46
 2.04 Intr +  29291  29426  136  2  1   37   46   123 0.284   2.65
 2.05 Intr +  32193  32536  344  2  2   40   69   163 0.093   2.90
 2.06 Intr +  32795  32959  165  2  0   51  106    85 0.780   4.75
 2.07 Intr +  34162  34378  217  1  1  121    7   147 0.177   7.48
 2.08 Intr +  46089  46209  121  2  1   36   60   136 0.236   4.75
 2.09 Intr +  70819  70962  144  2  0  -31  109   120 0.088   1.53
 2.10 Intr +  86876  87097  222  0  0   11   11   226 0.607   4.38
 2.11 Term +  87974  88197  224  0  2  100   33   175 0.804   9.30
 2.12 PlyA +  88303  88308    6                               1.05

 3.15 PlyA -  88560  88555    6                               1.05
 3.14 Term -  96429  96161  269  1  2   57   44   118 0.196  -1.03
 3.13 Intr - 100063 100003   61  1  1   70   81    62 0.622   1.09
 3.12 Intr - 103293 103195   99  0  0  105  100    67 0.955   8.99
 3.11 Intr - 104155 104105   51  1  0  100   91    27 0.839   2.39
 3.10 Intr - 108553 108428  126  1  0    3   87   190 0.998  10.36
 3.09 Intr - 111598 111482  117  1  0   57   94   103 0.988   7.54
 3.08 Intr - 114318 114289   30  0  0  118   97    -1 0.698   1.21
 3.07 Intr - 116058 115930  129  0  0   70   64   167 0.994  12.47
 3.06 Intr - 118622 118473  150  2  0   36   64   170 0.347   8.84
 3.05 Intr - 125525 125439   87  2  0   73   94    48 0.156   3.15
 3.04 Intr - 139935 139816  120  0  0   45   57   111 0.675   3.47
 3.03 Intr - 143892 143803   90  0  0   27  100    76 0.677   1.97
 3.02 Intr - 144175 144055  121  0  1   93  116    93 0.236  12.18
 3.01 Init - 152085 151985  101  0  2   65   89    52 0.174   2.78
 3.00 Prom - 158225 158186   40                              -3.95

 4.00 Prom + 176204 176243   40                              -3.15
 4.01 Init + 192995 193050   56  1  2   86   67    60 0.005   4.51
 4.02 Term + 203663 203816  154  2  1    8   48   234 0.886   7.81
 4.03 PlyA + 204710 204715    6                               1.05

 5.00 Prom + 209457 209496   40                              -4.95
 5.01 Init + 214979 215005   27  1  0   74   81    28 0.493   0.42
 5.02 Intr + 215450 215587  138  1  0   52   95    47 0.529   1.54
 5.03 Term + 216462 216689  228  2  0   26   53   167 0.420   2.65
 5.04 PlyA + 221178 221183    6                               1.05

 6.02 PlyA - 221250 221245    6                               1.05
 6.01 Term - 224480 224353  128  0  2  102   47    80 0.838   2.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  72092  72180   89  0  2   73   47   104 0.924   1.64
S.002 Intr + 133431 133521   91  0  1  100   82    28 0.811   2.48
S.003 Term + 137890 137952   63  0  0   -5   42   223 0.914   5.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:121038622_121264125|GENSCAN_predicted_peptide_1|105_aa
MVLLHWCLLWLLFPLSSRTQKLPTRDEELFQMQIRDKAFFHDSSVIPDGAEISSYLFRDT
PKRYFFVVEEDNTPLSVTVTPCDAPLEWKLSLQELPEDRSGEGSX

>gi568815594r:121038622_121264125|GENSCAN_predicted_CDS_1|315_bp
atggtgctgctccactggtgcctgctgtggctcctgtttccactcagctcaaggacccag
aagttacccacccgggatgaggaactttttcagatgcagatccgggacaaggcatttttt
catgattcgtcagtaattccagatggagctgaaattagcagttatctctttagagataca
cctaagaggtatttctttgtggttgaagaagacaatactccattatcagtcacagtgacg
ccctgtgatgcgcctttggagtggaagctgagcctccaggagctgccagaggacaggagc
ggggaaggctcagnn

>gi568815594r:121038622_121264125|GENSCAN_predicted_peptide_2|667_aa
MEKLGICFAVGLPECADRLNKTCEPSYKLTPAKYHITYATGHLFGKKHHPWRNSKGNGII
RNVANYTGTSREVADGELSEEKTRRELTAVTETVEKTRLERGSTKEKAAAGECRGQQRSV
IITQDRACQEKKRSRKCLVVVIALKEAAKDTPAFGTGLGTGTANARCHLRRGKKYVKYTQ
PSKQTYTEMGATRTQEQEMDSAPKGARRGTATWPEGQLRDSVEVPAANSEELMRGGSGEL
PGALTPGPGIGCGQKGEPRAHNATKPHSKFSSLQSASKKTAGTCNHSFGLLGYRGRGDSA
QLESSQTPTKNSCAAWTSSFSPYTQTPSAKPQSRRLSLAAGGGHEPGKGGNSPLGRRANP
PSLSFSHTHTTLLSDPALPKQDPRGRHREGHAAVLTLAPLGENEGGPPSPRQSCKTQETR
TRDHHTVKQTGWQYFVEAAVDSSSQFSQALELASSQSLRDVGANLPVPCPREIALARHPS
FFECRTPLFSKQVSTHEAIMDSEFRWNKLKLMMAFSIFIAKAAVRGLVLLEHCKPLLERV
ARLQDRAPSWHVDHSCGPGEGGAGVDHAQGEGSGEEQGQFKRCRSQNGHLLAEAPSAALP
AEAAVSEELHSCQLRVWLWWMDVGHYPMAKRLSYKCGNGVSVSSKHTHTQQRADSGTAVK
RWSLPEN

>gi568815594r:121038622_121264125|GENSCAN_predicted_CDS_2|2004_bp
atggaaaaactggggatatgctttgcagttggattgccagaatgtgctgataggctaaat
aaaacctgtgagccttcctacaaactcacacctgccaaataccatataacttatgctact
gggcacttatttggaaagaagcaccatccctggaggaattctaaaggcaatggaataatc
cggaatgtagcgaactacacaggcaccagcagggaagtggcagatggtgagctcagtgaa
gaaaagaccaggagggaattgactgcagtcactgagacagttgagaaaaccaggctagag
agaggaagcactaaggaaaaggctgcagctggtgagtgtcgcggtcaacaacgttcagtg
atcattacacaagatagagcctgccaagagaaaaagagatctcgaaaatgcctggtggtt
gtcattgctttgaaggaagcagccaaggatacacctgcttttggcactggcttaggcact
ggtacagcaaatgccaggtgtcatctcagaagaggcaagaaatacgtgaaatacacacag
ccaagcaaacagacttacactgaaatgggagccacccgtacccaagagcaagaaatggac
agcgccccgaagggggccaggaggggaaccgcaacttggccggagggacagctcagagac
agcgtagaggtgcccgcggcaaactccgaggagctgatgcggggcgggtccggggaactg
ccgggagctctcacgcctgggcctgggatagggtgcgggcagaaaggtgaaccgagagcg
cacaatgcaacgaaaccgcactccaaattcagcagtttgcaaagtgcctccaagaaaaca
gctgggacctgcaatcactcgttcggactcctaggttatcggggccgcggcgactccgct
cagttagaaagctcccagacgcccaccaagaactcgtgcgctgcctggacctctagcttt
tcgccctacacgcagacgccctctgcaaaaccccagtcccgcagactttccctagcggcg
ggcggagggcatgaacctggaaagggaggcaacagccctcttggcaggcgggcaaaccct
ccctccctctctttctctcacacacacaccaccctgctcagcgacccagcgctccccaaa
caggaccctcgcgggcggcatcgcgagggacacgctgctgtcctaaccttagcgccactc
ggggagaatgaaggaggcccaccgagcccccgacaaagctgcaaaacccaggagacgcgc
acgcgggaccatcacactgtcaagcagaccgggtggcagtattttgtagaagcggctgtt
gactccagttcacagttttcccaagctttagaactagcttcatcacagtccctcagagac
gtcggtgccaatttaccagtgccctgtcctcgagaaatagcccttgcacgccatccttcc
ttctttgagtgccgaacacctctcttcagcaaacaggtttcaactcatgaagcaataatg
gattcggaattccggtggaataagctcaaactgatgatggcattttcaatatttatcgca
aaagccgcagttcggggcctagtgcttctggaacactgtaagccactcctggagagagtc
gcaaggttgcaagacagggcacccagctggcatgtggaccacagttgtgggccaggtgaa
ggtggtgcaggtgtggaccatgcccagggtgaaggctctggagaggagcaggggcagttc
aagagatgtagaagccagaatgggcaccttctggcagaggcgccatccgcagccctgcct
gcagaggcagctgtgagtgaggagctgcattcttgccagctccgtgtgtggctgtggtgg
atggatgtgggtcattatcctatggccaagaggctttcctacaaatgtggaaatggagtg
tcagtttcatcgaagcatactcatacccagcaaagagcagactcaggaacagcagttaaa
agatggagtcttccagaaaattaa

>gi568815594r:121038622_121264125|GENSCAN_predicted_peptide_3|516_aa
MALIPGNLERTSELQYTSRSLLTDIDILWLIADRCESMELDKKIQDLIERNASPHPKRFT
PEAMPTHRNLCSLKPLPVTKEILQQKNLLALSTLKCPGSCDYKKFAVPLLGAPLGTTHCA
KCAQCGICLRELQMQPNTIALATETPGKTASMAHFVQGTSRMIAAESSTEHKEVAELKTK
LDAAERFLSTREKDPHQRQRKDDRQREDDRQRDLTRDRLQREEKEKERLNEELHELKEEN
KLLKGKNTLANKEKEHYECEIKRLNKLLILVVSSMKALQDALNIKCSFSEDCLRKSRVEF
CHEEMRTEMEVLKQQVQIYEEDFKKERSDRERLNQEKEELQQINETSQSQLNRLNSQIKA
CQMEKEKLEKQLKQMYCPPCNCGLVFHLQDPWVPTGPGAVQKQREHPPDYQWYALDQLPP
DVQHKANGLLSADLKTAQLGDPTLVHTPLTTLHTTLFRQPSGWEEDHMNLGSRPRITWAG
TSGMCIWVPGGACSGGNVAHTVRKGVAGGGLEQSPL

>gi568815594r:121038622_121264125|GENSCAN_predicted_CDS_3|1551_bp
atggctttaattccagggaacttggaaaggacttctgaactccagtatacctccagatcc
ctgctcactgacattgacattctatggcttatagcagacagatgtgaaagcatggagctg
gacaaaaaaatccaggatctgattgagagaaatgcgtctcctcatccaaaacggttcacc
ccagaggccatgccaactcacagaaatctatgttctctgaagccacttcctgttactaaa
gaaattcttcagcagaagaatttgctggcactttccactttaaagtgtccaggcagctgt
gactacaagaagtttgctgtaccactcttaggagcacctttgggtaccactcactgtgcc
aagtgtgctcaatgtggaatatgtcttcgagaactccaaatgcaaccaaacactattgct
cttgcgactgagactccaggaaaaacagcttccatggcacattttgtacagggcacatct
agaatgattgccgcagaaagttctacggagcataaagaggtagcagagctgaagacgaaa
ctggacgccgcggaaagattcctcagcacgcgggagaaggatccgcatcagaggcagaga
aaggacgacaggcagagagaggacgacaggcagcgcgacctgacccgggaccggctgcag
cgggaggagaaggaaaaggaacgcctaaatgaagaattacatgaattgaaagaagagaat
aaacttttaaagggaaaaaatactcttgcgaacaaggaaaaggaacattacgaatgtgaa
ataaaacgcctcaataagctactaatcttggttgttagttccatgaaggctcttcaggat
gccttgaatatcaagtgttcattttccgaggactgtttgaggaagtctcgagtggaattc
tgccatgaggagatgagaacagaaatggaagttctgaagcagcaggtgcaaatatacgaa
gaagacttcaaaaaggaacgatcggatcgagagagacttaatcaagagaaagaggagcta
cagcaaattaatgaaacttcccaatcccagttgaacaggctgaattcccagataaaagct
tgtcagatggagaaagaaaaactagaaaagcaattaaaacagatgtattgcccaccctgt
aactgcggcttggttttccacctgcaagatccatgggtaccaacaggccctggagctgtg
cagaagcaacgggagcacccaccagactatcagtggtatgctcttgaccagcttccgcca
gatgtacaacacaaggcaaatggactcttgagtgcagacttgaagacagcccagctggga
gatcccacccttgtccatacacccctcaccactctacacacaacactctttcgacagcct
tcaggctgggaagaggaccacatgaatcttggaagcaggcccagaatcacctgggcaggg
acttcggggatgtgcatttgggtgccaggaggagcttgttctggaggaaacgtcgctcac
actgtgaggaagggtgtggctggaggagggctagagcagagtcctctataa

>gi568815594r:121038622_121264125|GENSCAN_predicted_peptide_4|69_aa
MDVDKSEMQLFGTKVQLSLNFEEQPPSLKPSREHQQNHTGGLVSTEGHKLVKVNSAVAAE
QLSGKSAGI

>gi568815594r:121038622_121264125|GENSCAN_predicted_CDS_4|210_bp
atggatgtagataaatcagaaatgcagctgtttgggaccaaagtccagttatcactgaat
tttgaagagcagccaccctccctgaagccatctagggaacaccagcagaatcacactgga
ggactggtgtccacagaaggccacaagcttgtgaaggtcaactcagctgtggcagctgag
caactgtcgggcaagtctgctggaatctga

>gi568815594r:121038622_121264125|GENSCAN_predicted_peptide_5|130_aa
MKMQVTRFLVIKDSVASCLHSPLPSVTSLTLGKAGCHLVSSSPCSEKLSAPAHIPCCDYR
YEPPCLTCVGYFYGQLMLHPAGPSEEPYVMNVRIVPLEDPRGKHFQQPHPSFVKYALSVL
TLQYFQVTRG

>gi568815594r:121038622_121264125|GENSCAN_predicted_CDS_5|393_bp
atgaagatgcaagtaactcggttcctggtcattaaagatagtgtggcttcctgcttgcac
tctcctttgccctctgttacatcgctcactctggggaaagctggctgccacctcgtgagc
agcagtccatgtagtgagaaactgagtgctcctgctcacatcccttgttgcgattatagg
tatgaaccaccatgcctgacatgtgttggttacttctatggtcaactgatgctccatcca
gcaggaccttctgaggagccttatgtaatgaatgtcagaattgtccctctagaagatcca
agggggaagcatttccagcagccccatccttcatttgtcaagtatgctctatcagtgtta
actcttcagtacttccaggttacacgtgggtga

>gi568815594r:121038622_121264125|GENSCAN_predicted_peptide_6|42_aa
XQKVLRVSVPAPWYSPISCDHMKESKQSVESCLPGHPPRTRA

>gi568815594r:121038622_121264125|GENSCAN_predicted_CDS_6|129_bp
nagcaaaaggtacttcgtgtgagtgtgccagcaccatggtatagcccaatttcttgtgat
cacatgaaagaaagcaagcagtcagtagagagctgcttaccaggccatcctcctagaacc
agggcctga