GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:57:43

Sequence gi568815590f:127638244_127840955 : 202712 bp : 44.62% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    411    406    6                               1.05
 1.03 Term -   4960   4878   83  2  2   88   33    94 0.157   1.76
 1.02 Intr -  14949  14641  309  1  0   42   21   245 0.144   9.48
 1.01 Init -  19932  19887   46  0  1   48   78    40 0.786  -0.14
 1.00 Prom -  20405  20366   40                              -4.86

 2.00 Prom +  21204  21243   40                              -5.06
 2.01 Init +  22710  22816  107  2  2   83   36   111 0.676   5.09
 2.02 Intr +  32133  32209   77  0  2   50   93    26 0.006  -1.54
 2.03 Intr +  50845  50905   61  2  1  108  101     5 0.349   1.59
 2.04 Term +  53922  54291  370  0  1   49   38   174 0.434   2.62
 2.05 PlyA +  54579  54584    6                               1.05

 3.00 Prom +  55549  55588   40                              -3.36
 3.01 Init +  57095  57140   46  1  1   79   72    81 0.869   4.81
 3.02 Intr +  57865  57988  124  2  1   79   63   106 0.261   6.84
 3.03 Intr +  97509  97593   85  0  1   63   66    77 0.021   2.82
 3.04 Intr +  98152  98380  229  0  1   63  115   107 0.905   8.34
 3.05 Intr +  99126  99306  181  1  1    7   34   147 0.617   0.13
 3.06 Intr +  99611  99833  223  2  1  104   19    76 0.851   0.43
 3.07 Intr + 100005 100776  772  2  1   66  107  1383 0.516 129.05
 3.08 Term + 102153 102715  563  1  2   60   49   652 0.944  53.04
 3.09 PlyA + 103010 103015    6                               1.05

 4.00 Prom + 105444 105483   40                              -2.96
 4.01 Init + 120079 120139   61  0  1   86   59    46 0.802   2.91
 4.02 Term + 125754 125863  110  1  2   51   54    96 0.662   1.17
 4.03 PlyA + 126727 126732    6                               1.05

 5.02 PlyA - 127640 127635    6                               1.05
 5.01 Sngl - 132503 131907  597  2  0   80   32   186 0.904   8.40
 5.00 Prom - 141411 141372   40                              -3.36

 6.00 Prom + 142127 142166   40                              -6.66
 6.01 Init + 156322 156491  170  0  2   61  100   243 0.876  19.81
 6.02 Term + 157560 157746  187  0  1   21   50   164 0.973   2.86
 6.03 PlyA + 159390 159395    6                               1.05

 7.00 Prom + 159586 159625   40                              -4.26
 7.01 Init + 164605 164850  246  0  0   52   75   117 0.552   4.40
 7.02 Intr + 171962 172096  135  1  0  132   69    -1 0.313   3.16
 7.03 Intr + 178585 178641   57  0  0   78  111    12 0.126   1.58
Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  78481  78400   82  0  1  123   35    32 0.842  -1.43

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:127638244_127840955|GENSCAN_predicted_peptide_1|145_aa
MSFAATWMKLEALILSTRDLVNNEWFVVEATREAGTGDIDEEKRKALALKGVNYQPKEHG
LYPVGVGEWWKGFIQMQNRIGLIFRKSIQQQCGGQMAVGEEGVRNSSYDTTERENKDDGQ
WYGSTLADVMVDGNCDEQHYDASYI
>gi568815590f:127638244_127840955|GENSCAN_predicted_CDS_1|438_bp
atgtcttttgcagcaacttggatgaagctggaggcccttattctaagcactagggacctg
gtgaacaatgagtggtttgtggtagaggctacaagggaggcagggacgggggacattgat
gaggagaagagaaaggcgctggctttgaagggtgttaactaccagcctaaagaacacggg
ctctatccggtaggtgttggggaatggtggaagggttttattcaaatgcagaacaggatt
ggacttatatttaggaagagcatccagcagcagtgtggagggcagatggctgtgggcgag
gaaggagtcaggaacagtagttatgacaccaccgaaagagagaataaagatgatggccaa
tggtatggcagcacattggcagatgttatggtagatggaaactgtgatgaacaacactat
gatgccagttacatttag
>gi568815590f:127638244_127840955|GENSCAN_predicted_peptide_2|204_aa
MDVNGSKKEEKINTLNQSDLLSQSWNPMRHEFKCRNSKSSALRETSQVPTTAHQALRDVA
PDWAPVYRVSIIYCCITNELKIQHQQRKVLNGFPLPASFSALHLTVNVNTGSPGWHIKGP
IDLLTQFAETAPCTTTLAAAVEKPFGHTSQKLLVHSLQLLLEPPVLPALDSETFLHHALR
HPLQSSTGGLESCVCYGQRITKAF
>gi568815590f:127638244_127840955|GENSCAN_predicted_CDS_2|615_bp
atggatgtcaatggcagcaagaaagaagagaaaataaacacactcaaccagtcggatctg
ctgagtcaatcatggaatccaatgagacacgaattcaaatgccgcaactcaaaatcttct
gcacttagagaaacatcccaagtccctaccacagcccaccaggccctacgtgacgtggcc
ccagactgggctcctgtttacagagtttccattatctactgctgcataacaaatgaactt
aaaattcagcatcagcagagaaaagttctaaatggatttcctttgccggcatcattttct
gccctacatctgaccgtcaacgtaaatacaggcagccctggttggcatataaaaggccca
attgacctgttgacccagtttgcagagacggctccatgcacaaccactttagcagctgct
gtagaaaaaccttttggccacacaagccaaaagctgctcgttcattcactgcaattactc
cttgagcccccagtgctcccggcactggactcagagacattcctgcaccatgctttgaga
cacccacttcagtctagcactggtggacttgagtcctgcgtctgttatggacaaagaata
acaaaggctttttag
>gi568815590f:127638244_127840955|GENSCAN_predicted_peptide_3|740_aa
MVLKMGLSIMPALYPSIMVTNMETTFGHFLKELPAFGEYRHECRLSQHQMEGSEKEGTRM
RFVKQYCYGGAAEKGRGFEREQKKMPAVRNPCRIHETLPIAAGGHFALELTTPEQGRDSP
DAGRLFCPFGDTSPPLPGPASLKGSPCSCLDAGFFSGSGKPALTPLPRPPSGVLAPEMRR
NCEERGSGRFQNSCYPWWGGSGGGIAAGSLAQLHLRIECEGRWRKTLCLGFWQIVFLTAT
SRGFLRAPGPISIPLPLRGRLPGFALRAPGGAGARRAPSRWFTKCVSEIAGDCPKGQPPA
TMPLNVSFTNRNYDLDYDSVQPYFYCDEEENFYQQQQQSELQPPAPSEDIWKKFELLPTP
PLSPSRRSGLCSPSYVAVTPFSLRGDNDGGGGSFSTADQLEMVTELLGGDMVNQSFICDP
DDETFIKNIIIQDCMWSGFSAAAKLVSEKLASYQAARKDSGSPNPARGHSVCSTSSLYLQ
DLSAAASECIDPSVVFPYPLNDSSSPKSCASQDSSAFSPSSDSLLSSTESSPQGSPEPLV
LHEETPPTTSSDSEEEQEDEEEIDVVSVEKRQAPGKRSESGSPSAGGHSKPPHSPLVLKR
CHVSTHQHNYAAPPSTRKDYPAAKRVKLDSVRVLRQISNNRKCTSPRSSDTEENVKRRTH
NVLERQRRNELKRSFFALRDQIPELENNEKAPKVVILKKATAYILSVQAEEQKLISEEDL
LRKRREQLKHKLEQLRNSCA
>gi568815590f:127638244_127840955|GENSCAN_predicted_CDS_3|2223_bp
atggtgctgaagatggggctgtccatcatgcctgcactgtaccccagtattatggtcacg
aatatggaaaccacatttggtcatttcctcaaggagctgccggcatttggagaatacaga
catgaatgcagactatctcagcatcagatggaaggcagtgagaaggaagggacaaggatg
cggtttgtcaaacagtactgctacggaggagcagcagagaaagggagagggtttgagagg
gagcaaaagaaaatgccagcggtccgcaacccttgccgcatccacgaaactttgcccata
gcagcgggcgggcactttgcactggaacttacaacacccgagcaaggacgcgactctccc
gacgcggggaggctattctgcccatttggggacacttccccgccgctgccaggacccgct
tctctgaaaggctctccttgcagctgcttagacgctggatttttttcgggtagtggaaaa
ccagccctgactcccctgccgcggccgccctcgggtgtcctcgcgcccgagatgcggagg
aactgcgaggagcggggctctgggcggttccagaacagctgctacccttggtggggtggc
tccgggggaggtatcgcagcggggtctctggcgcagttgcatctccgtattgagtgcgaa
gggaggtggcgcaaaactttgtgccttggattttggcaaattgttttcctcaccgccacc
tcccgcggcttcttaagggcgccagggccgatttcgattcctctgccgctgcggggccga
ctcccgggctttgcgctccgggctcccgggggagcgggggctcggcgggcaccaagccgc
tggttcactaagtgcgtctccgagatagcaggggactgtccaaaggggcagcctcccgcg
acgatgcccctcaacgttagcttcaccaacaggaactatgacctcgactacgactcggtg
cagccgtatttctactgcgacgaggaggagaacttctaccagcagcagcagcagagcgag
ctgcagcccccggcgcccagcgaggatatctggaagaaattcgagctgctgcccaccccg
cccctgtcccctagccgccgctccgggctctgctcgccctcctacgttgcggtcacaccc
ttctcccttcggggagacaacgacggcggtggcgggagcttctccacggccgaccagctg
gagatggtgaccgagctgctgggaggagacatggtgaaccagagtttcatctgcgacccg
gacgacgagaccttcatcaaaaacatcatcatccaggactgtatgtggagcggcttctcg
gccgccgccaagctcgtctcagagaagctggcctcctaccaggctgcgcgcaaagacagc
ggcagcccgaaccccgcccgcggccacagcgtctgctccacctccagcttgtacctgcag
gatctgagcgccgccgcctcagagtgcatcgacccctcggtggtcttcccctaccctctc
aacgacagcagctcgcccaagtcctgcgcctcgcaagactccagcgccttctctccgtcc
tcggattctctgctctcctcgacggagtcctccccgcagggcagccccgagcccctggtg
ctccatgaggagacaccgcccaccaccagcagcgactctgaggaggaacaagaagatgag
gaagaaatcgatgttgtttctgtggaaaagaggcaggctcctggcaaaaggtcagagtct
ggatcaccttctgctggaggccacagcaaacctcctcacagcccactggtcctcaagagg
tgccacgtctccacacatcagcacaactacgcagcgcctccctccactcggaaggactat
cctgctgccaagagggtcaagttggacagtgtcagagtcctgagacagatcagcaacaac
cgaaaatgcaccagccccaggtcctcggacaccgaggagaatgtcaagaggcgaacacac
aacgtcttggagcgccagaggaggaacgagctaaaacggagcttttttgccctgcgtgac
cagatcccggagttggaaaacaatgaaaaggcccccaaggtagttatccttaaaaaagcc
acagcatacatcctgtccgtccaagcagaggagcaaaagctcatttctgaagaggacttg
ttgcggaaacgacgagaacagttgaaacacaaacttgaacagctacggaactcttgtgcg
taa
>gi568815590f:127638244_127840955|GENSCAN_predicted_peptide_4|56_aa
MKNEDTVLTQMSFHEYSALQIIPGDASCHFDGCTLERPTEEEPMSPANSQQRPEKT
>gi568815590f:127638244_127840955|GENSCAN_predicted_CDS_4|171_bp
atgaaaaatgaggacacggttctgacacaaatgagtttccatgaatacagtgccttacaa
atcatcccaggggacgccagctgccactttgatggctgcactttggagaggcccacagag
gaggaaccaatgtctccagccaacagtcagcaaagacctgagaagacctga
>gi568815590f:127638244_127840955|GENSCAN_predicted_peptide_5|198_aa
MDLLPKVIYRFNAIPIKLPMTFFTELEKTTLKFTWNPKRAHIAKSILSQKKKAGGTMLPD
FKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIIPHIYNHLIFDKPDKNKKWGKDSLFNK
WCWENWLAICRKLKLNPFLTPYTKINSTWINDLNVRPNTIKTLEENLGNTIQDIGMGKDF
MSKTQKAMATKAKIDNGI
>gi568815590f:127638244_127840955|GENSCAN_predicted_CDS_5|597_bp
atggacttactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcacatggaacccaaaaagagcc
cacattgccaagtcaatcctaagccaaaagaaaaaagctggaggcaccatgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagaccaatggaacagaacagagccctcagaaataataccacacatctacaaccat
ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactgaatcccttccttaca
ccttatacaaaaattaattcaacatggattaacgacttaaatgttagacctaataccata
aaaaccctagaagaaaacttaggcaataccattcaggacataggcatgggcaaggacttc
atgtctaaaacacaaaaagcgatggcaacaaaagccaaaattgacaatgggatctaa
>gi568815590f:127638244_127840955|GENSCAN_predicted_peptide_6|118_aa
MGPRAGRARGGRDEEGRRRAASKDVPRDPRHLPVDFLAERMLAVPVTCGDTARSALHPVV
IANLSKALISQEGGCINPQNPEWNFLQDLLGEKPSRKLQKDKYRIRVWEKPRGLVSII
>gi568815590f:127638244_127840955|GENSCAN_predicted_CDS_6|357_bp
atgggcccgcgggccgggcgggctcggggcggccgggacgaggaggggcgacgacgagct
gcgagcaaagatgtgccccgggacccccggcaccttccagtggatttccttgcggaaagg
atgttggcggtccctgtgacctgtggagacacggccagatctgccctccatcctgtggtc
atagcgaatctttctaaagctctgatcagtcaagaagggggttgtatcaatcctcagaac
cctgagtggaactttctacaggatttattaggagaaaaaccttcccggaagctgcagaag
gacaaatacagaatccgtgtctgggagaaacctcgtggcctggtctccattatttga
>gi568815590f:127638244_127840955|GENSCAN_predicted_peptide_7|146_aa
MGPQPTRPAVAQKLEETRPGGKEDTGVLGLSLGLGDSSGLAVQRFKALDCKSTKVAGGVG
GGGKGRWGEGEGQDILESLNNQALQRPYLFLFMTLEIPYILSALGRHSLHADLVKKEKPR
SILHLPPSFTEIPCCLQLPLPLCIML
>gi568815590f:127638244_127840955|GENSCAN_predicted_CDS_7|438_bp
atgggaccacagccaaccaggcctgctgtagcccagaagctggaagagactcgcccagga
gggaaggaagacactggggtgttgggtctcagtctgggtcttggtgacagcagtgggctg
gcagtgcagaggttcaaggccttggactgcaagtccactaaggtggctggtggtgtgggt
ggaggtggaaaaggcaggtggggcgagggtgaggggcaggacatcctggagagcctgaat
aaccaggccttgcaaaggccatatctcttccttttcatgactctggaaattccatacata
ctttctgcattaggaagacattctctgcatgctgatctcgttaaaaaggaaaagccaagg
agcatcctgcacctccctcctagctttacagaaattccatgctgccttcagcttcctttg
ccgctctgcatcatgttg