GENSCAN 1.0	Date run: 5-Nov-116	Time: 19:02:54

Sequence gi568815590f:127638263_127840955 : 202693 bp : 44.61% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    392    387    6                               1.05
 1.03 Term -   4941   4859   83  1  2   88   33    94 0.157   1.76
 1.02 Intr -  14930  14622  309  0  0   42   21   245 0.144   9.48
 1.01 Init -  19913  19868   46  2  1   48   78    40 0.786  -0.14
 1.00 Prom -  20386  20347   40                              -4.86

 2.00 Prom +  21185  21224   40                              -5.06
 2.01 Init +  22691  22797  107  1  2   83   36   111 0.676   5.09
 2.02 Intr +  32114  32190   77  2  2   50   93    26 0.006  -1.54
 2.03 Intr +  50826  50886   61  1  1  108  101     5 0.349   1.59
 2.04 Term +  53903  54272  370  2  1   49   38   174 0.434   2.62
 2.05 PlyA +  54560  54565    6                               1.05

 3.00 Prom +  55530  55569   40                              -3.36
 3.01 Init +  57076  57121   46  0  1   79   72    81 0.869   4.81
 3.02 Intr +  57846  57969  124  1  1   79   63   106 0.261   6.84
 3.03 Intr +  97490  97574   85  2  1   63   66    77 0.021   2.82
 3.04 Intr +  98133  98361  229  2  1   63  115   107 0.905   8.34
 3.05 Intr +  99107  99287  181  0  1    7   34   147 0.617   0.13
 3.06 Intr +  99592  99814  223  1  1  104   19    76 0.851   0.43
 3.07 Intr +  99986 100757  772  1  1   66  107  1383 0.516 129.05
 3.08 Term + 102134 102696  563  0  2   60   49   652 0.944  53.04
 3.09 PlyA + 102991 102996    6                               1.05

 4.00 Prom + 105425 105464   40                              -2.96
 4.01 Init + 120060 120120   61  2  1   86   59    46 0.802   2.91
 4.02 Term + 125735 125844  110  0  2   51   54    96 0.662   1.17
 4.03 PlyA + 126708 126713    6                               1.05

 5.02 PlyA - 127621 127616    6                               1.05
 5.01 Sngl - 132484 131888  597  1  0   80   32   186 0.904   8.40
 5.00 Prom - 141392 141353   40                              -3.36

 6.00 Prom + 142108 142147   40                              -6.66
 6.01 Init + 156303 156472  170  2  2   61  100   243 0.876  19.81
 6.02 Term + 157541 157727  187  2  1   21   50   164 0.973   2.86
 6.03 PlyA + 159371 159376    6                               1.05

 7.00 Prom + 159567 159606   40                              -4.26
 7.01 Init + 164586 164831  246  2  0   52   75   117 0.552   4.40
 7.02 Intr + 171943 172077  135  0  0  132   69    -1 0.313   3.16
 7.03 Intr + 178566 178622   57  2  0   78  111    12 0.126   1.58
Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  78462  78381   82  2  1  123   35    32 0.842  -1.43

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:127638263_127840955|GENSCAN_predicted_peptide_1|145_aa
MSFAATWMKLEALILSTRDLVNNEWFVVEATREAGTGDIDEEKRKALALKGVNYQPKEHG
LYPVGVGEWWKGFIQMQNRIGLIFRKSIQQQCGGQMAVGEEGVRNSSYDTTERENKDDGQ
WYGSTLADVMVDGNCDEQHYDASYI
>gi568815590f:127638263_127840955|GENSCAN_predicted_CDS_1|438_bp
atgtcttttgcagcaacttggatgaagctggaggcccttattctaagcactagggacctg
gtgaacaatgagtggtttgtggtagaggctacaagggaggcagggacgggggacattgat
gaggagaagagaaaggcgctggctttgaagggtgttaactaccagcctaaagaacacggg
ctctatccggtaggtgttggggaatggtggaagggttttattcaaatgcagaacaggatt
ggacttatatttaggaagagcatccagcagcagtgtggagggcagatggctgtgggcgag
gaaggagtcaggaacagtagttatgacaccaccgaaagagagaataaagatgatggccaa
tggtatggcagcacattggcagatgttatggtagatggaaactgtgatgaacaacactat
gatgccagttacatttag
>gi568815590f:127638263_127840955|GENSCAN_predicted_peptide_2|204_aa
MDVNGSKKEEKINTLNQSDLLSQSWNPMRHEFKCRNSKSSALRETSQVPTTAHQALRDVA
PDWAPVYRVSIIYCCITNELKIQHQQRKVLNGFPLPASFSALHLTVNVNTGSPGWHIKGP
IDLLTQFAETAPCTTTLAAAVEKPFGHTSQKLLVHSLQLLLEPPVLPALDSETFLHHALR
HPLQSSTGGLESCVCYGQRITKAF
>gi568815590f:127638263_127840955|GENSCAN_predicted_CDS_2|615_bp
atggatgtcaatggcagcaagaaagaagagaaaataaacacactcaaccagtcggatctg
ctgagtcaatcatggaatccaatgagacacgaattcaaatgccgcaactcaaaatcttct
gcacttagagaaacatcccaagtccctaccacagcccaccaggccctacgtgacgtggcc
ccagactgggctcctgtttacagagtttccattatctactgctgcataacaaatgaactt
aaaattcagcatcagcagagaaaagttctaaatggatttcctttgccggcatcattttct
gccctacatctgaccgtcaacgtaaatacaggcagccctggttggcatataaaaggccca
attgacctgttgacccagtttgcagagacggctccatgcacaaccactttagcagctgct
gtagaaaaaccttttggccacacaagccaaaagctgctcgttcattcactgcaattactc
cttgagcccccagtgctcccggcactggactcagagacattcctgcaccatgctttgaga
cacccacttcagtctagcactggtggacttgagtcctgcgtctgttatggacaaagaata
acaaaggctttttag
>gi568815590f:127638263_127840955|GENSCAN_predicted_peptide_3|740_aa
MVLKMGLSIMPALYPSIMVTNMETTFGHFLKELPAFGEYRHECRLSQHQMEGSEKEGTRM
RFVKQYCYGGAAEKGRGFEREQKKMPAVRNPCRIHETLPIAAGGHFALELTTPEQGRDSP
DAGRLFCPFGDTSPPLPGPASLKGSPCSCLDAGFFSGSGKPALTPLPRPPSGVLAPEMRR
NCEERGSGRFQNSCYPWWGGSGGGIAAGSLAQLHLRIECEGRWRKTLCLGFWQIVFLTAT
SRGFLRAPGPISIPLPLRGRLPGFALRAPGGAGARRAPSRWFTKCVSEIAGDCPKGQPPA
TMPLNVSFTNRNYDLDYDSVQPYFYCDEEENFYQQQQQSELQPPAPSEDIWKKFELLPTP
PLSPSRRSGLCSPSYVAVTPFSLRGDNDGGGGSFSTADQLEMVTELLGGDMVNQSFICDP
DDETFIKNIIIQDCMWSGFSAAAKLVSEKLASYQAARKDSGSPNPARGHSVCSTSSLYLQ
DLSAAASECIDPSVVFPYPLNDSSSPKSCASQDSSAFSPSSDSLLSSTESSPQGSPEPLV
LHEETPPTTSSDSEEEQEDEEEIDVVSVEKRQAPGKRSESGSPSAGGHSKPPHSPLVLKR
CHVSTHQHNYAAPPSTRKDYPAAKRVKLDSVRVLRQISNNRKCTSPRSSDTEENVKRRTH
NVLERQRRNELKRSFFALRDQIPELENNEKAPKVVILKKATAYILSVQAEEQKLISEEDL
LRKRREQLKHKLEQLRNSCA
>gi568815590f:127638263_127840955|GENSCAN_predicted_CDS_3|2223_bp
atggtgctgaagatggggctgtccatcatgcctgcactgtaccccagtattatggtcacg
aatatggaaaccacatttggtcatttcctcaaggagctgccggcatttggagaatacaga
catgaatgcagactatctcagcatcagatggaaggcagtgagaaggaagggacaaggatg
cggtttgtcaaacagtactgctacggaggagcagcagagaaagggagagggtttgagagg
gagcaaaagaaaatgccagcggtccgcaacccttgccgcatccacgaaactttgcccata
gcagcgggcgggcactttgcactggaacttacaacacccgagcaaggacgcgactctccc
gacgcggggaggctattctgcccatttggggacacttccccgccgctgccaggacccgct
tctctgaaaggctctccttgcagctgcttagacgctggatttttttcgggtagtggaaaa
ccagccctgactcccctgccgcggccgccctcgggtgtcctcgcgcccgagatgcggagg
aactgcgaggagcggggctctgggcggttccagaacagctgctacccttggtggggtggc
tccgggggaggtatcgcagcggggtctctggcgcagttgcatctccgtattgagtgcgaa
gggaggtggcgcaaaactttgtgccttggattttggcaaattgttttcctcaccgccacc
tcccgcggcttcttaagggcgccagggccgatttcgattcctctgccgctgcggggccga
ctcccgggctttgcgctccgggctcccgggggagcgggggctcggcgggcaccaagccgc
tggttcactaagtgcgtctccgagatagcaggggactgtccaaaggggcagcctcccgcg
acgatgcccctcaacgttagcttcaccaacaggaactatgacctcgactacgactcggtg
cagccgtatttctactgcgacgaggaggagaacttctaccagcagcagcagcagagcgag
ctgcagcccccggcgcccagcgaggatatctggaagaaattcgagctgctgcccaccccg
cccctgtcccctagccgccgctccgggctctgctcgccctcctacgttgcggtcacaccc
ttctcccttcggggagacaacgacggcggtggcgggagcttctccacggccgaccagctg
gagatggtgaccgagctgctgggaggagacatggtgaaccagagtttcatctgcgacccg
gacgacgagaccttcatcaaaaacatcatcatccaggactgtatgtggagcggcttctcg
gccgccgccaagctcgtctcagagaagctggcctcctaccaggctgcgcgcaaagacagc
ggcagcccgaaccccgcccgcggccacagcgtctgctccacctccagcttgtacctgcag
gatctgagcgccgccgcctcagagtgcatcgacccctcggtggtcttcccctaccctctc
aacgacagcagctcgcccaagtcctgcgcctcgcaagactccagcgccttctctccgtcc
tcggattctctgctctcctcgacggagtcctccccgcagggcagccccgagcccctggtg
ctccatgaggagacaccgcccaccaccagcagcgactctgaggaggaacaagaagatgag
gaagaaatcgatgttgtttctgtggaaaagaggcaggctcctggcaaaaggtcagagtct
ggatcaccttctgctggaggccacagcaaacctcctcacagcccactggtcctcaagagg
tgccacgtctccacacatcagcacaactacgcagcgcctccctccactcggaaggactat
cctgctgccaagagggtcaagttggacagtgtcagagtcctgagacagatcagcaacaac
cgaaaatgcaccagccccaggtcctcggacaccgaggagaatgtcaagaggcgaacacac
aacgtcttggagcgccagaggaggaacgagctaaaacggagcttttttgccctgcgtgac
cagatcccggagttggaaaacaatgaaaaggcccccaaggtagttatccttaaaaaagcc
acagcatacatcctgtccgtccaagcagaggagcaaaagctcatttctgaagaggacttg
ttgcggaaacgacgagaacagttgaaacacaaacttgaacagctacggaactcttgtgcg
taa
>gi568815590f:127638263_127840955|GENSCAN_predicted_peptide_4|56_aa
MKNEDTVLTQMSFHEYSALQIIPGDASCHFDGCTLERPTEEEPMSPANSQQRPEKT
>gi568815590f:127638263_127840955|GENSCAN_predicted_CDS_4|171_bp
atgaaaaatgaggacacggttctgacacaaatgagtttccatgaatacagtgccttacaa
atcatcccaggggacgccagctgccactttgatggctgcactttggagaggcccacagag
gaggaaccaatgtctccagccaacagtcagcaaagacctgagaagacctga
>gi568815590f:127638263_127840955|GENSCAN_predicted_peptide_5|198_aa
MDLLPKVIYRFNAIPIKLPMTFFTELEKTTLKFTWNPKRAHIAKSILSQKKKAGGTMLPD
FKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIIPHIYNHLIFDKPDKNKKWGKDSLFNK
WCWENWLAICRKLKLNPFLTPYTKINSTWINDLNVRPNTIKTLEENLGNTIQDIGMGKDF
MSKTQKAMATKAKIDNGI
>gi568815590f:127638263_127840955|GENSCAN_predicted_CDS_5|597_bp
atggacttactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcacatggaacccaaaaagagcc
cacattgccaagtcaatcctaagccaaaagaaaaaagctggaggcaccatgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagaccaatggaacagaacagagccctcagaaataataccacacatctacaaccat
ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactgaatcccttccttaca
ccttatacaaaaattaattcaacatggattaacgacttaaatgttagacctaataccata
aaaaccctagaagaaaacttaggcaataccattcaggacataggcatgggcaaggacttc
atgtctaaaacacaaaaagcgatggcaacaaaagccaaaattgacaatgggatctaa
>gi568815590f:127638263_127840955|GENSCAN_predicted_peptide_6|118_aa
MGPRAGRARGGRDEEGRRRAASKDVPRDPRHLPVDFLAERMLAVPVTCGDTARSALHPVV
IANLSKALISQEGGCINPQNPEWNFLQDLLGEKPSRKLQKDKYRIRVWEKPRGLVSII
>gi568815590f:127638263_127840955|GENSCAN_predicted_CDS_6|357_bp
atgggcccgcgggccgggcgggctcggggcggccgggacgaggaggggcgacgacgagct
gcgagcaaagatgtgccccgggacccccggcaccttccagtggatttccttgcggaaagg
atgttggcggtccctgtgacctgtggagacacggccagatctgccctccatcctgtggtc
atagcgaatctttctaaagctctgatcagtcaagaagggggttgtatcaatcctcagaac
cctgagtggaactttctacaggatttattaggagaaaaaccttcccggaagctgcagaag
gacaaatacagaatccgtgtctgggagaaacctcgtggcctggtctccattatttga
>gi568815590f:127638263_127840955|GENSCAN_predicted_peptide_7|146_aa
MGPQPTRPAVAQKLEETRPGGKEDTGVLGLSLGLGDSSGLAVQRFKALDCKSTKVAGGVG
GGGKGRWGEGEGQDILESLNNQALQRPYLFLFMTLEIPYILSALGRHSLHADLVKKEKPR
SILHLPPSFTEIPCCLQLPLPLCIML
>gi568815590f:127638263_127840955|GENSCAN_predicted_CDS_7|438_bp
atgggaccacagccaaccaggcctgctgtagcccagaagctggaagagactcgcccagga
gggaaggaagacactggggtgttgggtctcagtctgggtcttggtgacagcagtgggctg
gcagtgcagaggttcaaggccttggactgcaagtccactaaggtggctggtggtgtgggt
ggaggtggaaaaggcaggtggggcgagggtgaggggcaggacatcctggagagcctgaat
aaccaggccttgcaaaggccatatctcttccttttcatgactctggaaattccatacata
ctttctgcattaggaagacattctctgcatgctgatctcgttaaaaaggaaaagccaagg
agcatcctgcacctccctcctagctttacagaaattccatgctgccttcagcttcctttg
ccgctctgcatcatgttg