GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:50:07 Sequence gi568815597f:44113754_44320615 : 206862 bp : 45.91% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5179 5235 57 0 0 70 109 76 0.262 7.27 1.02 Intr + 15600 15807 208 2 1 76 48 49 0.107 -1.65 1.03 Intr + 16219 16443 225 2 0 78 72 156 0.234 10.96 1.04 Term + 16759 17003 245 2 2 107 42 128 0.966 6.16 1.05 PlyA + 18853 18858 6 1.05 2.05 PlyA - 21495 21490 6 1.05 2.04 Term - 24258 24068 191 1 2 91 46 120 0.980 5.71 2.03 Intr - 27926 24911 3016 2 1 111 117 2529 0.946 245.47 2.02 Intr - 37698 37304 395 1 2 58 39 385 0.451 24.97 2.01 Init - 58099 57937 163 1 1 82 21 114 0.461 2.00 2.00 Prom - 67082 67043 40 -3.36 3.00 Prom + 89735 89774 40 -1.76 3.01 Init + 95540 95613 74 1 2 25 79 81 0.404 1.44 3.02 Intr + 98977 99158 182 1 2 141 41 40 0.759 4.11 3.03 Intr + 99996 100105 110 1 2 98 64 158 0.818 14.40 3.04 Intr + 100597 100692 96 0 0 77 11 138 0.518 5.21 3.05 Intr + 100987 101145 159 0 0 -6 121 160 0.920 10.08 3.06 Intr + 104558 104716 159 1 0 50 105 228 0.988 20.88 3.07 Intr + 104835 105002 168 2 0 107 115 255 0.999 30.24 3.08 Intr + 105303 105488 186 2 0 88 66 381 0.994 35.79 3.09 Intr + 105653 105724 72 1 0 93 87 82 0.641 8.20 3.10 Intr + 106008 106125 118 2 1 67 92 113 0.469 9.64 3.11 Intr + 106264 106556 293 2 2 90 103 433 0.978 41.85 3.12 Term + 106806 106865 60 2 0 114 54 82 0.985 5.20 3.13 PlyA + 106901 106906 6 -5.99 4.13 PlyA - 107360 107355 6 1.05 4.12 Term - 107887 107805 83 2 2 110 38 182 0.896 13.26 4.11 Intr - 123291 123106 186 1 0 6 89 115 0.282 3.06 4.10 Intr - 134285 134186 100 2 1 130 57 89 0.333 9.68 4.09 Intr - 135172 135149 24 1 0 114 65 20 0.532 0.52 4.08 Intr - 171154 171082 73 0 1 108 79 50 0.463 5.51 4.07 Intr - 173140 172993 148 2 1 91 10 50 0.124 -3.21 4.06 Intr - 180199 180116 84 2 0 64 92 34 0.186 1.19 4.05 Intr - 181838 181728 111 0 0 81 95 52 0.169 5.55 4.04 Intr - 188115 187973 143 2 2 6 55 91 0.003 -2.40 4.03 Intr - 194648 194557 92 2 2 68 84 73 0.032 3.69 4.02 Intr - 199475 199416 60 2 0 107 92 41 0.252 5.33 4.01 Intr - 205991 205875 117 2 0 75 94 27 0.252 2.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:44113754_44320615|GENSCAN_predicted_peptide_1|244_aa MEQEAGELSRWQAAHQAAQDNENSAPILNMSSSSGSSGVHTSWNQGLPSIQHFPHSAEML GSPLVSVEAPGQNVNEGGPQFSMPLPERGSQDSLVSQPDSQEGPFLPEQPGPAPQTVEKN SRPQEGTGRRGSSEARPYCCNYENCGKAYTKRSHLVSHQRKHTGERPYSCNWESCSWSFF RSDELRRHMRVHTRYRPYKCDQCSREFMRSDHLKQHQKTHRPGPSDPQANNNNGEQDSPP AAGP >gi568815597f:44113754_44320615|GENSCAN_predicted_CDS_1|735_bp atggaacaggaggctggggagctgagccggtggcaggcggcgcaccaggctgcccaggat aacgagaactcagcgcccatcttgaacatgtcttcatcttctggaagctctggagtgcac acctcttggaaccaaggcctaccaagcattcagcactttcctcacagcgcagagatgctg gggtcccctttggtgtctgttgaggcgccggggcagaatgtgaatgaaggggggccacag ttcagtatgccactgcctgagcgtggatctcaggactctcttgtcagtcagccagactct caagaaggcccatttctaccagagcagcccggacctgctccacagacagtagagaagaac tccaggcctcaggaagggactggtagaaggggctcctcagaggcaaggccttactgctgc aactacgagaactgcggaaaagcttataccaaacgctcccacctcgtgagccaccagcgc aagcacacaggtgagaggccatattcttgcaactgggaaagttgttcatggtctttcttc cgttctgatgagcttagacgacatatgcgggtacacaccagatatcgaccatataaatgt gatcagtgcagccgggagttcatgaggtctgaccatctcaagcaacaccagaagactcat cggccgggaccctcagacccacaggccaacaacaacaatggagagcaggacagtcctcct gctgctggtccttag >gi568815597f:44113754_44320615|GENSCAN_predicted_peptide_2|1254_aa MQPGAGILVLGLLLLPQDAPGLSSVASLNCLSHPRHYDWKHVFWKLSSCWDSGTEKLCEY NKCGKSFCQKESLPTSQKIRTEERPFESKDCGKVLIQKSNFIQYQRMHTEEKPFVYKECG KTFSGKSNLTEQEKFHIGEKPFKCSECGTAIGQKKKYLIRNHTGEKSYECKECGNIFCQQ IPLIVDDPITKDLVQSSGNIKEMDSSLLQAIEEIEKFFQHLSERHTEQAETPDAPEPQNC MPLTAHAEESQHESTQSKTMPPLGSTMMTSACTNIPGTVLTQDLTMHPLKALEDLSETYS MGQKVTSFDQVKHTAGSQMTDVTVTPKSSPTDCQKTTITASNMTISNESSQLNTPSSDQT LNESQIPALLGDQMKTLSDNQTLCGDQVTFSSDQTLTDGHTVTSGSDETLSGGQMTTSLD LYGGQMMTSIDNQTLCGEQMTTSSGNQAFYGRQMTTSTGNQTLCGEQMTTSTGNQALYGG QMTTSASNQTLCGEQMTTSTSNQTLCGEQVMTSTGNQALCGGQMTTSTGNQNLYGGQMMT STGNQTLYWGQMMTSTGNQNLCGEQVMTSTGNQALCGGQMTTSTGNQNLYGGQMMTSTGN QTLYWGQMMTSTGNQNLCGEQVMTSTGNQALCGGQMTTSTGNQNLCGEQVMTSTSNQTLC GEQTTTSTSNQTLCGEQVTTSTGNQALYGGQMMTSTGNQTLYWGQMMTSTGNQNLCGEQM TTSTGNQALYGGQMTTSTSNQTLCGEQMTTPTSNQTLCGEQVTTSTGNQALYGGQITTST SNQTLCGEQMTTSTSNQTLCGEQVTTSTGNQALYGGQMMTSTGNQALYGGQMTTSASNQT LCGEQMTTSTSNQTLCEEQVMTSTGNQALCGEQMTTSTGNQALYGGQMTTSTSNQTLCGE QTTTSTSNQTLCGEQVTTSTGNQALYGGQMMTSTGNQTLYWGQMMTSTGNQNLCGEQMTT STGNQALYGGQMTTSTSNQTLCGEQMTTPTSNQTLCGEQVTTSTGNQALYRGQITTSTSN QTLCGEQMTTSTSNQTLCGEQVTTSTGNQALYGGQMMTSTGNQNLYGGQNMTSTDNQALY GGQMATYSGNQTLYGDQMLTLQVGNMTTLTDDHSLYGGYMMSHQFSSLPYPGFLCFSSSH LIQGQLPKQKTQSCQFWKNPEVSRPYVCTYEDCKMSYSKACHLRTHMRKHTGEKPYVCDV EGCTWKFARSDELNRHKKRHTGERPYLCSICSKNFARSDHLKQHAKVHNIRPGL >gi568815597f:44113754_44320615|GENSCAN_predicted_CDS_2|3765_bp atgcagcctggggctggcatccttgttctgggcctgctgctcctgcctcaggacgcccca ggcctgtcctcagtggcttccctgaactgcctctcccaccccaggcattatgactggaaa catgttttctggaagctaagcagctgctgggactcagggactgagaagctttgtgaatat aacaaatgtggaaaatccttttgccagaaggaaagtctccctaccagtcagaaaattcgc actgaagagagaccttttgagagtaaagactgtgggaaagttctcattcagaagtcaaac ttcatccagtaccagagaatgcacacagaagagaaaccctttgtgtataaggagtgtgga aaaacctttagtggaaaatcaaaccttactgagcaagagaaatttcatattggagagaaa ccctttaaatgtagtgaatgtggaacagctattggccagaagaagaagtacctcataaga aaccatactggagagaaatcctatgagtgtaaggaatgtggaaacatcttctgtcagcaa ataccacttattgtagatgatcccataactaaagatctagttcagtcatcaggcaacatc aaagagatggattccagtcttctccaggcaattgaggaaattgagaaatttttccagcat ctctctgagcggcatacagaacaggcagaaaccccagatgcaccagaaccacagaactgt atgcctctgactgcccatgctgaggagagccaacatgagtcaacccagagcaagacgatg cctcccttggggtccacaatgatgacttctgcatgcactaacatccctgggacagttctc acccaggacttaacaatgcaccctcttaaagcccttgaagacctaagtgagacttattcc atgggccagaaagtgacttcctttgatcaggtaaaacacacagcaggctcccagatgaca gatgttactgtcaccccaaagtcatcacccactgattgccagaagacaaccatcactgca agcaacatgacaatctccaatgaaagtagccagctgaacaccccaagcagtgaccagacc ctcaatgagagtcagataccagccctgcttggagatcagatgaagaccctcagtgataac cagactctctgtggggaccaggtgaccttcagtagtgaccagaccctcactgatggtcat acagtgacttccggtagcgatgagaccctctctgggggccaaatgacaaccagtctagac ctctatggagggcaaatgatgacttccattgataaccagaccctctgtggggagcagatg acgacctccagtggtaaccaggccttctacgggagacagatgacgacctccactggtaac cagaccctctgtggagagcagatgacaacctccactggtaaccaggccctctatgggggg cagatgacgacctccgctagtaaccagaccctctgtggagagcagatgacaacctccact agtaaccagaccctctgtggggagcaagtgatgacctccactggtaaccaagccctctgt ggagggcagatgacgacctccactggtaaccagaacctctatggggggcagatgatgacc tccactggtaaccagaccctctactgggggcagatgatgacctccactggtaaccagaac ctctgtggagagcaagtgatgacctccactggtaaccaagccctctgtggagggcagatg acgacctccactggtaaccagaacctctatggggggcagatgatgacctccactggtaac cagaccctctactgggggcagatgatgacctccactggtaaccagaacctctgtggagag caagtgatgacctccactggtaaccaagccctctgtggagggcagatgacgacctccact ggtaaccagaacctctgtggagagcaagtgatgacctccactagtaaccagaccctctgt ggagagcagacgacgacctccactagtaaccagaccctctgtggggagcaggtgacgacc tccactggtaaccaggccctctacggggggcagatgatgacctccactggtaaccagacc ctctactgggggcagatgatgacctccactggtaaccagaacctctgtggagagcagatg acaacctccactggtaaccaggccctctacggggggcagatgacaacctctactagtaac cagaccctctgtggagagcagatgacgacccccactagtaaccagaccctctgtggggag caggtgacgacctccactggtaaccaggccctctacggggggcagatcacgacctccact agtaaccagaccctctgtggagagcagatgacgacctccactagtaaccagaccctttgt ggggagcaggtgacgacctccactggtaaccaggccctctatggggggcagatgatgacc tccactggtaaccaggccctctacggggggcagatgacaacctctgctagtaaccagacc ctctgtggagagcagatgacaacctccactagtaaccagaccctctgtgaggagcaggtg atgacctccactggtaaccaggccctctgtggagagcagatgacaacctccactggtaac caggccctctacggggggcagatgacgacctccactagtaaccagaccctctgtggagag cagacgacgacctccactagtaaccagaccctctgtggggagcaggtgacgacctccact ggtaaccaggccctctatggggggcagatgatgacctccactggtaaccagaccctctac tgggggcagatgatgacctccactggtaaccagaacctctgtggagagcagatgacaacc tccactggtaaccaggccctctacggggggcagatgacgacctctactagtaaccagacc ctctgtggagagcagatgacgacccccactagtaaccagaccctctgtggggagcaggtg acgacctccactggtaaccaggccctctacagggggcagatcacgacctccactagtaac cagaccctctgtggagagcagatgacgacctccactagtaaccagaccctctgtggggag caggtgacgacctccactggtaaccaggccctctatggggggcagatgatgacctccact ggtaaccagaacctctacggggggcagaatatgacctccactgataaccaggccctctat ggaggccagatggcaacatatagtggcaaccagaccctctatggggaccagatgctgact cttcaggtgggcaatatgacaacgctcactgatgaccacagcctttatgggggatatatg atgtcccatcagttctcatcattgccatacccaggattcctatgcttttcaagttcccat ttgattcaaggacaactcccaaaacagaagacacagagctgccagttctggaagaatcct gaggtttcaaggccctatgtctgcacttacgaggactgcaagatgtcttattcaaaggct tgccacctccgaacccacatgcgcaagcacaccggggagaagccctatgtatgcgatgta gagggatgtacgtggaaatttgcccgctcagatgagctcaacagacacaagaaaaggcac actggggaaaggccctacctgtgttcaatatgcagcaagaattttgcaagatctgatcac ctaaagcagcatgcaaaggtccacaacattcgcccaggattatga >gi568815597f:44113754_44320615|GENSCAN_predicted_peptide_3|558_aa MRVLLEISIAKIGNKVVFATASVVRKIPRFIPLRDTSKLVPAWGCVLGFFVAQRVQLAPV LVKCHEAGLQEIPTRPQPTAISIRGGAMATGADVRDILELGGPEGDAASGTISKKDIINP DKKKSKKSSETLTFKRPEGMHREVYALLYSDKKQGYRTVKAKLGSKKVRPWKWMPFTNPA RKDGAMFFHWRRAAEEGKDYPFARFNKTVQVPVYSEQEYQLYLHDDAWTKAETDHLFDLS RRFDLRFVVIHDRYDHQQFKKRSVEDLKERYYHICAKLANVRAVPGTDLKIPVFDAGHER RRKEQLERLYNRTPEQVAEEEYLLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRT ERKAPKKKLPQKKEAEKPAVPETAGIKFPDFKSAGVTLRSQRPLVAGTGLACQLSLQMKL PSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNELRSDLVLLYELKQACANCEYELQ MLRHRHEALARAGVLGGPATPASGPGPASAEPAVTEPGLGPDPKDTIIDVVGAPLTPNSR KRRESASSSSSVKKAKKP >gi568815597f:44113754_44320615|GENSCAN_predicted_CDS_3|1677_bp atgagagttctgttggagatcagcattgcaaaaatcgggaataaagttgtctttgctaca gcatcagtagtaaggaagatccccaggtttattcccttgagggatacttctaagctggtt cctgcttggggctgtgttttgggcttcttcgtagcccaaagagtgcagctagctcctgta ctagtgaagtgccacgaagctggcctccaggaaatccccacgaggccacaacccacagct atctccatcagaggaggcgcgatggctacgggcgcggatgtacgggacattctagaactc gggggtccagaaggggatgcagcctctgggaccatcagcaagaaggacattatcaacccg gacaagaaaaaatccaagaagtcctctgagacactgactttcaagaggcccgagggcatg caccgggaagtctatgccttgctctactctgacaagaagcaaggataccgtacagtgaag gccaagttgggctccaagaaggtgcggccttggaagtggatgccattcaccaacccggcc cgcaaggacggagcaatgttcttccactggcgacgtgcagcggaggagggcaaggactac ccctttgccaggttcaataagactgtgcaggtgcctgtgtactcggagcaggagtaccag ctttatctccacgatgatgcttggactaaggcagaaactgaccacctctttgacctcagc cgccgctttgacctgcgttttgttgttatccatgaccggtatgaccaccagcagttcaag aagcgttctgtggaagacctgaaggagcggtactaccacatctgtgctaagcttgccaac gtgcgggctgtgccaggcacagaccttaagataccagtatttgatgctgggcacgaacga cggcggaaggaacagcttgagcgtctctacaaccggaccccagagcaggtggcagaggag gagtacctgctacaggagctgcgcaagattgaggcccggaagaaggagcgggagaaacgc agccaggacctgcagaagctgatcacagcggcagacaccactgcagagcagcggcgcacg gaacgcaaggcccccaaaaagaagctaccccagaaaaaggaggctgagaagccggctgtt cctgagactgcaggcatcaagtttccagacttcaagtctgcaggtgtcacgctgcggagc caacggcctcttgtggctgggacaggcttagcttgccagctttcattgcagatgaagctg ccaagctctgtgggacagaagaagatcaaggccctggaacagatgctgctggagcttggt gtggagctgagcccgacacctacggaggagctggtgcacatgttcaatgagctgcgaagc gacctggtgctgctctacgagctcaagcaggcctgtgccaactgcgagtatgagctgcag atgctgcggcaccgtcatgaggcactggcccgggctggtgtgctagggggccctgccaca ccagcatcaggcccaggcccggcctctgctgagccggcagtgactgaacccggacttggt cctgaccccaaggacaccatcattgatgtggtgggcgcacccctcacgcccaattcgaga aagcgacgggagtcggcctccagctcatcttccgtgaagaaagccaagaagccgtga >gi568815597f:44113754_44320615|GENSCAN_predicted_peptide_4|406_aa EIIEFPILKLNGRTMEIESTFHMYVQPVVHPQLTPFCTELTGIIQAMVDGQPSLQQVLER VDEWMAKEGLLDPNVKSIFVTCGDWDLKVMPTTTYVASRVQGDLSKSVRSTSETWIAKGP NFSFYLEDRNSSSAIGVGILLEIAIEGADPSISRKCWHSGSSLKSAPVESCSGPWLSLPG WVHPGLEPSPVLHGSRDPCQARGTLEVDKDMWLGEGNSDGSFLDSQILEGFCEEKMEVLF SVFSKDILWTHGLPGQCQYLGLPVADYFKQWINLKKVEVAGSQVAYSFAMGCWPKNGLLD MNKGLSLQHIGRPHSGIAHLRVNALDYLKGGQEEQVGLLESLEEVGLARGTALEGHKHKI KKSPVAQVGRGSGSACFGGDDCKNIANIMKTLAYRGFIFKQTSKPF >gi568815597f:44113754_44320615|GENSCAN_predicted_CDS_4|1221_bp gaaatcatcgagttccccatcctaaagctaaatggccggaccatggagattgagtctacc tttcacatgtatgtccagcctgtagtccatccacagcttaccccattctgtacagagctc accgggattattcaagccatggtggatggtcagccaagcctgcagcaagtgctggagagg gtcgatgaatggatggcgaaggaaggcctcttagatccaaacgtcaagtcaatttttgtc acctgtggagactgggacttaaaagtcatgcctacaaccacctatgtggctagcagagtg cagggtgacctttccaagtctgtgagaagtacatcggaaacttggatagccaaaggcccc aatttctctttttacctagaagaccgcaattcctcctcagccattggagtgggaatcctg ttggaaattgccattgaaggtgctgaccccagcatatcccggaagtgctggcattccggt agtagcctaaagtcagcacctgttgagagctgctctgggccctggctttctctgcccggc tgggttcaccctggcctggagcccagcccagtgttacatggcagccgtgacccgtgccag gccagaggaactcttgaagttgataaggatatgtggcttggagaagggaactctgatgga agctttctggactctcagatcttggaagggttctgtgaggaaaagatggaagtattgttc tctgtgttttctaaagacatactttggacccatgggctcccaggccagtgccagtacttg ggcttgccagtggcggattacttcaagcagtggattaatctgaaaaaggtggaggttgct ggcagccaggtggcttacagcttcgccatgggctgctggcccaagaatggacttctagac atgaacaagggcctcagcctgcaacacataggccggccccacagcggcattgcccatttg agggttaatgcccttgactacctcaagggtgggcaagaagagcaagtaggactcctggag tccctggaggaggtgggcctagcacgagggacagccttagaaggccacaagcacaagatc aagaaatcaccagttgctcaagttggcagaggtagtgggagtgcatgttttggaggtgac gactgcaagaacattgccaacatcatgaagacactcgcctatcgaggcttcatcttcaag cagacatcgaagccgttctga