GENSCAN 1.0	Date run: 4-Nov-116	Time: 06:27:03

Sequence gi568815587r:74596501_74830888 : 234388 bp : 43.23% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6147   6184   38  2  2   94  100    29 0.687   1.76
 1.02 Intr +   8192   8294  103  2  1   69   94    48 0.844   3.68
 1.03 Intr +  14999  15038   40  1  1   70  115    13 0.509  -0.00
 1.04 Intr +  16378  16510  133  2  1   81   82   175 0.861  15.90
 1.05 Intr +  22037  22304  268  2  1  107  101   241 0.987  24.83
 1.06 Intr +  23517  23589   73  2  1   83   54    57 0.745   0.78
 1.07 Intr +  28908  29073  166  1  1   63   97   142 0.612  11.62
 1.08 Intr +  32717  32823  107  2  2   42   86   102 0.880   5.26
 1.09 Intr +  33749  33819   71  0  2   67   89    35 0.695   0.40
 1.10 Intr +  39697  39775   79  0  1   80   82    44 0.589   2.12
 1.11 Intr +  44064  44233  170  1  2   81   37   203 0.201  14.17
 1.12 Intr +  48073  48123   51  0  0   93   87     5 0.048   0.00
 1.13 Intr +  64228  64267   40  0  1  111   87    59 0.739   5.90
 1.14 Intr +  72277  72352   76  2  1   79   75    91 0.115   5.47
 1.15 Intr +  77138  77166   29  2  2   84   96     8 0.141  -1.04
 1.16 Term +  78312  78349   38  1  2  122   37    40 0.276  -0.20
 1.17 PlyA +  78930  78935    6                               1.05

 2.00 Prom +  80134  80173   40                              -2.46
 2.01 Init +  83916  83958   43  2  1   67   84    75 0.729   5.58
 2.02 Intr +  95696  95830  135  0  0   68   87    49 0.620   3.34
 2.03 Intr +  96969  97073  105  1  0   67   86    54 0.413   3.29
 2.04 Term +  97585  97745  161  2  2   76   41    56 0.341  -2.20
 2.05 PlyA +  98597  98602    6                              -0.45

 3.12 PlyA -  99002  98997    6                               1.05
 3.11 Term - 100085 100009   77  0  2  114   42   137 0.977   9.70
 3.10 Intr - 100797 100705   93  1  0  130   97    75 0.994  12.54
 3.09 Intr - 106467 106294  174  1  0   85  113   156 0.993  17.71
 3.08 Intr - 106999 106805  195  2  0   84   86   166 0.999  15.39
 3.07 Intr - 108154 107986  169  1  1  110   81    75 0.789   8.52
 3.06 Intr - 111895 111713  183  1  0   58   61    90 0.569   3.28
 3.05 Intr - 114491 114349  143  0  2   93   61   111 0.974   8.97
 3.04 Intr - 116979 116886   94  0  1  113  106   -23 0.812   1.54
 3.03 Intr - 118783 118622  162  1  0   74   21   132 0.833   5.27
 3.02 Intr - 122332 122220  113  2  2   63   92   113 0.504   9.30
 3.01 Init - 134388 134307   82  0  1   63  105   143 0.853  12.64
 3.00 Prom - 149128 149089   40                              -6.66

 4.02 PlyA - 149212 149207    6                               1.05
 4.01 Sngl - 149614 149216  399  1  0   90   49   388 0.990  31.36
 4.00 Prom - 150853 150814   40                              -8.66

 5.00 Prom + 152061 152100   40                              -9.16
 5.01 Init + 152381 152882  502  1  1   90   49   382 0.534  27.69
 5.02 Term + 156807 156835   29  1  2   85   48     7 0.065  -5.36
 5.03 PlyA + 158327 158332    6                               1.05

 6.03 PlyA - 158482 158477    6                               1.05
 6.02 Term - 159327 159208  120  0  0   79   38    92 0.776   1.77
 6.01 Init - 163568 162393 1176  2  0   70   86   367 0.778  28.15
 6.00 Prom - 164519 164480   40                              -4.96

 7.02 PlyA - 164688 164683    6                               1.05
 7.01 Sngl - 165924 165631  294  0  0   88   54   221 0.751  14.40
 7.00 Prom - 181743 181704   40                              -2.36

 8.00 Prom + 185233 185272   40                              -4.36
 8.01 Init + 194158 194292  135  0  0   93   17   146 0.017   6.15
 8.02 Intr + 213684 213830  147  2  0   63   86   125 0.975  10.23
 8.03 Intr + 221096 221214  119  1  2   74   90    85 0.744   6.56
 8.04 Term + 230782 230899  118  1  1   74   49    40 0.156  -3.29
 8.05 PlyA + 231187 231192    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:74596501_74830888|GENSCAN_predicted_peptide_1|493_aa
YIIHITFSDPSERMLYDYVERKRKENSGAQLHVTYLVSGSLIQNGHSCHKVAVVREDKLE
AVKSKLAVTASIHVYSIQKAMLKDSGPLFNTDYDILKSNLQNCSKFSAIQCAAAVPRAPA
ESSSSSKKFEQSHLHMSSETQANNELTTNGHGPPASKQVSQQPKGIMGMFASKAAAKTQE
TNKETKTEAKEVTNASAAGNKAPGKGNMMSNFFGKAAMNKFKVNLDSEQAVKEEKIVEQP
TVSVTEPKLATPAGLKKSSKKAEPVKVLQKEKKRGKRVALSDDETKETENMRKKRRRIKL
PESDSSEDEETTLRGIENGKRGFRKKDISSTIQSSSGENKRKRKRVLKSKTYLDGEGCIV
TEKVYESESCTDSEEELNMKTSSVHRPPAMTVKKEPREERKGPKKGTAALGKANRQVRTT
HRLPPRLFSQRNLVRCAGFLFTFRHDLVLDMQSHSLDILEHYVEMPIQRLERKPSLTMPS
AVHKDLCDYIGLI

>gi568815587r:74596501_74830888|GENSCAN_predicted_CDS_1|1482_bp
tatatcatccacataaccttctcagatccttctgaaaggatgctgtatgattatgttgaa
aggaaacgaaaagaaaattcaggagcccaactgcatgttacctacttggtgtctggcagt
ctcattcagaatggacattcctgccacaaggttgcagtagtgagagaagataaattggaa
gcagtgaagtccaagctagctgtgactgccagcatccatgtgtacagcatccagaaagcc
atgctaaaggacagtgggcctctgttcaatactgactatgacatccttaaaagcaacttg
cagaactgcagcaaatttagtgctatacaatgtgcagctgccgtccctagagctcctgct
gaatcctcttcgtcttccaaaaagtttgagcagtcacatcttcacatgtcaagtgagaca
caagccaacaatgagctgaccaccaatggtcatggcccacctgcatccaagcaggtttcc
cagcagcccaaaggaattatgggaatgtttgcctccaaagctgctgctaaaacccaagaa
accaacaaggaaacgaaaacagaggctaaagaagtaacaaatgcatctgcagcaggcaac
aaggcaccagggaaagggaatatgatgagcaacttttttggaaaagctgctatgaataaa
tttaaagtcaatttggactcagaacaagcagtgaaagaagaaaaaatagtggagcagcct
acagtgtctgtcacggaaccaaagctggcaactcctgcaggcctgaaaaaatccagcaaa
aaagcagagcctgttaaggtgctgcagaaggaaaaaaaaagggggaagcgagtagcatta
tctgatgatgagacaaaggaaactgaaaacatgaggaaaaagaggagaagaatcaaactt
cctgaatctgatagcagtgaagatgaagaaactactctaagaggaattgaaaatggtaag
agaggcttccgaaaaaaggatattagctcaactatacaaagctcaagtggagaaaacaaa
agaaaacgaaaacgcgtactaaaatctaaaacttacctggatggggaaggctgcatagtg
actgaaaaagtctacgagagtgaatcctgcacagatagtgaagaggagcttaacatgaag
acatcctcagtacacagaccccctgccatgactgtgaaaaaagaacccagagaggaacga
aagggccccaagaaagggactgctgctctgggcaaagccaacagacaggtgcggaccacc
cacagattaccacccaggctctttagccagagaaacttggtaaggtgtgccggcttcctc
ttcaccttccgccatgatttggtgctggacatgcagtcccactcactagacattttggag
cattacgtggaaatgcccatccagcggttggagaggaaaccttctctgaccatgccctca
gcagttcataaggatctttgtgattatattgggctcatctag

>gi568815587r:74596501_74830888|GENSCAN_predicted_peptide_2|147_aa
MPEKEEKGTQVLTEGPGPFLCSGTHKDTESYHIKVESGKVDDRDWEGYGEREDEEHWAED
LLNPEGCGYLLSLSVCSAETSGFFSQKVSPSETPALATLRGSDAAPSSISSSRLEALKAQ
EGQLAPCTVLVLEHKAHPRAAPKLFYE

>gi568815587r:74596501_74830888|GENSCAN_predicted_CDS_2|444_bp
atgccagaaaaagaggagaaaggaacacaggtcctcaccgagggacccggccccttcctg
tgctctgggacccataaggatacagaatcctatcacataaaggtagagagtggaaaggta
gatgacagagactgggaagggtatggggagcgggaagatgaagagcactgggctgaagac
ctattaaacccagagggctgtggatacctcctgagcctcagcgtgtgctcagctgaaact
tctggcttcttctcccagaaggtatcaccaagtgagacaccagcactggcaacactgagg
ggctcggatgctgctccctcctctatctcctcctccagacttgaggctctaaaagcacag
gaagggcaattagctccctgcactgtgttggtgctcgagcataaagcccacccgagagca
gcgccgaagctgttttatgaatga

>gi568815587r:74596501_74830888|GENSCAN_predicted_peptide_3|494_aa
MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLR
CTCSEDVSSGRAETLLVPLIPSAQKKAWHPVDLQQVLSVNESTSTHQRQLSIVESTRALG
AHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSA
HELFPSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKGESVPPSGSTH
PPLHSMGLAMDCESREELQRHPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGA
GSTTVKIVLKEKHKKACVHGGKTYSHGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCP
TEYPCRHPEKVAGKCCKICPEDKADPGHSEISSTRCPKAPGRVLVHTSVSPSPDNLRRFA
LEHEASDLVEIYLWKLVKGIFHLTQIKKVRKQDFQKEAQHFRLLAGPHEGHWNVFLAQTL
ELKVTASPDKVTKT

>gi568815587r:74596501_74830888|GENSCAN_predicted_CDS_3|1485_bp
atggttcccgaggtgagggtcctctcctccttgctgggactcgcgctgctctggttcccc
ctggactcccacgctcgagcccgcccagacatgttctgccttttccatgggaagagatac
tcccccggcgagagctggcacccctacttggagccacaaggcctgatgtactgcctgcgc
tgtacctgctcagaggatgtgagctccgggagggcagaaaccttgttggtcccactaatt
cctagtgctcagaagaaggcctggcacccagtagacctgcagcaagtattgagcgtgaat
gagtccacgtctacccaccagaggcagctcagcatcgtggaaagtacacgggccctgggc
gcccatgtgagttgttaccgcctccactgtccgcctgtccactgcccccagcctgtgacg
gagccacagcaatgctgtcccaagtgtgtggaacctcacactccctctggactccgggcc
ccaccaaagtcctgccagcacaacgggaccatgtaccaacacggagagatcttcagtgcc
catgagctgttcccctcccgcctgcccaaccagtgtgtcctctgcagctgcacagagggc
cagatctactgcggcctcacaacctgccccgaaccaggctgcccagcacccctcccgctg
ccagactcctgctgccaggcctgcaaaggtgagtctgtccctccatctggcagcactcac
ccacccctccactccatgggcctagccatggactgtgagagccgtgaggaactgcagaga
catcctcaggatccatgttccagtgatgctgggagaaagagaggcccgggcaccccagcc
cccactggcctcagcgcccctctgagcttcatccctcgccacttcagacccaagggagca
ggcagcacaactgtcaagatcgtcctgaaggagaaacataagaaagcctgtgtgcatggc
gggaagacgtactcccacggggaggtgtggcacccggccttccgtgccttcggccccttg
ccctgcatcctatgcacctgtgaggatggccgccaggactgccagcgtgtgacctgtccc
accgagtacccctgccgtcaccccgagaaagtggctgggaagtgctgcaagatttgccca
gaggacaaagcagaccctggccacagtgagatcagttctaccaggtgtcccaaggcaccg
ggccgggtcctcgtccacacatcggtatccccaagcccagacaacctgcgtcgctttgcc
ctggaacacgaggcctcggacttggtggagatctacctctggaagctggtaaaaggaatc
ttccacttgactcagatcaagaaagtcaggaagcaagacttccagaaagaggcacagcac
ttccgactgctcgctggcccccacgaaggtcactggaacgtcttcctagcccagaccctg
gagctgaaggtcacggccagtccagacaaagtgaccaagacataa

>gi568815587r:74596501_74830888|GENSCAN_predicted_peptide_4|132_aa
MAEEGIAAGGVMEVNTALQEVLKTALIHDGLARGIREAAKVLDKRQAHLCVLASNCDEPM
YVKLVEALCAEHQINLIKVDDNKKLGEWVGLCKIDREGKPHKVVGYSCVVVKDYGKESQA
KDVIEEYFKCKK

>gi568815587r:74596501_74830888|GENSCAN_predicted_CDS_4|399_bp
atggccgaggaaggcattgctgctggaggtgtaatggaggttaatactgctttacaagag
gtgctgaagaccgccctcatccacgatggcctagcacgtggaattcgcgaagctgccaaa
gtcttagacaagcgccaagcccatctttgtgtgcttgcatccaactgtgatgagcctatg
tatgtcaagttggtggaggccctttgtgctgaacaccaaatcaacctaattaaggttgat
gacaacaagaaactaggagaatgggtaggcctctgtaaaattgacagagaggggaaaccc
cataaagtggttggttacagttgtgtagtagttaaggactatggcaaggagtctcaggcc
aaggatgtcatcgaagagtatttcaaatgcaagaaatga

>gi568815587r:74596501_74830888|GENSCAN_predicted_peptide_5|176_aa
MAAAGPSTRASSAAAAAALSRRGRRGRCDETAAAKTGAPGPASGPSLLVLSPPLLQPPLP
PRPEESGCAGCLEPPGEAAALPCGHSLCRGCAQRAADAAGPGCPRCRARGPGWARRRARD
DGQADSEVLGECARRSQPERCRPRRDGGAAAAGPRPEQEPRAAPAEPEEKTEAQNR

>gi568815587r:74596501_74830888|GENSCAN_predicted_CDS_5|531_bp
atggcggctgcaggtccgagtactcgggcctcttccgcggcggcagcagccgctctgagt
cggcggggccggcggggccgctgtgacgagacggcggcagctaagactggggccccaggc
ccggcttctggaccttcgctgttggtgttgtcgccgccgttgctgcagccgccgctgccg
ccgcggccggaggaatcgggctgcgccgggtgcctggagccccccggagaagcagcggcc
ctgccgtgcggccactcgctttgccgaggctgcgcccaacgcgccgccgacgcggcgggc
ccgggttgccctcgctgccgcgcccgcggcccaggctgggcccgccgtcgggcccgcgac
gacggccaggccgactcagaggtgctgggcgagtgcgcccgccgcagccaacccgagcgc
tgccgcccgcgccgggacgggggcgcggctgccgcggggcccaggccagagcaggagccg
cgtgccgcgcctgcggagccagaagagaaaactgaggcccagaacagataa

>gi568815587r:74596501_74830888|GENSCAN_predicted_peptide_6|431_aa
MDTFLDTYTLPRLNQEEAESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFDEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA
NQIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKI
QQPFMLKTLNKLGTDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF
NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV
SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENHKPL
LKEIKEDTNKWKNIPCSWVGRINIVKMAILPKSTLHLYLAFCQSLNIPSTFKPFYMQALL
PGMDIPSFLIW

>gi568815587r:74596501_74830888|GENSCAN_predicted_CDS_6|1296_bp
atggatacattcctcgacacatacactctcccaagactaaaccaggaagaagctgaatct
ctgaatagaccaataacaggagctgaaattgtggcaataatcaatagtttaccaaccaaa
aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaagaggaactg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca
tttgatgaggccagcatcattctgataccaaagccgggcagagacacaaccaaaaaagag
aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca
aaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaatatccgcaaatcaataaatgtaatccagcatataaacaga
gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaatt
caacaacccttcatgctaaaaactctcaataaattaggtactgatgggacgtatttcaaa
ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg
gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc
aacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaaaggtatt
caattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtttatcta
gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaa
acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatat
ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaaccacaaaccactg
ctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtagga
agaatcaatatcgtgaaaatggccatactgcccaagtccacactccacctatatttagct
ttttgccagtccctgaacatcccaagcaccttcaagcctttctacatgcaggctcttcta
cctggaatggacatcccctccttcctcatctggtga

>gi568815587r:74596501_74830888|GENSCAN_predicted_peptide_7|97_aa
MGKKHNRKTGNSKRQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ
TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKA

>gi568815587r:74596501_74830888|GENSCAN_predicted_CDS_7|294_bp
atggggaaaaaacacaatagaaaaactggaaactctaaaaggcagagcgcctctcctcct
ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgattttgac
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgtttaaaggagctgatggagctgaaaaccaaggcttga

>gi568815587r:74596501_74830888|GENSCAN_predicted_peptide_8|172_aa
MGSSCCALLWLGGELQVLAWALAPVRLWLDQDYCKWLPLLALGNVLREEKLQEEKPSEDQ
IHKLLPEDTETGKRKMDEQKKRDEPLVLKTNLERCPARLSDSENEEPSRGQMTQTHRSAF
VSKNNSYSLAFLAGHGWGVGTKRLHKAARPWAQPTKPFSPPSFLGLQWEGLL

>gi568815587r:74596501_74830888|GENSCAN_predicted_CDS_8|519_bp
atggggtccagctgctgtgccttgctgtggctgggtggggagctccaggtgctggcatgg
gcactggctcctgtgaggctgtggctggaccaggactactgcaagtggcttccactgctg
gcactggggaacgtgctgagagaagaaaagttacaagaggaaaaaccctctgaagatcaa
atccacaagctgttaccagaggatacagaaacagggaaaaggaaaatggatgaacagaaa
aaaagagatgaaccattagtactgaaaacaaatctggaacgttgtcctgcacgtctctca
gattcagagaatgaagaaccttctcgaggccagatgacacagacacatcgctcggcattt
gtttccaagaacaactcctactccttagctttcctggcaggccacggctggggtgtgggc
accaagagacttcacaaagcagcaaggccctgggcccagcccacgaaaccattttctcct
cctagctttctgggcctgcagtgggagggcttgctgtga