GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:53:44

Sequence gi568815591r:96920739_97124623 : 203885 bp : 41.27% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1148   1257  110  2  2   64  113    65 0.865   5.61
 1.02 Term +  12610  12776  167  2  2   24   44   152 0.099   1.50
 1.03 PlyA +  15232  15237    6                               1.05

 2.00 Prom +  23496  23535   40                              -4.95
 2.01 Init +  33094  33191   98  0  2   85  106    62 0.769   7.53
 2.02 Intr +  59272  59363   92  1  2   51   92    47 0.062   0.12
 2.03 Intr +  71518  71601   84  2  0   66  115    79 0.001   7.27
 2.04 Intr +  78058  78178  121  2  1   70   54   110 0.039   4.43
 2.05 Intr +  83221  83348  128  1  2   -9   66   227 0.058  10.20
 2.06 Intr +  83413  83627  215  2  2   63   86   141 0.093   8.91
 2.07 Term +  84165  84281  117  2  0   74   34    62 0.054  -3.04
 2.08 PlyA +  84355  84360    6                               1.05

 3.00 Prom +  85043  85082   40                              -9.35
 3.01 Init +  85249  85675  427  0  1   82   94   671 0.593  63.61
 3.02 Intr +  86137  86500  364  2  1   56   16   229 0.575   5.82
 3.03 Intr +  86577  86791  215  0  2   37   76   160 0.572   7.14
 3.04 Intr +  86900  87093  194  0  2  113   91   189 0.918  19.99
 3.05 Intr +  89058  89220  163  2  1  104   10    99 0.680   2.33
 3.06 Term +  89280  89407  128  1  2   87   49   155 0.984   8.96
 3.07 PlyA +  90039  90044    6                              -0.45

 4.12 PlyA -  90555  90550    6                               1.05
 4.11 Term -  93038  92845  194  0  2   76   48   124 0.030   3.80
 4.10 Intr -  96894  96753  142  0  1   45   58    94 0.050   1.11
 4.09 Intr - 100327 100202  126  1  0  101   42   180 0.258  14.66
 4.08 Intr - 101631 101447  185  1  2   46   99   145 0.998   9.99
 4.07 Intr - 103789 103531  259  1  1   52   88   287 0.820  21.21
 4.06 Intr - 103979 103893   87  2  0  129   60   116 0.808  12.15
 4.05 Intr - 105704 105567  138  2  0   19   80   115 0.582   3.54
 4.04 Intr - 105857 105716  142  1  1   20   21   131 0.360  -0.97
 4.03 Intr - 106981 106855  127  2  1   58   28   122 0.190   2.02
 4.02 Intr - 113583 113378  206  2  2   38   34   136 0.218   1.12
 4.01 Init - 113870 113818   53  2  2   88   97    59 0.264   5.65
 4.00 Prom - 122084 122045   40                              -5.95

 5.00 Prom + 126028 126067   40                              -6.15
 5.01 Init + 130103 130243  141  1  0   87   26    84 0.841   2.28
 5.02 Intr + 130643 130823  181  1  1  104   39    96 0.918   4.82
 5.03 Term + 131092 131522  431  2  2   -8   41   313 0.892  11.48
 5.04 PlyA + 131573 131578    6                               1.05

 6.00 Prom + 136043 136082   40                              -3.65
 6.01 Init + 139441 139576  136  0  1   74   72    68 0.679   4.15
 6.02 Term + 139882 140294  413  2  2   46   49   167 0.930   3.02
 6.03 PlyA + 140667 140672    6                               1.05

 7.03 PlyA - 141459 141454    6                               1.05
 7.02 Term - 154201 153975  227  2  2  -36   39   281 0.544   6.76
 7.01 Init - 154401 154302  100  0  1   78   73    51 0.565   3.09
 7.00 Prom - 176073 176034   40                              -4.65

 8.00 Prom + 180877 180916   40                              -6.55
 8.01 Init + 180969 181186  218  2  2   74   79   104 0.205   6.31
 8.02 Intr + 194899 195005  107  1  2  120   64    64 0.770   6.14
 8.03 Intr + 196417 196619  203  2  2    6   54   139 0.325   0.38
 8.04 Term + 196920 197222  303  2  0   -8   32   350 0.801  13.99
 8.05 PlyA + 197512 197517    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  76896  76775  122  2  2   87   92    73 0.872   6.82
S.002 Intr -  78485  78320  166  0  1  107   36   101 0.866   4.90

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:96920739_97124623|GENSCAN_predicted_peptide_1|92_aa
XYTQLSPSESGITDVETYFCRAVYDDIYELCLTVISQEWEPETLKQSHTFTEQQEKGLPA
MSPYGKICPGFKAPSTALGTIKERLWQISTDT

>gi568815591r:96920739_97124623|GENSCAN_predicted_CDS_1|279_bp
nnatacacacaattgagcccttctgaatctggcatcacagacgttgagacatacttttgc
agagcagtttatgatgacatttatgagctatgcctcacagtcatctctcaagaatgggag
ccagaaacactaaaacaaagtcatactttcacagagcagcaagagaaaggacttccagcg
atgagcccctatggaaaaatctgccctggcttcaaggccccaagcacagccctgggaacc
attaaagagaggctctggcagatctcaactgacacatag

>gi568815591r:96920739_97124623|GENSCAN_predicted_peptide_2|284_aa
MRSGQGPQRISSGKGHNEGTLGNQITKLHLWVGWCKSLLTLSSNFQLNQKLDIQGRGQEC
GQPGWMRGKGEFGGIRDRRFLVEKYKSVRPAVRVSLQICTRQWLVLGGSLPPEFHYRAPG
SLQTAARGSQLRSKVGVQTERIFRSRPPNLKPLEGRAAGDLLQSRDPEPGTLDAGLNHAP
GLRCLCLAALTCVGLETGGRKARSAALGCSELNVFPQSIFEREVVLPPGCADINPLANNK
WINNNKNTLLRSEKKKKTPPPPTTFCTIYKKYGLKRQGPSQNRV

>gi568815591r:96920739_97124623|GENSCAN_predicted_CDS_2|855_bp
atgagaagtggacaaggtccacagcgtatcagctcaggtaagggccacaacgaagggact
ctgggtaaccagattacaaaattacacctctgggttggatggtgcaagagtcttctaact
ctgtcttccaacttccagttaaaccagaagctggacattcaagggaggggccaggaatgt
ggccagccaggctggatgagggggaagggggagtttgggggtattcgggacaggaggttt
ctggtggaaaaatacaaatcggttcgccctgcagtgagggtttctctacaaatctgtaca
agacagtggctggttcttggaggatctctgcctcctgaattccattatcgggcccctggt
tccctgcagacggcagctcgtgggagccagctgcgatcgaaagtcggggtccagaccgag
aggatcttcaggtctcgtccacccaatctcaagcctctggaagggcgggctgccggcgac
ctcctccagtcccgggacccggaacccgggaccctggacgccggcctgaaccacgcaccg
gggctccgctgtctctgcttggctgccttaacctgcgtgggtctggagaccggagggaga
aaggctagatccgcagccttgggctgctccgaacttaatgtctttccccagtccatcttt
gaaagagaagtggttctaccacctggatgtgcggatataaatccccttgcaaataataaa
tggattaataataacaagaacacgctgctgcgttctgaaaagaaaaaaaagacgcccccc
ccccccacaaccttttgtacaatatataagaaatacggcctgaaacggcagggaccgagc
cagaacagggtctga

>gi568815591r:96920739_97124623|GENSCAN_predicted_peptide_3|496_aa
MTTMADGLEGQDSSKSAFMEFGQQQQQQQQQQQQQQQQQQQPPPPPPPPPQPHSQQSSPA
MAGAHYPLHCLHSAAAAAAAGSHHHHHHQHHHHGSPYASGGGNSYNHRSLAAYPYMSHSQ
HSPYLQSYHNSSAAAQTRGDDTVTRREAGLAFALVKLVTVRLFPVGFKVSLTFIPDRAPY
LAFRYSVGCSFVRGGEAKNLGLMVLSTRGGQSCRERRSLLVGWSELLVVFCLRIWSSSGE
SFQNMPRMLDALSLELVTSSQLAACRLAALRTPESLRHRLRCGKAGCLSEGLWRSLGPDA
AHCSGQGLGQGDYADAGSTRAEWTKRGRGIIGIFRDQQKTTVIENGEIRFNGKGKKIRKP
RTIYSSLQLQALNHRFQQTQYLALPERAELAASLGLTQTQVKIWFQNKRSKFKKLLKQGS
NPHESDPLQGSAALSPRSPALPPVWDVSASAKGVRHDAETTDDVSCPREHPRETSEQGKE
DPGPACICEKEPKEQA

>gi568815591r:96920739_97124623|GENSCAN_predicted_CDS_3|1491_bp
atgactacgatggctgacggcttggaaggccaggactcgtccaaatccgccttcatggag
ttcgggcagcagcagcagcagcagcagcaacagcagcagcagcagcagcagcaacagcaa
cagccgccgccgccgccgccgccgccgccgcagccgcactcgcagcagagctccccggcc
atggcaggcgcgcactaccctctgcactgcctgcactcggcggcggcggcggcagcggcc
ggctcgcaccaccaccaccaccaccagcaccaccaccacggctcgccctacgcgtcgggc
ggagggaactcctacaaccaccgctcgctcgccgcctacccctacatgagccactcgcag
cacagcccttacctccagtcctaccacaacagcagcgcagccgcccagacgcgaggggac
gacacagtgactcggagggaagctggcctggctttcgctctggttaaactagtaacagtc
aggctctttccagtgggctttaaagtgtccctcaccttcatccctgaccgggccccctat
ctagcttttcgctacagtgttggctgctcctttgttcgtggaggggaagcaaagaacctc
gggctgatggttctcagcacccgagggggacaaagttgccgtgagaggcgcagcctgctc
gtgggctggtctgagctgttggttgtgttttgtttgaggatttggtcctcctcaggagag
tcatttcaaaacatgccgaggatgcttgatgcactgagcttggagctggtgacatcaagt
caactggccgcctgccgccttgctgcgctgcgcaccccagagagcctccggcaccgcctc
cgctgcggcaaagcgggctgtttgtctgagggcctctggcgctcccttggcccagacgct
gcacattgttcgggccagggactgggtcaaggagattacgcagacgctgggagcacacgg
gcagagtggaccaagagaggtcggggcatcatagggatttttcgagatcaacaaaaaact
acagtgattgaaaacggggaaatcaggttcaatggaaaagggaaaaagattcggaagcct
cggaccatttattccagcctgcagctccaggctttaaaccatcgctttcagcagacacag
tatctggcccttccagagagagccgaactggcagcttccttaggactgacacaaacacag
gtgaagatatggtttcagaacaaacgctctaagtttaagaaactgctgaagcagggcagt
aatcctcatgagagcgaccccctccagggctcggcggccctgtcgccacgctcgccagcg
ctgcctccagtctgggacgtttctgcctcggccaagggtgtcagacacgatgcagagacc
acagatgatgtgagttgcccaagggaacaccctagggaaacgtctgaacaaggaaaagag
gatccgggacctgcttgtatctgcgaaaaggagccaaaggagcaggcttag

>gi568815591r:96920739_97124623|GENSCAN_predicted_peptide_4|552_aa
MAGCRSRALPCGEAAKARRGSRLQPRPAQRRAPTVQQRAEGLLKRGQNGHQGRGGTESKR
WLQGLPARCHLSKQNQTTTTKTATAYSGSALLEEFDAVGLFLHGWICRWILRGNSVQPAG
LRTKLEALVLSQRSPRQPRPKARDSGQWEKENAAGRQSPGAEVSPERKCDNSCVFLTAKA
GSWCTHGSSALEFLSGAVSCSEFQTPYLYPSACFRPSGSTLQPCLLRPEQPHSQLGQLPP
PQQQGQPLPPPESPTLPESSATDSDYYSPTGGAPHGYCSPTSASYGKALNPYQYQYHGVN
GSAGSYPAKAYADYSYASSYHQYGGAYNRVPSATNQPEKEVTEPEVRMVNGKPKKVRKPR
TIYSSFQLAALQRRFQKTQYLALPERAELAASLGLTQTQVKIWFQNKRSKIKKIMKNGEM
PPEHSPSSSDPMACNSPQSPAWASSYSPPLTYNTSPPPSELLTNPRKAGACGQGQRDSSV
PRPPPGPSLKRVVVLLSERIRVRMGSLSIRFVFLVFYVVDVFGAACDWSLRKVPCFYKVW
RDSAGDSKLKSI

>gi568815591r:96920739_97124623|GENSCAN_predicted_CDS_4|1659_bp
atggcgggctgcaggtcccgagccctgccctgtggggaggcagctaaggcccgccgaggg
agccggctccagcctcggccagcccagagaagggctcccacggtgcagcagcgggctgaa
gggctcctcaagcgcggccagaatgggcaccaaggccgaggaggcaccgagagcaagcga
tggctgcaagggctgccagcacgctgtcacctctcaaaacaaaaccaaacaactacaact
aaaacagccacagcttattcgggaagcgcactgctggaagagtttgatgccgtggggctg
tttctccacggctggatttgtcggtggattttgagaggcaatagcgtccagcccgctggc
cttaggacaaagctggaggccctagttctatctcagagaagcccgcggcagcccaggccc
aaagctcgtgactcgggccagtgggagaaggagaatgctgctgggcgccagtctcccggt
gctgaagtttcccccgaaaggaagtgcgacaatagctgtgtttttctgaccgcaaaggca
gggagttggtgcacgcacggcagctctgctttggagttcctgagcggggctgtatcttgc
tctgagttccagacaccctacctctacccgtcggcctgttttaggccgagtggctccaca
ctccaaccatgtctgcttagaccagagcagccccacagccaactagggcagctgccgccg
ccacaacagcaaggacagccgctgccgccgcccgaatcgccaactttgcccgagtcttca
gctaccgattctgactactacagccctacggggggagccccgcacggctactgctctcct
acctcggcttcctatggcaaagctctcaacccctaccagtatcagtatcacggcgtgaac
ggctccgccgggagctacccagccaaagcttatgccgactatagctacgctagctcctac
caccagtacggcggcgcctacaaccgcgtcccaagcgccaccaaccagccagagaaagaa
gtgaccgagcccgaggtgagaatggtgaatggcaaaccaaagaaagttcgtaaacccagg
actatttattccagctttcagctggccgcattacagagaaggtttcagaagactcagtac
ctcgccttgccggaacgcgccgagctggccgcctcgctgggattgacacaaacacaggtg
aaaatctggtttcagaacaaaagatccaagatcaagaagatcatgaaaaacggggagatg
cccccggagcacagtcccagctccagcgacccaatggcgtgtaactcgccgcagtctcca
gcgtgggcgtcttcatactccccaccccttacttacaacacctcacccccaccttcagag
ctcctcacaaaccccaggaaggctggggcatgcggtcagggccagcgcgattcttcggtg
cccagaccacctcctgggccatccttaaaaagggttgttgttctgttgagtgaaaggatt
cgagtgagaatgggatccttaagtattcgatttgtctttcttgttttctatgttgttgat
gtatttggtgcagcttgtgactggagcctgcgaaaagtaccatgtttctataaggtgtgg
agagactctgctggagattccaagctaaaatctatttga

>gi568815591r:96920739_97124623|GENSCAN_predicted_peptide_5|250_aa
MRDCAMKDGAIRPRYYAFPMVFTTCRPEDSLGCLHHQSPGFQAQSWEGISERKAAALLRG
LEIKLPSPWNRAPGGRGSCGPTFSKLKRFCLLALKRAVDLLAQTASSKSNSININKKDAH
AKTPSEGHQKPKVDKSTKMRKNQHKKSENSKNQNNSSPPKDHNYSPAREQNWMENEFDKL
TEVGFRRWVITNSSPLKEHVLAQCKEAKNLYKWLQELLTRITSLDKTINDLMELKNTAQE
LCEAYTSINS

>gi568815591r:96920739_97124623|GENSCAN_predicted_CDS_5|753_bp
atgagggactgtgccatgaaggacggtgctatccggcccagatactatgcttttcccatg
gtcttcacaacctgcagaccagaagattccctcgggtgcctacaccaccagagccctgga
tttcaagcacaaagctgggagggcatctctgaaagaaaggcagcagccctactcaggggc
ttagagataaaactcccatctccctggaacagagcacctgggggaaggggcagctgtggg
cccaccttcagcaaacttaaacgtttctgcctgctagcactgaagagagcagtggatctc
ctagcacagactgcctcctcaaaaagcaatagcatcaacatcaacaaaaaggacgcccac
gcaaaaaccccatcagaaggtcatcaaaaaccaaaggtagataaatccacgaagatgagg
aaaaaccagcacaaaaagtctgaaaattccaaaaaccagaacaactcttctcctccgaag
gatcacaactactcgccggcaagggaacaaaactggatggagaatgagtttgacaaattg
acagaagtaggtttcagaaggtgggtgataacaaactcctccccgctaaaggagcatgtt
ctagcccaatgcaaggaagctaagaacctttataaatggttacaggaactgctaactaga
ataaccagtttagacaagaccataaatgacctgatggagctaaaaaacacagcacaagaa
ctttgtgaagcatatacaagtatcaatagctga

>gi568815591r:96920739_97124623|GENSCAN_predicted_peptide_6|182_aa
MTKKENFRPISLMNINVKIPSKILANRIQQHIKKLIHHDRVGFIPVLEVLARAIWQEEER
KGIQIAREGVKLSLFANHMTLYLENSIVSAPKLPKLISNFSKVSGYNINVQKSQAFLYTN
NRQVESQIMSELPFRIPIKRIKYLGIQLTRDVNDLFKENYKPLLNEIKEDTNKWKNIPCS
WI

>gi568815591r:96920739_97124623|GENSCAN_predicted_CDS_6|549_bp
atgacaaaaaaagaaaatttcaggccaatatccctaatgaacatcaatgtgaaaatcccc
agtaaaatactggcaaacagaatccagcagcatatcaaaaaacttatccaccatgatcga
gttggcttcatccctgtattggaagttctggccagggcaatctggcaagaggaagaaaga
aaaggtattcaaatagcaagagaaggagtcaaattatctctgtttgccaatcacatgacc
ctatatttagaaaactccatagtctcagccccaaaactccctaagctgataagcaacttc
agcaaagtctcaggatacaatatcaatgtgcaaaaatcacaagctttcctatacaccaac
aatagacaagtggagagccaaatcatgagtgaactcccattcagaattcctataaagaga
ataaaatacctaggaatccaacttacaagggatgtgaatgacctcttcaaggagaactac
aaaccacttctcaatgaaataaaagaggacacaaacaaatggaagaatattccatgctca
tggatatga

>gi568815591r:96920739_97124623|GENSCAN_predicted_peptide_7|108_aa
MAAGHFLDGGKPHVLSPGVLGLMDSKEWNLGPCTIQEQWQAFSLIWSGSGRLAGSEAQQT
PCRIRRDGSQQQVCDGGKQQWWTASKSSARVVKNTDQKNAVARFNRVK

>gi568815591r:96920739_97124623|GENSCAN_predicted_CDS_7|327_bp
atggcagcgggccacttcctagatggtggcaagcctcatgttctctcacctggggttctt
ggcctcatggattccaaggaatggaatcttgggccatgcaccatccaggaacaatggcaa
gcttttagcctgatctggagtggcagtgggcgcctcgctggatcagaagcacagcagaca
ccctgtcggatccggagggatggaagtcagcagcaggtctgtgacggcggcaaacagcag
tggtggacggcaagcaaaagctcagctcgagtcgtaaaaaacacggaccagaagaatgca
gttgcaagatttaatagagtgaaatag

>gi568815591r:96920739_97124623|GENSCAN_predicted_peptide_8|276_aa
MVILPKAIYKFNAIPIKIPSSFLIELEKTFLKFIWNQKRACIAKARLSKNKSGGITLPNF
KLYYKAIVTKKARKFLHRKKSGVCETITQISTFLPLLHQQEYEASDTVRRAHAQSINWEN
SAVQRAWVGELVDGEINLTLIPKAGCSNTLNASNGLGTMEGIPEEPDLNEAFLRPLFAEV
RERAVPLRRRSRRSAWGAMPGRHVSRVRALYKRVLQLHRVLPPDLKSLGDQYVKDEFRRH
KTVGSDEAQRFLQEWEASDAPFSLPKTFGVPRCPGF

>gi568815591r:96920739_97124623|GENSCAN_predicted_CDS_8|831_bp
atggtcatactgccaaaagcaatctacaaattcaatgcaattcccatcaaaataccatca
tcatttctcatcgaactagaaaaaacattcctaaaattcatatggaaccaaaaaagagcc
tgcatagccaaagcaagactaagcaagaacaaatctggaggcatcacattacccaacttc
aaactatactacaaggctatagttaccaaaaaagcaagaaagtttttgcacagaaagaag
tctggggtctgtgagacaatcacacaaatttcaacatttctgcctcttcttcaccagcag
gaatatgaggcatctgacaccgttagaagagcccatgcccagagtataaattgggagaac
agcgctgtgcaaagagcgtgggttggtgagctagtggatggcgagattaatctcactctg
atccctaaagcagggtgcagtaatactctaaatgcctcaaatgggttgggaaccatggaa
ggaatcccggaggagcccgatcttaacgaagcttttctcaggcctttgttcgcagaagtg
cgggaacgcgccgtccctctgcgcaggcgcagtcggcggtcggcgtggggcgctatgccg
gggcggcacgtttctcgagtccgggcattgtacaagcgcgtcttgcagctgcaccgtgtt
ctgcccccggacctcaaatccctgggcgaccagtacgtgaaagacgaatttaggagacat
aagaccgttggttctgacgaggcacagcgtttcttgcaagaatgggaggcaagtgacgct
cccttctctttgcccaagacctttggggttcctcggtgtccaggtttctag