GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:30:14

Sequence gi568815592r:132488676_132689686 : 201011 bp : 37.89% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1875   2083  209  2  2   67   81   102 0.460   5.74
 1.02 Intr +  13255  13397  143  1  2  -23   66   167 0.197   2.48
 1.03 Intr +  15640  15723   84  2  0   51   94    91 0.462   4.87
 1.04 Term +  16525  16652  128  2  2   82   49    54 0.485  -1.64
 1.05 PlyA +  16932  16937    6                               1.05

 2.00 Prom +  22280  22319   40                              -4.55
 2.01 Init +  37349  37507  159  1  0   67    3   192 0.539   8.47
 2.02 Term +  37613  37765  153  1  0   99   48   106 0.259   4.64
 2.03 PlyA +  39672  39677    6                               1.05

 3.00 Prom +  43942  43981   40                              -6.05
 3.01 Sngl +  49615  50661 1047  0  0   65   42   259 0.889  15.89
 3.02 PlyA +  50715  50720    6                               1.05

 4.04 PlyA -  51116  51111    6                               1.05
 4.03 Term -  59701  59599  103  0  1   90   42    75 0.418  -0.13
 4.02 Intr -  59998  59865  134  1  2   75   69    84 0.262   3.72
 4.01 Init -  61819  61814    6  1  0   69  110     0 0.306   1.33
 4.00 Prom -  62106  62067   40                              -5.25

 5.00 Prom +  63682  63721   40                              -3.75
 5.01 Init +  64018  64593  576  0  0   92   87   347 0.297  30.18
 5.02 Intr +  68533  68563   31  0  1   71   90   -22 0.002  -6.91
 5.03 Term +  81564  82684 1121  1  2  104   47   291 0.301  17.79
 5.04 PlyA +  82774  82779    6                               1.05

 6.02 PlyA -  83012  83007    6                               1.05
 6.01 Sngl - 101011  99998 1014  1  0   18   37   616 0.985  46.26
 6.00 Prom - 101370 101331   40                              -8.75

 7.00 Prom + 101649 101688   40                             -11.24
 7.01 Init + 101910 101916    7  2  1   89   69     0 0.233  -0.68
 7.02 Intr + 106382 106650  269  0  2  -69   88   366 0.736  17.23
 7.03 Term + 111773 111826   54  1  0   65   48   119 0.894   2.18
 7.04 PlyA + 111851 111856    6                               1.05

 8.00 Prom + 115405 115444   40                              -5.45
 8.01 Init + 117388 117446   59  0  2   62   74    55 0.505   2.33
 8.02 Term + 119901 120102  202  0  1  110   42   241 0.813  17.58
 8.03 PlyA + 121388 121393    6                               1.05

 9.00 Prom + 125282 125321   40                              -5.35
 9.01 Sngl + 129132 129389  258  2  0   77   42   183 0.872   7.48
 9.02 PlyA + 130819 130824    6                               1.05

10.02 PlyA - 132448 132443    6                               1.05
10.01 Sngl - 138847 138590  258  1  0   62   47   171 0.773   5.28
10.00 Prom - 145849 145810   40                              -4.55

11.05 PlyA - 146133 146128    6                               1.05
11.04 Term - 146926 146563  364  0  1   87   48   186 0.332   7.55
11.03 Intr - 147367 147241  127  2  1   61   76    70 0.241   1.92
11.02 Intr - 167011 166856  156  2  0   67   43    80 0.000   0.46
11.01 Init - 187196 187073  124  2  1   81   49   141 0.765   9.88
11.00 Prom - 191546 191507   40                              -6.45

12.03 PlyA - 191709 191704    6                               1.05
12.02 Term - 193302 193180  123  0  0   18   53   118 0.639  -1.30
12.01 Intr - 195830 195660  171  2  0   84  110   124 0.939  13.42

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 175797 175934  138  2  0   63  116   104 0.993  10.89

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_1|187_aa
MTKFSKTCTYGQKQQKIAGPEQEAVWVGRRITGVAELHTVTSKSSEELIPPRVRMGRSRA
EKHRGLCWRRSDARGAKQSQRRWPVTNCSSSQCTIPTVPAVDNVVPGTVHFLVLPEIEEG
ERMTLPFQQPASQNRRDTEILPQEAGGGEIQSQGSDAKAIHACYHIELMPQKISGREEAT
VKYVSLL
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_1|564_bp
atgaccaaattctcaaaaacgtgtacatacggccaaaagcaacagaaaatagcaggtcct
gagcaggaagccgtatgggtggggcggagaataaccggggttgcagagctccacactgtc
accagcaaatcatcagaagagctgatcccacccagagtgagaatgggcagaagcagggct
gagaagcaccggggactctgctggagaagatctgatgccagaggagcaaagcagagccaa
cgccggtggcctgtgacaaactgcagcagttctcaatgtaccattcctacagtccctgct
gtggacaacgtggtgcctggcactgtgcacttccttgttcttccagaaatagaggaagga
gaacgaatgactctcccctttcagcagccagcatctcagaatagaagagacactgagata
ctgccacaggaggcaggtggaggtgagattcagtcccagggttctgatgccaaggctatc
catgcttgctaccacatagaactaatgccacagaagatttctggtagagaagaagccaca
gtgaagtatgtgtctttgctctaa
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_2|103_aa
MGDRIRQGQDHGIWELFPPQRGNLRAHGTGRKDPFVTDEWLPELLIQRRCNGWGNCLLFH
HFQRPHVETPGQKIIPPDSEWIKDDRQGPTGGKFEPCQCDIQC
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_2|312_bp
atgggtgaccggattaggcaaggacaggaccatgggatatgggaacttttccctccccag
aggggaaacttgagagctcatgggactggaagaaaagatccctttgtgactgatgagtgg
ctgcctgaacttttgattcagcgacgctgcaatgggtggggcaactgtctgctctttcat
catttccagagaccacatgttgaaactccaggtcagaaaatcattccacctgactctgag
tggatcaaagatgacagacagggcccaacagggggcaagtttgaaccttgccagtgtgat
attcagtgctag
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_3|348_aa
MVNNFSQAEAVELCYKNVNESCIKTPYSPGPRSILYAVLGFGAVLAAFGNLLVMIAILHF
KQLHTPTNFLIASLACADFLVGVTVMPFSTVRSVESCWYFGDSYCKFHTCFDTSFCFASL
FHLCCISVDRYIAVTDPLTYPTKFTVSVSGICIVLSWFFSVTYSFSIFYTGANEEGIEEL
VVALTCVGGCQAPLNQNWVLLCFLLFFIPNVAMVFIYSKIFLVAKHQARKIESTASQAQS
SSESYKERVAKRERKAAKTLGIAMAAFLVSWLPYLVDAVIDAYMNFITPPYVYEILVWCV
YYNSAMNPLIYAFFYQWFGKAIKLIVSGKVLRTDSSTTNLFSEEVETD
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_3|1047_bp
atggtgaacaatttctcccaagctgaggctgtggagctgtgttacaagaacgtgaacgaa
tcctgcattaaaactccttactcgccaggtcctcgatctatcctctacgccgtccttggt
tttggggctgtgctggcagcgtttggaaacttactggtcatgattgctatccttcacttc
aaacaactgcacacacctacaaactttctgattgcgtcgctggcctgtgctgacttcttg
gtgggagtcactgtgatgcccttcagcacagtgaggtctgtggagagctgttggtacttt
ggggacagttactgtaaattccatacatgttttgacacatccttctgttttgcttcttta
tttcatttatgctgtatctctgttgatagatacattgctgttactgatcctctgacctat
ccaaccaagtttactgtgtcagtttcagggatatgcattgttctttcctggttcttttct
gtcacatacagcttttcgatcttttacacgggagccaacgaagaaggaattgaggaatta
gtagttgctctaacctgtgtaggaggctgccaggctccactgaatcaaaactgggtccta
ctttgttttcttctattctttatacccaatgtcgccatggtgtttatatacagtaagata
tttttggtggccaagcatcaggctaggaagatagaaagtacagccagccaagctcagtcc
tcctcagagagttacaaggaaagagtagcaaaaagagagagaaaggctgccaaaaccttg
ggaattgctatggcagcatttcttgtctcttggctaccatacctcgttgatgcagtgatt
gatgcttatatgaattttataactcctccttatgtttatgagattttagtttggtgtgtt
tattataattcagctatgaaccccttgatttatgctttcttttaccaatggtttgggaag
gcaataaaacttattgtaagcggcaaggtcttaaggactgattcgtcaacaactaattta
ttttctgaagaagtagagacagattaa
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_4|80_aa
MQASLASAAGKKCGPWMTVSRCSCHLCEMPVSRFFDPLRHRSQKSEKQDEDFFRTFSQDG
SVSSFIYPCAPDENMERAGI
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_4|243_bp
atgcaggcatcacttgcttctgctgctggcaagaaatgtggtccatggatgacggtgtcc
cgctgctcctgtcacctctgtgagatgcctgtctccaggttttttgatcccctcaggcat
agaagtcagaagtcagaaaaacaagatgaagatttctttaggactttcagtcaagatggt
tcagttagttcattcatctacccctgtgctccagatgaaaatatggaaagagcaggcata
taa
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_5|575_aa
MTSNFSQPVVQLCYEDVNGSCIETPYSPGSRVILYTAFSFGSLLAVFGNLLVMTSVLHFK
QLHSPTNFLIASLACADFLVGVTVMLFSMVRTVESCWYFGAKFCTLHSCCDVAFCYSSVL
HLCFICIDRYIVVTDPLVYATKFTVSVSGICISVSWILPLTYSGAVFYTGVNDDGLEELV
SALNCVGGCQIISGRLLFSKHQGALKIRNNPVFFVIINKDKTSPYVNNSVMSSNSSLLVA
VQLCYANVNGSCVKIPFSPGSRVILYIVFGFGAVLAVFGNLLVMISILHFKQLHSPTNFL
VASLACADFLVGVTVMPFSMVRTVESCWYFGRSFCTFHTCCDVAFCYSSLFHLCFISIDR
YIAVTDPLVYPTKFTVSVSGICISVSWILPLMYSGAVFYTGVYDDGLEELSDALNCIGGC
QTVVNQNWVLTDFLSFFIPTFIMIILYGNIFLVARRQAKKIENTGSKTESSSESYKARVA
RRERKAAKTLGVTVVAFMISWLPYSIDSLIDAFMGFITPACIYEICCWCAYYNSAMNPLI
YALFYPWFRKAIKVIVTGQVLKNSSATMNLFSEHI
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_5|1728_bp
atgaccagcaatttttcccaacctgttgtgcagctttgctatgaggatgtgaatggatct
tgtattgaaactccctattctcctgggtcccgggtaattctgtacacggcgtttagcttt
gggtctttgctggctgtatttggaaatctcttagtaatgacttctgttcttcattttaag
cagctgcactctccaaccaattttctcattgcctctctggcctgtgctgacttcttggta
ggtgtgactgtgatgcttttcagcatggtcaggacggtggagagctgctggtattttgga
gccaaattttgtactcttcacagttgctgtgatgtggcattttgttactcttctgtcctc
cacttgtgcttcatctgcatcgacaggtacattgtggttactgatcccctggtctatgct
accaagttcaccgtgtctgtgtcgggaatttgcatcagcgtgtcctggattctgcctctc
acgtacagcggtgctgtgttctacacaggtgtcaatgatgatgggctggaggaattagta
agtgctctcaactgcgtaggtggctgtcaaattatttctggaagactactattctctaaa
catcagggagcactgaaaatcaggaacaatcctgtattttttgtgataatcaacaaggac
aaaacttctccatatgtaaataacagcgttatgagcagcaattcatccctgctggtggct
gtgcagctgtgctacgcgaacgtgaatgggtcctgtgtgaaaatccccttctcgccggga
tcccgggtgattctgtacatagtgtttggctttggggctgtgctggctgtgtttggaaac
ctcctggtgatgatttcaatcctccatttcaagcagctgcactctccgaccaattttctc
gttgcctctctggcctgcgctgatttcttggtgggtgtgactgtgatgcccttcagcatg
gtcaggacggtggagagctgctggtattttgggaggagtttttgtactttccacacctgc
tgtgatgtggcattttgttactcttctctctttcacttgtgcttcatctccatcgacagg
tacattgcggttactgaccccctggtctatcctaccaagttcaccgtatctgtgtcagga
atttgcatcagcgtgtcctggatcctgcccctcatgtacagcggtgctgtgttctacaca
ggtgtctatgacgatgggctggaggaattatctgatgccctaaactgtataggaggttgt
cagaccgttgtaaatcaaaactgggtgttgacagattttctatccttctttatacctacc
tttattatgataattctgtatggtaacatatttcttgtggctagacgacaggcgaaaaag
atagaaaatactggtagcaagacagaatcatcctcagagagttacaaagccagagtggcc
aggagagagagaaaagcagctaaaaccctgggggtcacagtggtagcatttatgatttca
tggttaccatatagcattgattcattaattgatgcctttatgggctttataacccctgcc
tgtatttatgagatttgctgttggtgtgcttattataactcagccatgaatcctttgatt
tatgctttattttacccatggtttaggaaagcaataaaagttattgtaactggtcaggtt
ttaaagaacagttcagcaaccatgaatttgttttctgaacatatataa
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_6|337_aa
MRAVFIQGAEEHPAAFCYQVNGSCPRTVHTLGIQLVIYLACAAGMLIIVLGNVFVAFAVS
YFKALHTPTNFLLLSLALADMFLGLLVLPLSTIRSVESCWFFGDFLCRLHTYLDTLFCLT
SIFHLCFISIDRHCAICDPLLYPSKFTVRVALRYILAGWGVPAAYTSLFLYTDVVETRLS
QWLEEMPCVGSCQLLLNKFWGWLNFPLFFVPCLIMISLYVKIFVVATRQAQQITTLSKSL
AGAAKHERKAAKTLGIAVGIYLLCWLPFTIDTMVDSLLHFITPPLVFDIFIWFAYFNSAC
NPIIYVFSYQWFRKALKLTLSQKVFSPQTRTVDLYQE
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_6|1014_bp
atgagagctgtcttcatccaaggtgctgaagagcaccctgcggcattctgctaccaggtg
aatgggtcttgccccaggacagtacatactctgggcatccagttggtcatctacctggcc
tgtgcagcaggcatgctgattatcgtgctagggaatgtatttgtggcatttgctgtgtcc
tacttcaaagcgcttcacacgcccaccaacttcctgctgctctccctggccctggctgac
atgtttctgggtctgctggtgctgcccctcagcaccattcgctcagtggagagctgctgg
ttcttcggggacttcctctgccgcctgcacacctacctggacaccctcttctgcctcacc
tccatcttccatctctgtttcatttccattgaccgccactgtgccatctgtgaccccctg
ctctatccctccaagttcacagtgagggtggctctcaggtacatcctggcaggatggggg
gtgcccgcagcatacacttcgttattcctctacacagatgtggtagagacaaggctcagc
cagtggctggaagagatgccttgtgtgggcagttgccagctgctgctcaataaattttgg
ggctggttaaacttccctttgttctttgtcccctgcctcattatgatcagcttgtatgtg
aagatctttgtggttgctaccagacaggctcagcagattaccacattgagcaaaagcctg
gctggggctgccaagcatgagagaaaagctgccaagaccctgggcattgctgtgggcata
tacctcttgtgctggctgcccttcaccatagacacgatggtcgacagcctccttcacttt
atcacacccccactggtctttgacatctttatctggtttgcttacttcaactcagcctgc
aaccccatcatctatgtcttttcctaccagtggtttcggaaggcactgaaactcacactg
agccagaaggtcttctcaccgcagacacgcactgttgatttgtaccaagaatga
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_7|109_aa
MTDEAEVKNGGGAEYDVTAAVDFAKEVSKIPAGLDGCDHTEGHDHTAQQKVSDGHGEDQE
VGRGVELLEVSDGNHYDHVAQHCHHYRPDHDQLNAQALNSPIGHEEELL
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_7|330_bp
atgacagatgaagcagaggtgaaaaatggaggtggtgcagagtatgatgtcacagcagct
gtggactttgcaaaagaggtctccaaaataccagcaggactcgatggatgtgatcacact
gaagggcatgaccacacagctcagcaaaaagtcagtgatggccatggagaggatcaagaa
gttggtcggggagtggagctgcttgaagtgagcgatggaaatcattacgatcatgttgcc
cagcattgtcatcactatagacccgatcatgaccagcttaatgcacaggctctgaactcc
ccaatcggccatgaggaggaactgctctga
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_8|86_aa
MVIKMVKVDTGEYKRGQGGKHMADDSSMLFGNDKDFAINTNHDGARGKETCSEQYCPPEF
VESKGTEIEASNKYLIALHTGNIGLR
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_8|261_bp
atggtcataaaaatggtaaaagtagacactggggaatacaagagaggacaaggagggaag
cacatggctgatgactcgagcatgctgtttggaaacgataaagattttgccataaatacc
aaccatgatggagccaggggtaaagaaacatgtagtgaacaatattgtcccccagaattt
gttgaaagtaagggcacagaaattgaagcaagcaacaagtatcttatagctctgcatacc
ggaaacatcggcctcagatag
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_9|85_aa
MKNGCYAKHQVKTIINLAKCKPKIPAVLHRSDHTIWHDGESQEEISDGHGEDEEVGWCVK
LLEVGNGNYHGKIAKYCDEYGSCHK
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_9|258_bp
atgaaaaatggatgttatgctaagcatcaggtcaaaactataataaatcttgcaaaatgt
aagcccaaaataccagcagttctccaccgatctgatcatactatatggcatgatggtgaa
tcccaggaggaaatcagtgatggccatggagaggatgaggaagttggttggtgtgtgaag
ctgcttgaagtaggaaatggaaattatcatggcaagattgccaaatattgtgatgaatat
ggatcctgccataaatga
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_10|85_aa
MSKDFMTKTPKAMATKANIDKWDLIKLRSFCTAKETIIRVNRKPTEWEKNFAIYPSDKGL
ISRVYEELKQIYKKKTTPSKSVQGL
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_10|258_bp
atgagcaaggacttcatgactaaaacaccaaaagcaatggcaacaaaagccaatattgac
aaatgggatctaattaaactaaggagcttctgcacagcaaaagaaactatcatcagagtg
aacaggaaacctacagaatgggagaaaaattttgcaatctatccatctgacaaagggcta
atatccagagtctacgaggaacttaaacaaatttacaagaaaaaaacaaccccatcaaaa
agtgtgcaaggactatga
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_11|256_aa
MGPTGSATSDGGNASKKQRKVTLQEKVELPDMDHRLWSTAAGSNRSDTVRKVKSDRVWGL
TSVFLMLLEAKPGGLLGAPGIQDQSGQLEIPPDESGTSTADLLEFAGGPLQTLFAWVSPA
KAAEQQKLLPAPSSGRDVLPIEEESREAVWLQLLFCAAVGCAQSKLPGGIVYTVRGKQPT
QASVTVDAPPNTKLECPRLTADCCAASENFKPVDLSLLGSMGEGPSEQDHLTPWLYPLFH
GSEWFCLAGIPGATGI
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_11|771_bp
atgggacccacaggaagtgccactagtgatggtggaaatgcttccaagaagcagagaaaa
gtcacattacaagaaaaagttgaattgcctgatatggaccatagattgtggtctacagct
gcaggcagcaatcggagtgatactgttagaaaagtcaaatcagatcgggtgtgggggctt
acttctgtattcctaatgcttttggaggccaagccaggaggattgcttggagccccaggc
attcaagaccagtctgggcaacttgaaatacctcctgatgagtcaggcacctctactgct
gatctgctggagtttgctgggggtccactccagaccctgtttgcctgggtatcaccagcc
aaggctgcagaacagcaaaaattgctgcctgctccttcctctggaagagatgtcctgccc
atagaggaggaatctagagaggcagtctggctacagctacttttctgtgctgcagtgggc
tgtgctcagtccaaacttccaggaggcattgtttacactgtgaggggaaaacagcctact
caagcctcagtaacggtggacgcccctcccaacaccaagctcgagtgtcccaggttgact
gcagactgctgtgctgccagtgagaatttcaagccagtggatcttagcttgctgggctcc
atgggagagggaccctctgagcaagaccacttgactccctggctttatccccttttccat
gggagtgaatggttctgtcttgctggcattccaggtgccactgggatatga
>gi568815592r:132488676_132689686|GENSCAN_predicted_peptide_12|97_aa
ICTLLKCKTTNLNTCGDSAETASTRFEMFSLSGTFGTQYVFPEVLLSENQLAPGEFQCCD
DGQLHSTDNSHALCKACPSAQESKALAVVIYASKTSF
>gi568815592r:132488676_132689686|GENSCAN_predicted_CDS_12|294_bp
atttgtaccctgttgaaatgtaaaacgactaatttaaacacttgcggtgactcagctgaa
acagcttctaccaggtttgaaatgttctccctcagtggcactttcggaacccagtatgtc
tttcctgaggtgttgctgagtgaaaatcagcttgcacctggagaatttcagtgctgtgat
gatggacaattacatagtaccgataacagccatgcactgtgcaaagcatgcccttctgca
caggagagcaaggcacttgcagtagtgatctatgccagcaaaacatcattttga