GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:53:23 Sequence gi568815592f:132438290_132639333 : 201044 bp : 37.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 6119 5904 216 2 0 34 105 106 0.043 4.68 1.07 Intr - 18628 18498 131 2 2 92 81 61 0.234 5.29 1.06 Intr - 21059 20954 106 2 1 55 61 19 0.183 -5.23 1.05 Intr - 25786 25704 83 2 2 100 91 149 0.963 14.84 1.04 Intr - 26593 26536 58 1 1 95 85 72 0.787 5.14 1.03 Intr - 33311 33174 138 2 0 81 84 108 0.979 9.44 1.02 Intr - 34086 33993 94 2 1 82 69 95 0.993 6.05 1.01 Init - 46060 45885 176 1 2 41 81 143 0.750 7.77 1.00 Prom - 47708 47669 40 -6.55 2.00 Prom + 48229 48268 40 -7.45 2.01 Init + 52261 52469 209 0 2 67 81 102 0.297 5.74 2.02 Intr + 63641 63783 143 2 2 -23 66 167 0.154 2.48 2.03 Intr + 66026 66109 84 0 0 51 94 91 0.421 4.87 2.04 Term + 66911 67038 128 0 2 82 49 54 0.463 -1.64 2.05 PlyA + 67318 67323 6 1.05 3.00 Prom + 72666 72705 40 -4.55 3.01 Init + 87735 87893 159 2 0 67 3 192 0.539 8.47 3.02 Term + 87999 88151 153 2 0 99 48 106 0.259 4.64 3.03 PlyA + 90058 90063 6 1.05 4.00 Prom + 94328 94367 40 -6.05 4.01 Sngl + 100001 101047 1047 1 0 65 42 259 0.889 15.89 4.02 PlyA + 101101 101106 6 1.05 5.04 PlyA - 101502 101497 6 1.05 5.03 Term - 110087 109985 103 1 1 90 42 75 0.418 -0.13 5.02 Intr - 110384 110251 134 2 2 75 69 84 0.262 3.72 5.01 Init - 112205 112200 6 2 0 69 110 0 0.306 1.33 5.00 Prom - 112492 112453 40 -5.25 6.00 Prom + 114068 114107 40 -3.75 6.01 Init + 114404 114979 576 1 0 92 87 347 0.297 30.18 6.02 Intr + 118919 118949 31 1 1 71 90 -22 0.002 -6.91 6.03 Term + 131950 133070 1121 2 2 104 47 291 0.301 17.79 6.04 PlyA + 133160 133165 6 1.05 7.02 PlyA - 133398 133393 6 1.05 7.01 Sngl - 151397 150384 1014 2 0 18 37 616 0.985 46.26 7.00 Prom - 151756 151717 40 -8.75 8.00 Prom + 152035 152074 40 -11.24 8.01 Init + 152296 152302 7 0 1 89 69 0 0.233 -0.68 8.02 Intr + 156768 157036 269 1 2 -69 88 366 0.736 17.23 8.03 Term + 162159 162212 54 2 0 65 48 119 0.894 2.18 8.04 PlyA + 162237 162242 6 1.05 9.00 Prom + 165791 165830 40 -5.45 9.01 Init + 167774 167832 59 1 2 62 74 55 0.505 2.33 9.02 Term + 170287 170488 202 1 1 110 42 241 0.813 17.58 9.03 PlyA + 171774 171779 6 1.05 10.00 Prom + 175668 175707 40 -5.35 10.01 Sngl + 179518 179775 258 0 0 77 42 183 0.872 7.48 10.02 PlyA + 181205 181210 6 1.05 11.02 PlyA - 182834 182829 6 1.05 11.01 Sngl - 189233 188976 258 2 0 62 47 171 0.768 5.28 11.00 Prom - 196235 196196 40 -4.55 12.03 PlyA - 196519 196514 6 1.05 12.02 Term - 197312 196949 364 1 1 87 48 186 0.684 7.55 12.01 Intr - 197753 197627 127 0 1 61 76 70 0.767 1.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_1|334_aa MAADPRGTEWIFIAVEEPETGHCFVFDDPVVHLFRMCRKWYYFSVMKNVTYSLKSRLWKQ QKQQYTNQLAKETDKYIKEFGSLPTTPSEQRQRKIQKDRLVAEFTTSLTNFQKVQRQAAE REKEFVARVRASSRVSAPGGWEEKLPPIEKASWALDSIEANVENAEVHVQQANQQLSRAA DYQGVIDFTYFVRLVEVALHIRKTPFFSTKGITENSFVMASEERWSLLFSWSTLNLEPET EGVCGKCGFWSTCTAEDILEREWILTKVIQLKQPGEPMLLSPYNNHYNVRKVTLDKLGKE DFIQDNCKRDQDYHNGGEIELNHTETDRRILSIE >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_1|1002_bp atggcagcagacccacgtggaacagaatggatcttcattgctgtagaagagccagaaacg ggacactgttttgtttttgatgacccagtagtgcatctttttcggatgtgtaggaaatgg tattatttttctgttatgaaaaatgtaacatactctttgaagtcacgtctgtggaagcaa cagaagcagcagtatactaaccagcttgccaaagaaacagataagtacattaaagagttt ggatctctgcccaccacccccagtgaacagcgtcaaaggaaaatacagaaggatcgctta gtggcagagttcacaacatcactgacaaacttccagaaggtccagaggcaggctgctgag cgagagaaagagtttgttgctcgagtaagagccagttccagagtgtctgcccctggaggg tgggaagaaaaactgccaccaattgagaaggcttcatgggccctcgatagcatagaagcc aatgtggaaaatgcagaggtgcacgttcagcaagcaaatcagcagctgtcaagggcagca gattatcagggtgtcatcgattttacctactttgtcagactggtagaagttgctttgcat atcagaaaaactccatttttttccacaaaagggattacagaaaactcttttgtgatggcc tctgaggaaagatggtctctccttttctcatggtcaactctcaacttagaaccagagaca gaaggagtttgtgggaagtgtggtttctggtctacctgcactgcagaggacattctagaa agagaatggatattgactaaagttatacagctgaaacaaccaggagaacccatgcttctt tctccgtataataaccattataatgtaagaaaagttacactggacaagttaggcaaggaa gactttattcaagacaattgcaaaagagatcaagactatcacaatggaggagaaattgaa ctcaatcacactgaaacagacaggagaattttaagcattgag >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_2|187_aa MTKFSKTCTYGQKQQKIAGPEQEAVWVGRRITGVAELHTVTSKSSEELIPPRVRMGRSRA EKHRGLCWRRSDARGAKQSQRRWPVTNCSSSQCTIPTVPAVDNVVPGTVHFLVLPEIEEG ERMTLPFQQPASQNRRDTEILPQEAGGGEIQSQGSDAKAIHACYHIELMPQKISGREEAT VKYVSLL >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_2|564_bp atgaccaaattctcaaaaacgtgtacatacggccaaaagcaacagaaaatagcaggtcct gagcaggaagccgtatgggtggggcggagaataaccggggttgcagagctccacactgtc accagcaaatcatcagaagagctgatcccacccagagtgagaatgggcagaagcagggct gagaagcaccggggactctgctggagaagatctgatgccagaggagcaaagcagagccaa cgccggtggcctgtgacaaactgcagcagttctcaatgtaccattcctacagtccctgct gtggacaacgtggtgcctggcactgtgcacttccttgttcttccagaaatagaggaagga gaacgaatgactctcccctttcagcagccagcatctcagaatagaagagacactgagata ctgccacaggaggcaggtggaggtgagattcagtcccagggttctgatgccaaggctatc catgcttgctaccacatagaactaatgccacagaagatttctggtagagaagaagccaca gtgaagtatgtgtctttgctctaa >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_3|103_aa MGDRIRQGQDHGIWELFPPQRGNLRAHGTGRKDPFVTDEWLPELLIQRRCNGWGNCLLFH HFQRPHVETPGQKIIPPDSEWIKDDRQGPTGGKFEPCQCDIQC >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_3|312_bp atgggtgaccggattaggcaaggacaggaccatgggatatgggaacttttccctccccag aggggaaacttgagagctcatgggactggaagaaaagatccctttgtgactgatgagtgg ctgcctgaacttttgattcagcgacgctgcaatgggtggggcaactgtctgctctttcat catttccagagaccacatgttgaaactccaggtcagaaaatcattccacctgactctgag tggatcaaagatgacagacagggcccaacagggggcaagtttgaaccttgccagtgtgat attcagtgctag >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_4|348_aa MVNNFSQAEAVELCYKNVNESCIKTPYSPGPRSILYAVLGFGAVLAAFGNLLVMIAILHF KQLHTPTNFLIASLACADFLVGVTVMPFSTVRSVESCWYFGDSYCKFHTCFDTSFCFASL FHLCCISVDRYIAVTDPLTYPTKFTVSVSGICIVLSWFFSVTYSFSIFYTGANEEGIEEL VVALTCVGGCQAPLNQNWVLLCFLLFFIPNVAMVFIYSKIFLVAKHQARKIESTASQAQS SSESYKERVAKRERKAAKTLGIAMAAFLVSWLPYLVDAVIDAYMNFITPPYVYEILVWCV YYNSAMNPLIYAFFYQWFGKAIKLIVSGKVLRTDSSTTNLFSEEVETD >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_4|1047_bp atggtgaacaatttctcccaagctgaggctgtggagctgtgttacaagaacgtgaacgaa tcctgcattaaaactccttactcgccaggtcctcgatctatcctctacgccgtccttggt tttggggctgtgctggcagcgtttggaaacttactggtcatgattgctatccttcacttc aaacaactgcacacacctacaaactttctgattgcgtcgctggcctgtgctgacttcttg gtgggagtcactgtgatgcccttcagcacagtgaggtctgtggagagctgttggtacttt ggggacagttactgtaaattccatacatgttttgacacatccttctgttttgcttcttta tttcatttatgctgtatctctgttgatagatacattgctgttactgatcctctgacctat ccaaccaagtttactgtgtcagtttcagggatatgcattgttctttcctggttcttttct gtcacatacagcttttcgatcttttacacgggagccaacgaagaaggaattgaggaatta gtagttgctctaacctgtgtaggaggctgccaggctccactgaatcaaaactgggtccta ctttgttttcttctattctttatacccaatgtcgccatggtgtttatatacagtaagata tttttggtggccaagcatcaggctaggaagatagaaagtacagccagccaagctcagtcc tcctcagagagttacaaggaaagagtagcaaaaagagagagaaaggctgccaaaaccttg ggaattgctatggcagcatttcttgtctcttggctaccatacctcgttgatgcagtgatt gatgcttatatgaattttataactcctccttatgtttatgagattttagtttggtgtgtt tattataattcagctatgaaccccttgatttatgctttcttttaccaatggtttgggaag gcaataaaacttattgtaagcggcaaggtcttaaggactgattcgtcaacaactaattta ttttctgaagaagtagagacagattaa >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_5|80_aa MQASLASAAGKKCGPWMTVSRCSCHLCEMPVSRFFDPLRHRSQKSEKQDEDFFRTFSQDG SVSSFIYPCAPDENMERAGI >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_5|243_bp atgcaggcatcacttgcttctgctgctggcaagaaatgtggtccatggatgacggtgtcc cgctgctcctgtcacctctgtgagatgcctgtctccaggttttttgatcccctcaggcat agaagtcagaagtcagaaaaacaagatgaagatttctttaggactttcagtcaagatggt tcagttagttcattcatctacccctgtgctccagatgaaaatatggaaagagcaggcata taa >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_6|575_aa MTSNFSQPVVQLCYEDVNGSCIETPYSPGSRVILYTAFSFGSLLAVFGNLLVMTSVLHFK QLHSPTNFLIASLACADFLVGVTVMLFSMVRTVESCWYFGAKFCTLHSCCDVAFCYSSVL HLCFICIDRYIVVTDPLVYATKFTVSVSGICISVSWILPLTYSGAVFYTGVNDDGLEELV SALNCVGGCQIISGRLLFSKHQGALKIRNNPVFFVIINKDKTSPYVNNSVMSSNSSLLVA VQLCYANVNGSCVKIPFSPGSRVILYIVFGFGAVLAVFGNLLVMISILHFKQLHSPTNFL VASLACADFLVGVTVMPFSMVRTVESCWYFGRSFCTFHTCCDVAFCYSSLFHLCFISIDR YIAVTDPLVYPTKFTVSVSGICISVSWILPLMYSGAVFYTGVYDDGLEELSDALNCIGGC QTVVNQNWVLTDFLSFFIPTFIMIILYGNIFLVARRQAKKIENTGSKTESSSESYKARVA RRERKAAKTLGVTVVAFMISWLPYSIDSLIDAFMGFITPACIYEICCWCAYYNSAMNPLI YALFYPWFRKAIKVIVTGQVLKNSSATMNLFSEHI >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_6|1728_bp atgaccagcaatttttcccaacctgttgtgcagctttgctatgaggatgtgaatggatct tgtattgaaactccctattctcctgggtcccgggtaattctgtacacggcgtttagcttt gggtctttgctggctgtatttggaaatctcttagtaatgacttctgttcttcattttaag cagctgcactctccaaccaattttctcattgcctctctggcctgtgctgacttcttggta ggtgtgactgtgatgcttttcagcatggtcaggacggtggagagctgctggtattttgga gccaaattttgtactcttcacagttgctgtgatgtggcattttgttactcttctgtcctc cacttgtgcttcatctgcatcgacaggtacattgtggttactgatcccctggtctatgct accaagttcaccgtgtctgtgtcgggaatttgcatcagcgtgtcctggattctgcctctc acgtacagcggtgctgtgttctacacaggtgtcaatgatgatgggctggaggaattagta agtgctctcaactgcgtaggtggctgtcaaattatttctggaagactactattctctaaa catcagggagcactgaaaatcaggaacaatcctgtattttttgtgataatcaacaaggac aaaacttctccatatgtaaataacagcgttatgagcagcaattcatccctgctggtggct gtgcagctgtgctacgcgaacgtgaatgggtcctgtgtgaaaatccccttctcgccggga tcccgggtgattctgtacatagtgtttggctttggggctgtgctggctgtgtttggaaac ctcctggtgatgatttcaatcctccatttcaagcagctgcactctccgaccaattttctc gttgcctctctggcctgcgctgatttcttggtgggtgtgactgtgatgcccttcagcatg gtcaggacggtggagagctgctggtattttgggaggagtttttgtactttccacacctgc tgtgatgtggcattttgttactcttctctctttcacttgtgcttcatctccatcgacagg tacattgcggttactgaccccctggtctatcctaccaagttcaccgtatctgtgtcagga atttgcatcagcgtgtcctggatcctgcccctcatgtacagcggtgctgtgttctacaca ggtgtctatgacgatgggctggaggaattatctgatgccctaaactgtataggaggttgt cagaccgttgtaaatcaaaactgggtgttgacagattttctatccttctttatacctacc tttattatgataattctgtatggtaacatatttcttgtggctagacgacaggcgaaaaag atagaaaatactggtagcaagacagaatcatcctcagagagttacaaagccagagtggcc aggagagagagaaaagcagctaaaaccctgggggtcacagtggtagcatttatgatttca tggttaccatatagcattgattcattaattgatgcctttatgggctttataacccctgcc tgtatttatgagatttgctgttggtgtgcttattataactcagccatgaatcctttgatt tatgctttattttacccatggtttaggaaagcaataaaagttattgtaactggtcaggtt ttaaagaacagttcagcaaccatgaatttgttttctgaacatatataa >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_7|337_aa MRAVFIQGAEEHPAAFCYQVNGSCPRTVHTLGIQLVIYLACAAGMLIIVLGNVFVAFAVS YFKALHTPTNFLLLSLALADMFLGLLVLPLSTIRSVESCWFFGDFLCRLHTYLDTLFCLT SIFHLCFISIDRHCAICDPLLYPSKFTVRVALRYILAGWGVPAAYTSLFLYTDVVETRLS QWLEEMPCVGSCQLLLNKFWGWLNFPLFFVPCLIMISLYVKIFVVATRQAQQITTLSKSL AGAAKHERKAAKTLGIAVGIYLLCWLPFTIDTMVDSLLHFITPPLVFDIFIWFAYFNSAC NPIIYVFSYQWFRKALKLTLSQKVFSPQTRTVDLYQE >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_7|1014_bp atgagagctgtcttcatccaaggtgctgaagagcaccctgcggcattctgctaccaggtg aatgggtcttgccccaggacagtacatactctgggcatccagttggtcatctacctggcc tgtgcagcaggcatgctgattatcgtgctagggaatgtatttgtggcatttgctgtgtcc tacttcaaagcgcttcacacgcccaccaacttcctgctgctctccctggccctggctgac atgtttctgggtctgctggtgctgcccctcagcaccattcgctcagtggagagctgctgg ttcttcggggacttcctctgccgcctgcacacctacctggacaccctcttctgcctcacc tccatcttccatctctgtttcatttccattgaccgccactgtgccatctgtgaccccctg ctctatccctccaagttcacagtgagggtggctctcaggtacatcctggcaggatggggg gtgcccgcagcatacacttcgttattcctctacacagatgtggtagagacaaggctcagc cagtggctggaagagatgccttgtgtgggcagttgccagctgctgctcaataaattttgg ggctggttaaacttccctttgttctttgtcccctgcctcattatgatcagcttgtatgtg aagatctttgtggttgctaccagacaggctcagcagattaccacattgagcaaaagcctg gctggggctgccaagcatgagagaaaagctgccaagaccctgggcattgctgtgggcata tacctcttgtgctggctgcccttcaccatagacacgatggtcgacagcctccttcacttt atcacacccccactggtctttgacatctttatctggtttgcttacttcaactcagcctgc aaccccatcatctatgtcttttcctaccagtggtttcggaaggcactgaaactcacactg agccagaaggtcttctcaccgcagacacgcactgttgatttgtaccaagaatga >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_8|109_aa MTDEAEVKNGGGAEYDVTAAVDFAKEVSKIPAGLDGCDHTEGHDHTAQQKVSDGHGEDQE VGRGVELLEVSDGNHYDHVAQHCHHYRPDHDQLNAQALNSPIGHEEELL >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_8|330_bp atgacagatgaagcagaggtgaaaaatggaggtggtgcagagtatgatgtcacagcagct gtggactttgcaaaagaggtctccaaaataccagcaggactcgatggatgtgatcacact gaagggcatgaccacacagctcagcaaaaagtcagtgatggccatggagaggatcaagaa gttggtcggggagtggagctgcttgaagtgagcgatggaaatcattacgatcatgttgcc cagcattgtcatcactatagacccgatcatgaccagcttaatgcacaggctctgaactcc ccaatcggccatgaggaggaactgctctga >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_9|86_aa MVIKMVKVDTGEYKRGQGGKHMADDSSMLFGNDKDFAINTNHDGARGKETCSEQYCPPEF VESKGTEIEASNKYLIALHTGNIGLR >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_9|261_bp atggtcataaaaatggtaaaagtagacactggggaatacaagagaggacaaggagggaag cacatggctgatgactcgagcatgctgtttggaaacgataaagattttgccataaatacc aaccatgatggagccaggggtaaagaaacatgtagtgaacaatattgtcccccagaattt gttgaaagtaagggcacagaaattgaagcaagcaacaagtatcttatagctctgcatacc ggaaacatcggcctcagatag >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_10|85_aa MKNGCYAKHQVKTIINLAKCKPKIPAVLHRSDHTIWHDGESQEEISDGHGEDEEVGWCVK LLEVGNGNYHGKIAKYCDEYGSCHK >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_10|258_bp atgaaaaatggatgttatgctaagcatcaggtcaaaactataataaatcttgcaaaatgt aagcccaaaataccagcagttctccaccgatctgatcatactatatggcatgatggtgaa tcccaggaggaaatcagtgatggccatggagaggatgaggaagttggttggtgtgtgaag ctgcttgaagtaggaaatggaaattatcatggcaagattgccaaatattgtgatgaatat ggatcctgccataaatga >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_11|85_aa MSKDFMTKTPKAMATKANIDKWDLIKLRSFCTAKETIIRVNRKPTEWEKNFAIYPSDKGL ISRVYEELKQIYKKKTTPSKSVQGL >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_11|258_bp atgagcaaggacttcatgactaaaacaccaaaagcaatggcaacaaaagccaatattgac aaatgggatctaattaaactaaggagcttctgcacagcaaaagaaactatcatcagagtg aacaggaaacctacagaatgggagaaaaattttgcaatctatccatctgacaaagggcta atatccagagtctacgaggaacttaaacaaatttacaagaaaaaaacaaccccatcaaaa agtgtgcaaggactatga >gi568815592f:132438290_132639333|GENSCAN_predicted_peptide_12|163_aa XSGTSTADLLEFAGGPLQTLFAWVSPAKAAEQQKLLPAPSSGRDVLPIEEESREAVWLQL LFCAAVGCAQSKLPGGIVYTVRGKQPTQASVTVDAPPNTKLECPRLTADCCAASENFKPV DLSLLGSMGEGPSEQDHLTPWLYPLFHGSEWFCLAGIPGATGI >gi568815592f:132438290_132639333|GENSCAN_predicted_CDS_12|492_bp nagtcaggcacctctactgctgatctgctggagtttgctgggggtccactccagaccctg tttgcctgggtatcaccagccaaggctgcagaacagcaaaaattgctgcctgctccttcc tctggaagagatgtcctgcccatagaggaggaatctagagaggcagtctggctacagcta cttttctgtgctgcagtgggctgtgctcagtccaaacttccaggaggcattgtttacact gtgaggggaaaacagcctactcaagcctcagtaacggtggacgcccctcccaacaccaag ctcgagtgtcccaggttgactgcagactgctgtgctgccagtgagaatttcaagccagtg gatcttagcttgctgggctccatgggagagggaccctctgagcaagaccacttgactccc tggctttatccccttttccatgggagtgaatggttctgtcttgctggcattccaggtgcc actgggatatga