GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:21:55 Sequence gi568815597r:57956136_58156480 : 200345 bp : 41.65% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1002 1048 47 2 2 45 110 70 0.829 5.11 1.02 Term + 3818 4040 223 2 1 126 49 88 0.775 4.11 1.03 PlyA + 4359 4364 6 -0.45 2.00 Prom + 4415 4454 40 -1.85 2.01 Init + 10999 11055 57 0 0 87 48 44 0.204 1.66 2.02 Term + 21914 22729 816 1 0 86 50 269 0.597 15.05 2.03 PlyA + 23451 23456 6 1.05 3.00 Prom + 27818 27857 40 -4.15 3.01 Init + 28226 28228 3 1 0 48 115 0 0.280 -1.25 3.02 Intr + 31726 31816 91 0 1 107 56 69 0.072 4.25 3.03 Intr + 62633 62691 59 0 2 115 86 36 0.027 3.78 3.04 Term + 82338 82481 144 2 0 83 43 117 0.261 3.63 3.05 PlyA + 83237 83242 6 1.05 4.08 PlyA - 83694 83689 6 1.05 4.07 Term - 92674 92174 501 1 0 7 49 529 0.329 34.39 4.06 Intr - 93045 92923 123 0 0 -28 98 157 0.563 4.96 4.05 Intr - 96645 96495 151 2 1 -49 86 95 0.082 -5.16 4.04 Intr - 100345 100002 344 1 2 50 75 382 0.011 26.40 4.03 Intr - 104167 104090 78 1 0 78 72 49 0.007 1.13 4.02 Intr - 104633 104452 182 0 2 89 98 81 0.028 7.87 4.01 Init - 106460 106337 124 2 1 61 37 126 0.025 5.18 4.00 Prom - 110613 110574 40 -5.95 5.00 Prom + 112206 112245 40 -0.25 5.01 Init + 115087 115228 142 0 1 69 66 79 0.731 4.15 5.02 Intr + 121854 121928 75 1 0 120 121 15 0.923 6.57 5.03 Intr + 123876 123971 96 1 0 118 68 59 0.478 5.96 5.04 Intr + 137024 137162 139 0 1 19 86 146 0.196 6.10 5.05 Term + 138304 138421 118 1 1 104 38 60 0.332 -0.37 5.06 PlyA + 139064 139069 6 1.05 6.00 Prom + 140224 140263 40 -9.75 6.01 Init + 141042 141256 215 2 2 93 121 103 0.958 12.36 6.02 Intr + 142055 142137 83 2 2 94 44 26 0.938 -2.84 6.03 Term + 142781 142944 164 0 2 43 47 196 0.900 8.12 6.04 PlyA + 144680 144685 6 1.05 7.00 Prom + 147109 147148 40 -2.55 7.01 Init + 151813 152019 207 0 0 64 93 138 0.878 10.77 7.02 Term + 153205 153369 165 0 0 20 39 142 0.419 -0.57 7.03 PlyA + 155372 155377 6 1.05 8.03 PlyA - 156126 156121 6 1.05 8.02 Term - 157637 157029 609 2 0 95 38 195 0.251 8.71 8.01 Init - 173569 171908 1662 1 0 66 86 555 0.195 44.98 8.00 Prom - 174030 173991 40 -6.15 9.02 PlyA - 174199 174194 6 1.05 9.01 Sngl - 175443 174667 777 0 0 88 46 631 0.979 54.66 9.00 Prom - 176792 176753 40 -5.45 10.05 PlyA - 177465 177460 6 1.05 10.04 Term - 177708 177598 111 0 0 -1 43 91 0.135 -6.72 10.03 Intr - 178131 178032 100 2 1 51 37 157 0.030 6.09 10.02 Intr - 186900 186767 134 0 2 26 81 102 0.021 1.82 10.01 Intr - 197191 197139 53 2 2 118 89 51 0.844 5.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:57956136_58156480|GENSCAN_predicted_peptide_1|89_aa MCEEIVQGHDWMSRTRQPLHKNIHSPTVHQLFFLLLAYLMQHPQMPERVSAAASTMPGLI AIRSCYGEECGRGGRDGVVWHGKPRPCGN >gi568815597r:57956136_58156480|GENSCAN_predicted_CDS_1|270_bp atgtgtgaagagattgtacagggacatgactggatgtcgaggaccaggcaacccttgcat aaaaatatccactcaccaactgttcaccaactgttcttcctcttgctggcataccttatg cagcatcctcagatgccagagcgcgtttctgctgcagcctccacgatgcctgggctgata gcaatcaggtcctgctatggtgaggagtgtgggagaggagggagagatggtgttgtctgg cacgggaagcccaggccgtgtggcaattag >gi568815597r:57956136_58156480|GENSCAN_predicted_peptide_2|290_aa MQIVGHLSMGATATVGLSKNCKKITLKFIWNRKRACIAKKILGRKNKAGGIMLPDFKLHY KATVTKTAWYWYQNRHIDQWNRTEASEITPHIYNNLIFDKPETNKQWGKDALFNKWCWEN WLVMCRKLKQDPFLTPYIKINSRWIKDSNVRPRTIKILEENLGNTIEDTGTGKDFMSRTP KAMATKAKIDKWDLIKLNGFCKAKETIIRVNRQPTKWEKNFAIYPSDKGLISRIHKELKQ IYKKKTNNPIKKWAKDMNRHFSKEDIYAASRHMKNAHHHWSLWKCKSKPQ >gi568815597r:57956136_58156480|GENSCAN_predicted_CDS_2|873_bp atgcagattgtaggtcacctttccatgggagccacagctacagtggggctgtccaagaat tgcaaaaaaattactttaaagttcatatggaaccgaaaaagagcctgcatagccaagaaa atcctgggcaggaagaacaaagctggaggcattatgctacctgacttcaaactacactac aaggctacagtaaccaaaacagcatggtactggtaccaaaacagacatatagaccagtgg aacagaacagaggcctcagaaataacaccacacatctacaacaatctgatctttgacaaa cctgaaacaaataagcaatggggaaaagatgccctatttaataaatggtgctgggaaaac tggctagtcatgtgcagaaaactgaaacaggaccccttccttacaccttatataaaaatc aactcaagatggatcaaagactcaaacgtaagacccagaaccataaaaatcctagaagaa aacctgggcaataccattgaggacacaggcaccggaaaagacttcatgtctagaacacca aaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaacggcttc tgcaaagcaaaagaaactatcatcagagtcaacaggcaacctacaaaatgggagaaaaat tttgcaatctatccatctgacaaagggctaatatccagaatccacaaagaacttaaacaa atttacaagaaaaaaacaaacaaccccatcaaaaaatgggcaaaggatatgaacagacac ttctcaaaagaagacatttatgcagccagcagacatatgaaaaatgctcatcatcactgg tcattatggaaatgcaaatcaaaaccacaatga >gi568815597r:57956136_58156480|GENSCAN_predicted_peptide_3|98_aa MENTGRLLMEKKKKQQRQRVYLGCDRMPGASGPSKSSLKNSKKYIVIKLLWEREKETFVN QRLPTEHSAMTVVLHQGFLSPGFNKLLFLPNPKHATGT >gi568815597r:57956136_58156480|GENSCAN_predicted_CDS_3|297_bp atggaaaacactggtcggttactaatggagaagaagaaaaagcagcaaagacaacgagta tacttgggatgcgatcgcatgcctggagcatctggcccttccaagtcttccctgaaaaac tcaaagaagtacattgttataaaactcctatgggaaagagaaaaggaaacatttgtgaat cagcggttaccaactgagcactcagccatgaccgtggttcttcaccagggcttcctctca cctgggtttaataagctcctttttttaccaaatccaaagcatgcaactggcacttaa >gi568815597r:57956136_58156480|GENSCAN_predicted_peptide_4|500_aa MPCSVTVDSVEQQSSKMKNGCFKEEARIVVWKVLEAGIAEAAHGCWPRKPRIGSLPWDLC TLTYNTRGMLLCSQARTLWALGQETPKTPQRVQKGQQQRNCQRASREACWKELAFGFLLK LQIQPLHSMTKKRRNNGRAKKGRGHVQPIRCTNCARCVPKDKAIKTFVIRNIVKAAAVRD ISEASVFDAYVLPKLYVKLHYCVSCAIHSKVVRNRSREARKDRTPPSRFRPAGAAPRPPP KPMNPTYKGCEDLFKENYKPLLNEIKEDRNKWKNIPCLWIGRINIMKMAILPKGTLTDCV VLRDPNTKCSGGFGFVTYATVEEVDAATNARPHKTVIQKYYTVNGHNCEVRKALSKQEMA SVSSSQRGEVVLETLVVVVEVVSVGMTTLVVEETLVVMVALVAAVMSVDMVAVGMVIMGL AIMVVMEEVALVTLEEAEAMEVVDRVMETRAVAMAGVAAMTAITIEVEVALAVAVEAVLE VVEATVIWAIKTISFQILDP >gi568815597r:57956136_58156480|GENSCAN_predicted_CDS_4|1503_bp atgccttgctctgtgactgtggactctgtggagcagcagagcagcaagatgaaaaatgga tgtttcaaagaggaagcaaggatagttgtatggaaggtcttggaagctggaatagcagag gcagcccatggctgctggcccagaaagccgagaattggcagtttaccctgggacttgtgc acactcacttacaacactcgaggcatgcttctctgttctcaggccaggactttgtgggcc cttgggcaggagaccccgaaaaccccacagagggtgcagaaagggcagcaacagaggaac tgtcagagagccagcagagaagcctgttggaaagagttggcttttggttttcttcttaaa ctgcagatccaaccacttcattcgatgacaaagaaaagaaggaacaatggtcgtgccaaa aagggccgcggccacgtgcagcctattcgctgcactaactgtgcccgatgcgtgcccaag gacaaggccattaagacattcgtcattcgaaacatagtgaaggccgcagcagtcagggac atttctgaagcgagcgtcttcgatgcctatgtgcttcccaagctgtatgtgaagctacat tactgtgtgagttgtgcaattcacagcaaagtagtcaggaatcgatctcgtgaagcccgc aaggaccgaacacccccatcccgatttagacctgcgggtgctgccccacgtcccccacca aagcccatgaatccaacttacaagggatgtgaggacctcttcaaggagaactacaaacca ctgctcaatgaaataaaagaggacaggaacaaatggaagaacattccatgcttatggata ggaagaatcaatatcatgaaaatggccatactgcccaagggaacgctcacggactgtgtg gtcctgagagatccaaacaccaagtgctccgggggctttgggtttgtcacatatgccact gtggaggaggtggatgcagccacaaatgcaaggccacacaagactgtcattcagaaatac tatactgtgaatggccacaactgtgaagttaggaaagccctgtcaaagcaagagatggct agtgtttcatccagccaaagaggtgaagtggttctggaaactttggtggtggtcgtggag gtggtttcagtgggaatgacaactttggttgtggaggaaactttagtggtcatggtggct ttggtggcagctgtgatgtcagtggatatggtggcagtggggatggttataatgggtttg gcaattatggtggttatggaggaagtggccctggttactctggaggaagcagaggctatg gaagtggtggacagggttatggaaaccagggcagtggctatggcgggagtggcagctatg acagctataacaatagaggtggaggtagctttggcagtggcagtggaagcagttttggag gtggtggaagctacagtgatttgggcaattaaaacaatcagttttcaaattttggaccca tga >gi568815597r:57956136_58156480|GENSCAN_predicted_peptide_5|189_aa MKGVTNPAEEGFSDVICTILGEHTVWMGKIQNPCEEEIGTLPPSKENAEISNLQLPLGMP AGPREGERRIQEGPGKLPERMSFEAVACKCGLANGKGRDMCPGTAISVQSPVQVEAQSTE SDSWPWGRSEFDQEAAEYAIGKADRQCRGRRKQNNMMMKRKWLAVANQDSNSSSAIHLLT ALVKITQPL >gi568815597r:57956136_58156480|GENSCAN_predicted_CDS_5|570_bp atgaagggtgttactaatccagctgaggaaggattctctgatgtgatctgtactatcctg ggagaacacacagtatggatggggaagatccagaatccatgtgaagaagaaataggaact ctacctccatcaaaggagaatgctgagattagcaatttacaactacctttgggcatgcct gctggccccagggaaggagagaggagaatccaagaaggtcctggaaagttaccagaaagg atgagtttcgaggcagttgcatgtaaatgtggcttagccaatgggaaaggaagggatatg tgtcctggtacagcaatatctgttcaatctcctgtacaagtggaggcccagagcactgaa tctgacagctggccatgggggcgtagtgaatttgaccaagaggcagctgagtatgcaatc gggaaagctgatagacagtgtaggggcagaaggaagcagaacaacatgatgatgaagaga aagtggcttgcagttgctaaccaggattcaaactccagctctgccattcatttactgaca gccttggtcaaaatcacccagcctctttaa >gi568815597r:57956136_58156480|GENSCAN_predicted_peptide_6|153_aa MALGNRSYFYPHFTNVETEAQRLSHMPKVTQLASDGAEFTPRPSALTHCTDCYKKDTTSA LMELPSNGKYKSIQDLSRKRMAHSNWVTRREFNRRPIYKAKQNNAEVQEASSGTEPGGEE ETEAMEGQMEEIQNNSTQAPQKWVFQDVLEQYQ >gi568815597r:57956136_58156480|GENSCAN_predicted_CDS_6|462_bp atggccctaggaaataggtcctatttttacccccacttcacaaatgtggaaactgaggca cagagactgagtcatatgcccaaggtcacacagctggcaagtgatggagcagaattcaca ccccggccatctgctcttacacactgtactgactgttacaagaaagacacaacctctgcc ctcatggagcttccatctaatgggaaatacaaaagcattcaggatctcagcaggaaacgg atggcacactcaaattgggtaacacgaagggagtttaacagaaggcctatttacaaagcc aagcaaaataatgcagaagtccaggaagcctccagtggcacagagccaggtggagaagaa gagacagaggccatggaggggcaaatggaagagatccagaacaactctacccaggcccct cagaaatgggtgtttcaagatgtcctagagcagtatcagtaa >gi568815597r:57956136_58156480|GENSCAN_predicted_peptide_7|123_aa MGKTWEENEWGNLKDPQFVQGDWTMDSKRSREKVAVREELMSLAAHSTGEKLRLHSESKG EPWRDFQQKNLWVQATETSTNRLKQVAGFARSLRVALEWIRELNNRTSGKKEPGGSRDLG EEN >gi568815597r:57956136_58156480|GENSCAN_predicted_CDS_7|372_bp atgggcaaaacctgggaggaaaatgaatggggaaatctgaaggacccacagtttgttcag ggggactggacaatggacagcaagagaagtagagagaaagttgctgtgagagaggaactc atgagtctggctgcacacagcactggtgagaaattgagacttcattctgagagcaagggg gagccctggcgggattttcagcagaagaatctttgggtgcaagcaacagaaaccagcacc aacagacttaaacaagtagcaggatttgctagatcactcagggtagctctagaatggatc agagaattgaataatcggacttcaggcaagaaagagccaggaggctctagggatcttggt gaggagaactaa >gi568815597r:57956136_58156480|GENSCAN_predicted_peptide_8|756_aa MKAEIKMFFETNENKDTTYQNLWDSFKAVCRGKFIALNAHKRKQERSKIDTVTSQLKELE KQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKR EKNQIDTIKNDKGDITTDPTEIQTAIREYYKHLYANKLENLEEMDKFLDTYSLPRLNQEE VESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGIL PNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILAKQIQQHIKKLIHHDQVG FIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGT YLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLDVLARAIRQKEIK GIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNN RQTESQIMSEFPFTIASKRIKYLGIQLTRDMKDLFKENCKPLLQEIKEDTNKWKNIPCSW VGRINIVKMAILPKMLPVLQGPSILSKHCTNFLASNLPLFELPLSFLQQHSLLRECVNYH NRVIEVSSSFTKTLQGFQYTNTCPHSYTHSHYTEVFFAICASAFDNRYFYDGRVEDAAEK LELGGIRSLLMANDTGLCQSVEEFGKDTIHLGITSHQVAQTRILGTIFNMSLSHAPSNQS TARSCLARILPIISSPCKILLILHSVWASCLPGSIS >gi568815597r:57956136_58156480|GENSCAN_predicted_CDS_8|2271_bp atgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacacaacataccag aatctctgggactcattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatccaaaattgacaccgtaacatcacaattaaaagaactagaa aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaaatcagagca gaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatccaggagctgg ttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaagaaaaaaaga gagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccaccgatcccaca gaaatacaaactgccatcagggaatactacaaacacctctacgcaaataaactagaaaat ctagaagaaatggataaattcctggacacatacagtctcccaagactaaaccaggaagaa gttgaatccctgaatagaccaataacaggatctgaaattgtggcaataatcaatagctta ccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccagaggtacaag gaggaactggtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctc cctaactcattttatgaggccagcatcattctgataccaaagccgggcagagacacaacc aaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaa atactagcaaaacaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc ttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcat ataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagccttt gacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattgatgggacg tatttaaaaataataagagctatctatgacaaacccacagccaatatcatactgaatggg caaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctctcacca ctcctattcaacatagtgttggacgttctggccagggcaattaggcagaaggaaataaag ggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgta tatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagc aaagtctcaggatacaaaatcaatgtacaaaaatcacaggcattcttatacaccaacaac agacaaacagagagccaaatcatgagtgaattcccattcacaattgcgtcaaagagaata aaatacctaggaatccaacttacaagggatatgaaggacctcttcaaggagaactgcaaa ccactgctccaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgg gtaggaagaatcaatatcgtgaaaatggccatactgcccaagatgctccccgtacttcaa ggaccttctatactttccaaacactgcacaaatttcctggcatcaaatctgcccctattt gagctccccttatctttcctgcagcagcacagtctgcttagggagtgtgttaattaccac aatagggtcattgaagttagctcttcgtttacaaaaaccctccagggattccagtacaca aacacatgcccacactcatacacacactctcattacacagaagttttttttgccatttgt gcttcagcttttgacaacagatacttttatgatggcagagtggaagatgcagctgaaaag ctggaacttggaggaatccgaagtttgctaatggcaaatgatacaggcctgtgtcagtca gtggaggaattcgggaaggacaccattcacctgggcatcacgagccaccaggttgcccaa accagaatccttggtaccatctttaacatgtccctctcccacgcaccctccaaccaatca acagctcggtcctgtctggccagaatcctgcccatcatctcatctccttgcaaaatcctc ctcattctccactcagtctgggcatcatgtcttccaggaagcatttcctga >gi568815597r:57956136_58156480|GENSCAN_predicted_peptide_9|258_aa MGKKQNRKTGNSKKQSASPPPKERSSSPAMEQSWMENDFDELREEGFRRSNYSELWEDIQ TKGNEVENFEKNLEECITRITNREKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNEMKREGKSREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLSAETL >gi568815597r:57956136_58156480|GENSCAN_predicted_CDS_9|777_bp atggggaaaaaacagaacagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaatggaacaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctatgggaggacattcaa accaaaggcaatgaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatagagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaagcgagaagggaagtctagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagtggaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaagggaagcccatcagactaacggcggatctctcggcagaaaccctataa >gi568815597r:57956136_58156480|GENSCAN_predicted_peptide_10|132_aa XLFGKSNKMKDVEVLCKSVWLKSGTPEGTPRISISNKCPGEADATGLGTTLRAIALENLW HKWMQQEGTILEADSKPSPDAKSADALILDFPTFRTLCEEKLHSLMPPSWLSFCNTWLPS VNDQRTLGNFTE >gi568815597r:57956136_58156480|GENSCAN_predicted_CDS_10|399_bp nggctttttggaaaatcaaataagatgaaagatgtggaagtactttgtaaatcagtctgg ttgaagagtggcactccagaaggaacaccgagaatttccatttccaataagtgcccaggt gaagctgatgctacaggtctagggacaactttgagagccattgccttagagaacttatgg cacaaatggatgcagcaagaaggcaccatcttggaagcagacagcaagccttcaccagat gccaaatctgctgatgctttgatcttagacttcccaacgttcagaactctctgtgaagaa aaattgcattcactgatgccaccaagctggctgtccttctgcaatacctggctgccaagt gtaaatgatcaaagaacacttggaaacttcacagagtga