GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:06:41 Sequence gi568815591r:123931864_124132901 : 201038 bp : 37.60% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 21937 22661 725 2 2 76 108 488 0.174 39.82 1.02 Intr + 23134 23223 90 0 0 94 115 43 0.990 6.87 1.03 Term + 27621 28106 486 2 0 84 32 252 0.889 12.91 1.04 PlyA + 28447 28452 6 1.05 2.00 Prom + 34582 34621 40 -4.85 2.01 Init + 40731 40834 104 2 2 50 93 40 0.132 -1.54 2.02 Intr + 41667 41749 83 0 2 102 58 79 0.133 4.66 2.03 Term + 69522 69739 218 1 2 95 46 113 0.205 4.12 2.04 PlyA + 70339 70344 6 -0.45 3.03 PlyA - 70450 70445 6 1.05 3.02 Term - 72885 72400 486 0 0 87 45 244 0.995 13.71 3.01 Init - 82351 82181 171 1 0 98 32 112 0.598 6.09 3.00 Prom - 98052 98013 40 -6.35 4.02 PlyA - 99086 99081 6 1.05 4.01 Sngl - 101140 99998 1143 1 0 77 41 991 0.691 89.45 4.00 Prom - 104237 104198 40 -5.55 5.05 PlyA - 105771 105766 6 1.05 5.04 Term - 107881 107816 66 1 0 79 49 77 0.370 -0.14 5.03 Intr - 116523 116405 119 1 2 66 80 46 0.130 0.86 5.02 Intr - 116991 116624 368 2 2 42 54 206 0.273 6.46 5.01 Init - 118577 118528 50 2 2 103 84 14 0.968 2.97 5.00 Prom - 140088 140049 40 -3.55 6.00 Prom + 140685 140724 40 -4.35 6.01 Init + 148116 148155 40 2 1 50 89 29 0.316 -2.36 6.02 Intr + 151002 151078 77 1 2 105 83 40 0.250 3.52 6.03 Term + 158573 158677 105 1 0 63 54 124 0.612 3.93 6.04 PlyA + 158680 158685 6 1.05 7.02 PlyA - 159987 159982 6 1.05 7.01 Sngl - 165160 164126 1035 1 0 64 46 456 0.994 35.85 7.00 Prom - 165936 165897 40 -5.05 8.02 PlyA - 166105 166100 6 1.05 8.01 Sngl - 167313 166333 981 0 0 88 43 800 0.992 72.05 8.00 Prom - 175295 175256 40 -2.95 9.02 PlyA - 175638 175633 6 1.05 9.01 Term - 176362 176288 75 1 0 79 42 130 0.353 4.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:123931864_124132901|GENSCAN_predicted_peptide_1|433_aa XSPRINATGQGVTIFYVDRLGYYPYIDSITGVTVNGGIPQKISLQDHLDKAKKDITFYMP VDNLGMAVIDWEEWRPTWARNWKPKDVYKNRSIELVQQQNVQLSLTEATEKAKQEFEKAG KDFLVETIKLGKLLRPNHLWGYYLFPDCYNHHYKKPGYNGSCFNVEIKRNDDLSWLWNES TALYPSIYLNTQQSPVAATLYVRNRVREAIRVSKIPDAKSPLPVFAYTRIVFTDQVLKFL SQDELVYTFGETVALGASGIVIWGTLSIMRSMKSCLLLDNYMETILNPYIINVTLAAKMC SQVLCQEQGVCIRKNWNSSDYLHLNPDNFAIQLEKGGKFTVRGKPTLEDLEQFSEKFYCS CYSTLSCKEKADVKDTDAVDVCIADGVCIDAFLKPPMETEEPQIFYNASPSTLSATMFIV SILFLIISSVASL >gi568815591r:123931864_124132901|GENSCAN_predicted_CDS_1|1302_bp ngaagcccccgaataaacgccaccgggcaaggtgttacaatattttatgttgatagactt ggctactatccttacatagattcaatcacaggagtaactgtgaatggaggaatcccccag aagatttccttacaagaccatctggacaaagctaagaaagacattacattttatatgcca gtagacaatttgggaatggctgttattgactgggaagaatggagacccacttgggcaaga aactggaaacctaaagatgtttacaagaataggtctattgaattggttcagcaacaaaat gtacaacttagtctcacagaggccactgagaaagcaaaacaagaatttgaaaaggcaggg aaggatttcctggtagagactataaaattgggaaaattacttcggccaaatcacttgtgg ggttattatctttttccggattgttacaaccatcactataagaaacccggttacaatgga agttgcttcaatgtagaaataaaaagaaatgatgatctcagctggttgtggaatgaaagc actgctctttacccatccatttatttgaacactcagcagtctcctgtagctgctacactc tatgtgcgcaatcgagttcgggaagccatcagagtttccaaaatacctgatgcaaaaagt ccacttccggtttttgcatatacccgcatagtttttactgatcaagttttgaaattcctt tctcaagatgaacttgtgtatacatttggcgaaactgttgctctgggtgcttctggaatt gtaatatggggaaccctcagtataatgcgaagtatgaaatcttgcttgctcctagacaat tacatggagactatactgaatccttacataatcaacgtcacactagcagccaaaatgtgt agccaagtgctttgccaggagcaaggagtgtgtataaggaaaaactggaattcaagtgac tatcttcacctcaacccagataattttgctattcaacttgagaaaggtggaaagttcaca gtacgtggaaaaccgacacttgaagacctggagcaattttctgaaaaattttattgcagc tgttatagcaccttgagttgtaaggagaaagctgatgtaaaagacactgatgctgttgat gtgtgtattgctgatggtgtctgtatagatgcttttctaaaacctcccatggagacagaa gaacctcaaattttctacaatgcttcaccctccacactatctgccacaatgttcattgtt agtattttgtttcttatcatttcttctgtagcgagtttgtaa >gi568815591r:123931864_124132901|GENSCAN_predicted_peptide_2|134_aa MISAHCNLCLLSSKDLPTSAYVEELRPQARATIPRHQAKIPAVKEWTAPSTEPQIQCDDK DPCSWYRLSVYLPFWGLEDCGLLTAPLGSASVGTLCEGSKPTFHLHTPVVEVLHEGSASV AGFYLDIQAFTYIF >gi568815591r:123931864_124132901|GENSCAN_predicted_CDS_2|405_bp atgatctcggctcactgcaacctctgcctcctgagctcaaaggatcttcccacctcagcc tatgtggaggagctgagaccacaagcacgtgccaccatacccagacatcaggctaagatc cctgctgtaaaggagtggacagccccttcaactgaaccccaaattcaatgtgatgacaaa gatccatgttcatggtacaggctgtcagtgtatctaccattctggggtctggaggactgt ggccttctcacagctccactaggcagtgcttcagtggggactctgtgtgagggctccaaa cccacatttcaccttcatactcctgtagtagaggttctccatgagggctctgcctctgta gcaggcttctacctggacatccaggctttcacatacatcttctag >gi568815591r:123931864_124132901|GENSCAN_predicted_peptide_3|218_aa MAEGKGEASTFFTRQQEREREQWKHQILITRPDLVRTHYHENRKGKIRLQDPIISHQTSL VTPLGSGKSNVTRDWSGPQAYRSSRTEKWSAVMWVPVPISPHQAGPLGLGLQPPPTRAIE PAPTQELPGQSIQGQLKASWPLPLQKNCPCHLWTNKGAKALCVLSTPPTICSQPKERRPV HLPWVPHPLPPMAHHQTGKPWLGSIAQTLYPGLTALSD >gi568815591r:123931864_124132901|GENSCAN_predicted_CDS_3|657_bp atggcggaaggcaaaggagaagcaagcacctttttcacaaggcagcaggagagagagagg gagcagtggaaacaccagatacttatcacacgaccagatcttgtgagaactcactatcac gagaacagaaaggggaaaatccgcctccaggatccaatcatctctcaccagacttcactg gtgacacctctgggttctggaaaatccaacgtgaccagagactggagtgggccccaagca taccgcagcagccgtacagaaaagtggtcagctgttatgtgggtacctgttcccatatct cctcaccaggcaggtcctctaggcctgggcctccagccaccacctaccagagctattgag ccagcaccaactcaggaactccctggacagagcatccaagggcaactgaaagcctcttgg cccctgcctctgcagaagaactgcccttgccacctttggactaacaaaggcgcaaaggcc ctttgtgtcttatccacacctccaacaatctgcagtcaacccaaagagagaaggccagtt catctcccatgggtgccacaccccctaccacccatggctcatcaccagacagggaaaccc tggcttgggtccatagcacaaaccctctatcctggactgactgcactgagtgattga >gi568815591r:123931864_124132901|GENSCAN_predicted_peptide_4|380_aa MAGSDVDSEGPARRGGAARRPGAPGGPGSEAAAGCPEPLSTAEAPAESATLPAWMRLYFY GMHGITLDVLVSSARRFARSPDLRMLGFSSPYRCLLHSLTHFALEKVYLQQRRCPNAFVF NFLLYPSAHVGLQTLAGQALLLSLGGGAGVAVAPGALDLALQYVLALYHCQVFLKRFLRL RYGRQRRRQQQQQQQQQQQQRRGALPVPPGARVPTAAGARRRRPRGPRGAGGAPSQGLPD LPRFLFFGMHGFLDEIFFTFFFNVLGQGDGTTSGHTSLWSFFMYGSCSFVVEKLYFHLHY SRGWGTWKRVPIYVIFIYVWELSWGLGLRTCGACSWDYSHYPLNFMGLITLMYLPGWIFL SVYQDLISNVLWRVQYVPAN >gi568815591r:123931864_124132901|GENSCAN_predicted_CDS_4|1143_bp atggcggggagcgacgtggacagcgagggccccgcacggaggggcggcgcggcgcggcgt ccgggggcccctggcgggccaggaagcgaggcggcagccggctgcccggagccgctgtcc actgctgaagcgccggctgagagcgccacgctgcccgcctggatgcgcctctacttctac gggatgcacgggatcaccctggacgtgctggtgtcctcggcccggcgcttcgcccgcagc ccggacctgcggatgctaggcttctcctcgccctaccgctgcctgctgcactcgctcacc catttcgccctggagaaggtgtacctgcagcagcggcgctgtcccaacgccttcgtcttc aatttcctcctctacccctcggcccacgtggggctgcagaccctagcgggccaggcgcta ctactcagcctgggcggcggggcgggggtcgcggtggcgccaggggcgctggacctggcg ctgcagtacgtgctggcgctctaccactgccaagtgttcctgaagcgcttcctgcgcttg cggtacgggcgacagaggcggcggcagcagcaacagcagcagcagcagcagcagcagcag cggaggggcgcgctccccgtccctcccggcgcccgggtccctactgcggccggagcccgg cggcgacgaccccgtggccccaggggcgccgggggagcccccagccaggggctgcccgac ctaccccgctttcttttcttcggaatgcacggctttctggatgagatcttcttcaccttc ttcttcaacgtactggggcagggggacgggacaaccagcggccacacgtcgctctggtcc ttctttatgtacggcagctgcagtttcgtggtggaaaagctctacttccacctccactac agccgcggttggggcacttggaagcgggtgcccatctacgtgatcttcatctacgtgtgg gagctgtcctggggtctgggactccgcacgtgcggggcttgttcctgggactattctcac tacccgctcaattttatgggcctcatcaccctgatgtatttacctggctggatattcctt agtgtgtaccaggacctaatttccaacgtgttgtggagggtgcagtacgtaccagctaac taa >gi568815591r:123931864_124132901|GENSCAN_predicted_peptide_5|200_aa MPLSSDQWAGDESAISRDPSYPELPAANPYGSIATSVLASSEERTRLRGIKQKKRPRQIS EQEWKFVKKSLEQERKENSLGRNPTRCLKFQQRKRAKRGVFSLDPRTFIRSPLSHDSSPR VGFPHAQCFPYPLDLGMPSMHPHKVASPWGLYSIKVLMLTGVDHQEMVSLLPNHHQTFPE PLSMKSQKQPSTEAMFPDEK >gi568815591r:123931864_124132901|GENSCAN_predicted_CDS_5|603_bp atgccactaagttctgaccaatgggctggggatgaaagtgcaatttccagagatccaagt taccctgagttaccagcggcgaatccgtatgggtccatagcaacttcagtccttgcctcc tcagaagaaagaactcgactgaggggcataaagcagaaaaagagaccaaggcaaatttca gagcaggagtggaagtttgttaaaaagtctttagaacaggaaagaaaggaaaattcactt ggaagaaacccaaccagatgcctgaagttccaacagagaaaaagagctaaaagaggggtc tttagccttgatcctaggactttcatacgctctcctctttcccatgattcttcccccagg gtgggcttcccgcatgcgcagtgctttccttaccctttggacctgggcatgccaagcatg catccccacaaagttgcttctccctggggtctgtattcaattaaggtgttgatgttaaca ggtgtggaccatcaggaaatggtctctctgttgccgaaccatcaccagacattcccggag cccctctccatgaaatctcagaaacaaccctcaacggaagcaatgtttcctgatgaaaaa tag >gi568815591r:123931864_124132901|GENSCAN_predicted_peptide_6|73_aa MILAHSNLHLLGSKQGSPAPRPQTSSGQWPVRNQAAQQEKMQQQGIILEAETGPSPDTKP KDPFILDFQPPEL >gi568815591r:123931864_124132901|GENSCAN_predicted_CDS_6|222_bp atgatcttggctcactccaacctccacctcctgggttcaaaacaagggtccccagccccc aggccacaaaccagtagcggtcagtggcctgttaggaaccaggccgcacagcaggagaag atgcagcaacaaggtatcatcttggaagcagagacagggccttcaccagacaccaaaccc aaagatcccttcatcttggactttcagcctccagaactgtga >gi568815591r:123931864_124132901|GENSCAN_predicted_peptide_7|344_aa MNKIDRPLARLIKKKREKNQIDAIKNDKGGITTDPTEIQTTVRKYYKHLYANKLENLEEM DKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELV PFLLKLFQSIEKEGILPNSFYEASIILIPKQGRDTTKKENFRPISLMNIDAKILNKIRAN RIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKVKNHMIISIDAEKAFDKIQ QRFIVKTLNKLGIDGTFLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFN IVLEVLARAIRQEKEIKGVQLGKEEVKLSLFADDMIIYLENPIV >gi568815591r:123931864_124132901|GENSCAN_predicted_CDS_7|1035_bp atgaacaaaattgatagaccgctagcaagactaataaagaaaaaaagagagaaaaatcaa atagatgcaataaaaaatgataaaggaggtatcaccaccgatcccacagaaattcaaact accgtcagaaaatactacaaacacctctatgcaaataaactagaaaatctagaagaaatg gataagttcctcgacacatacactctcccaagactaaaccaggaagaagttgaatctctg aatagaccaataacaggatctgaaattgtggcaataatcaatagcttaccaaccaaaaag agtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactggta ccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattt tatgaggccagcatcatcctgataccaaagcagggcagagacacaaccaaaaaagagaat tttagaccaatatccttgatgaacatagatgcaaaaatcctcaataaaatacgggcaaac cgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctggg atgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacagaacc aaagtcaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaa caacgcttcattgtaaaaactctcaataaattaggtattgatgggacatttcttaaaata ataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaac atagtgttggaagttctggccagggcaattaggcaggaaaaggaaataaagggtgttcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattatatatctagaa aaccccatcgtctaa >gi568815591r:123931864_124132901|GENSCAN_predicted_peptide_8|326_aa MGKKQSRKTGNSKKQSATPPPKERSSSPAMEQSWMENDFDELREEGFRRSNYSKLQEEIQ TKGKEVENFEKNLGLKELMELKAKARELREECRSLRSQCTQLEERVSVMEEEMNEMKREG KFREKRIKRNQQSLQEIWDYVKRPNLHLIGVPESDGENGTKLENTLQDIIQENFPNLARQ ANVQIQEIERTPQRYSSRRATPRHIIVRFTKVEVKEKMLRAAREKVRVTHKGKPIRVTAD LSAETLQARREWGPIFNILKEKNLQPSISYPAKLSFISEGEIKYFTDKQMLRDFVTTRPA LKELLKEALNMERNNQYQLLQNHAKM >gi568815591r:123931864_124132901|GENSCAN_predicted_CDS_8|981_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagtgccactcctcct ccaaaggaacgcagttcctcaccagcaatggaacaaagctggatggagaatgactttgac gagttgagagaagaaggcttcagacgatcaaactactccaagctacaggaggaaattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttaggcttaaaggagctgatggag ctgaaagccaaggctcgagaactacgtgaagaatgcagaagcctcaggagccaatgcact caactggaagaaagggtatcagtgatggaagaggaaatgaatgaaatgaagcgagaaggg aagtttagagaaaaaagaataaaaagaaaccaacaaagcctccaagaaatatgggactat gtgaaaagaccaaatctgcatctgattggtgtacctgaaagtgacggggagaatggaacc aagttggaaaacactctgcaggatattatccaggagaacttccccaatctagcaaggcag gccaatgttcagattcaggaaatagagagaacgccacaaagatactcctcgagaagagca actccaagacacataattgtcagattcaccaaagttgaagtgaaggaaaaaatgttaaga gcagccagagagaaagttcgggttacccacaaagggaagcccatccgagtaacagcggat ctctcagcagaaactctacaagccagaagagagtgggggccaatattcaacattcttaaa gaaaagaatttgcaacccagcatttcatatccagccaaactaagcttcataagtgaagga gaaataaaatactttacagacaagcaaatgctgagagattttgtcaccaccaggcctgcc ctaaaagagctcctaaaggaagcactaaacatggaaaggaacaaccagtaccagttgctg caaaatcatgccaaaatgtaa >gi568815591r:123931864_124132901|GENSCAN_predicted_peptide_9|24_aa VLEPGEPAVSSPRAQRPENLKSDI >gi568815591r:123931864_124132901|GENSCAN_predicted_CDS_9|75_bp gttttagaacctggagaacctgctgtttcaagtcctcgagcccaaaggccagagaacctg aagtctgatatctaa