GENSCAN 1.0	Date run: 4-Nov-116	Time: 10:28:21

Sequence gi568815577f:18757779_18958273 : 200495 bp : 35.86% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    349    376   28  0  1   89   84    40 0.568   3.55
 1.02 Term +   2248   2474  227  2  2  -37   40   307 0.719   9.36
 1.03 PlyA +   3076   3081    6                               1.05

 2.00 Prom +   6675   6714   40                              -8.15
 2.01 Sngl +   7023   7292  270  2  0  101   54   288 0.989  21.73
 2.02 PlyA +   9317   9322    6                               1.05

 3.00 Prom +  38141  38180   40                              -2.85
 3.01 Init +  42013  42088   76  0  1   49   83   102 0.160   7.10
 3.02 Intr +  52013  52243  231  0  0  103   45   109 0.035   4.92
 3.03 Intr +  56826  56883   58  1  1  109   65    22 0.006  -0.98
 3.04 Intr +  68931  69124  194  0  2   27   68   129 0.102   3.01
 3.05 Term +  72507  72658  152  1  2   45   48    92 0.089  -2.01
 3.06 PlyA +  73331  73336    6                               1.05

 4.00 Prom +  81098  81137   40                              -4.05
 4.01 Init +  92480  92558   79  1  1  104   67    50 0.308   5.77
 4.02 Term +  99903 100498  596  1  2   28   38   695 0.888  52.20
 4.03 PlyA + 100686 100691    6                               1.05

 5.05 PlyA - 101610 101605    6                               1.05
 5.04 Term - 107816 107618  199  1  1    0   43   150 0.162  -2.41
 5.03 Intr - 114331 112957 1375  2  1   85   53   360 0.628  19.15
 5.02 Intr - 116214 114894 1321  0  1   22   72   560 0.077  34.16
 5.01 Init - 117420 117070  351  0  0   88   55   335 0.490  27.41
 5.00 Prom - 118022 117983   40                              -9.25

 6.04 PlyA - 118449 118444    6                               1.05
 6.03 Term - 119079 118793  287  1  2  110   43   109 0.895   3.18
 6.02 Intr - 120388 120231  158  0  2   81   50   103 0.893   4.53
 6.01 Init - 124800 124649  152  0  2   88   22   134 0.618   6.36
 6.00 Prom - 131660 131621   40                              -5.35

 7.00 Prom + 137688 137727   40                              -4.95
 7.01 Init + 142189 142266   78  0  0   78   76    29 0.291   1.81
 7.02 Intr + 154741 154877  137  0  2   87   25    93 0.090   1.35
 7.03 Intr + 159312 159445  134  0  2   55   41    93 0.030   0.67
 7.04 Intr + 184161 184217   57  1  0  125  101    26 0.215   5.54
 7.05 Intr + 184330 184396   67  2  1   52   43    93 0.011  -1.76
 7.06 Intr + 191031 191189  159  0  0   86   32   115 0.033   3.88
 7.07 Intr + 192643 192682   40  1  1   97   67    42 0.060   0.31
 7.08 Term + 193317 193382   66  2  0   56   50    77 0.062  -2.34
 7.09 PlyA + 193924 193929    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815577f:18757779_18958273|GENSCAN_predicted_peptide_1|84_aa
MPNIAFATRAVQEQRQAFSPIRSSNGRLAGSEAQWTPCRIRRDGSQRQVCDGSKQQWWTV
SESSARAVTNMDQKSAVARFNRVK

>gi568815577f:18757779_18958273|GENSCAN_predicted_CDS_1|255_bp
atgcccaacattgcctttgcgaccagggccgtgcaggaacaacggcaagcctttagccca
atcaggagcagcaatgggcgcctcgctggatcagaagcacagtggacaccctgccggatc
cggagggatgggagtcagcggcaggtctgcgacggcagcaaacagcagtggtggacggtg
agtgaaagctcagctcgagctgtaacaaacatggaccagaagagtgcagttgcaagattt
aatagagtgaaatag

>gi568815577f:18757779_18958273|GENSCAN_predicted_peptide_2|89_aa
MPPSCAYKNPKILAGRLDIERNTSAEEDTSGWMLRGACRCKSTPIELAGGRPSTGGMTRN
LAREVGGEPGPLSGRTPRENHLPSGSPTC

>gi568815577f:18757779_18958273|GENSCAN_predicted_CDS_2|270_bp
atgcccccatcctgtgcctataaaaaccccaagatcctagcaggccgactggacatcgag
aggaacacgtcagcggaagaagacacaagtggctggatgttgaggggagcatgccgctgt
aagagcacaccgatagagctggcaggcggcaggccatcaaccggtggaatgacgcggaat
ttggccagggaggttggaggagagcccgggcctctgagcggcaggactccacgggaaaac
catcttccttctggctcccccacctgctga

>gi568815577f:18757779_18958273|GENSCAN_predicted_peptide_3|236_aa
MSIIFKDVGFEENGRPMAIDGREFGGAGHKLPVDILLLGLQGDGPLLTAPLGSVPVKALC
GDLNPSFSLDTTLEVICGGSVSEVGFCLDTQAFHTSLEIYAEGRRCEGRLSEYQDAVLNY
MSLLYHGLVTSHKSSHKQDQQDIWEISNVCISREECLRSVAEQLTESNVSDLATTDLLFI
TEWKSAVHVSITCTELLIWRVVLFQWWSKQQSNGSCEKIWKRIDVGHTNSYILAAL

>gi568815577f:18757779_18958273|GENSCAN_predicted_CDS_3|711_bp
atgagcatcattttcaaagacgtaggctttgaggaaaatgggagacccatggcaatcgac
ggaagagagtttggtggtgcagggcataagctgccagtggatatactattattgggtcta
caaggtgatggcccacttctcacagctccattaggcagtgttccggtgaaggctctgtgt
ggggacttgaacccttcattttcccttgatactaccctagaagttatctgtgggggttct
gtttctgaagtaggcttctgcctggacacccaggcttttcatacatccttggaaatctac
gcagaaggcaggagatgtgaaggcagattgtctgaatatcaagatgctgtcctaaattat
atgagcctgctgtatcatggcttagttacaagccacaagtcttcccataagcaggatcag
caggacatatgggaaatctccaatgtatgtatcagcagagaagagtgtctgcgcagtgtt
gcagaacaacttacagaaagtaatgtgagtgatcttgctacaacggatttattatttatt
acagagtggaaatctgcagttcatgtttccatcacctgcaccgagctcctcatttggagg
gtggttttattccagtggtggagcaaacaacaatccaatgggagctgcgagaagatttgg
aagagaattgacgtgggtcacacaaattcttacattctggctgctctttga

>gi568815577f:18757779_18958273|GENSCAN_predicted_peptide_4|224_aa
MAKRGQLTVQAVVSEGASPKPWTLTQEGKTEGVLVGWRLKKQLLGFADAAAEENRVLLAM
VNPTVFFDIAVDGEPLGRVSFELFADKVPKTAENFRALSTGEKGFGYKGSCFHRIIPGFM
CQGGDFTRHNGTGGKSIYGEKFEDENFILKHTGPGILSMANAGPNTNGSQFFICTAKTEW
LDGKHVVFGKVKEGMNIVEAMERFGSRNGKTSKKITIADCGQLE

>gi568815577f:18757779_18958273|GENSCAN_predicted_CDS_4|675_bp
atggccaaaaggggtcaacttacagttcaggctgttgtttcagagggtgcaagccccaag
ccttggacactcacacaagaaggcaagactgaaggtgtgctggtaggatggcgtcttaag
aagcaattacttggctttgcagacgccgccgccgaggaaaaccgtgtactattagccatg
gtcaaccccaccgtgttcttcgacattgccgtcgacggcgagcccttgggccgcgtctcc
tttgagctgtttgcagacaaggtcccaaagacagcagaaaattttcgtgctctgagcact
ggagagaaaggatttggttataagggttcctgctttcacagaattattccagggtttatg
tgtcagggtggtgacttcacacgccataatggcactggtggcaagtccatctatggggag
aaatttgaagatgagaacttcatcctaaagcatacaggtcctggcatcttgtccatggca
aatgctggacccaacacaaatggatcccagtttttcatctgcactgccaagactgagtgg
ttggatggcaagcatgtggtgtttggcaaagtgaaagaaggcatgaatattgtggaggcc
atggagcgctttgggtccaggaatggcaagaccagcaagaagatcaccattgctgactgt
ggacaactcgaataa

>gi568815577f:18757779_18958273|GENSCAN_predicted_peptide_5|1081_aa
MGKKQSRKTGNSKKQSASPPPKECSSSPATEQSWTENDFDELREEGFRRSNYSELREDIQ
TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSQCDQLEERETH
LTCRDTHRLKIKGWRKIYQANGKQKKAGVAILVSDKTDFKPTKIKRDKEGHYLMVKGSIQ
QEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNNPLSTLDRSMRQKVNKNT
QELNSALHQVDLIDIYRTLHPKSTEYTFFSAPQHTYSKIDRIVRSKALLSKCKRTEIITN
YLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFEANENKDTTYQ
NLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEITKIRA
ELKEIETQKTLQKINESRSWFSESINKTDRPLARLIKKKTEKNQIDAIKNDKGDITTDPT
EIQTTIREYYKHLYTNKLENLEEMDKFLDTYTLPRLNQEAVESLNRPITGSEIVAIINSL
LTKRFQDQMDSQPNSTRVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIV
SAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMRKLPFTIASKRIKYLGTQL
TRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLP
MTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLPDFKLYYKATITKTAWYWYQN
RDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWESWLTICRKLKLDPFL
TPYTKINSRWIKDLNVRPKTIKALEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDL
IKLKSFCTAKETTNRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWA
NDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHFTPVRMAIIKKSGNNRKLGI
MEEGKGEAGTFFTVQQDRVSESRGNARPLKTIRFYETHSLPPEEHGANHPHDPIISNWSH
P

>gi568815577f:18757779_18958273|GENSCAN_predicted_CDS_5|3246_bp
atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcgcctctcctcct
ccaaaggaatgcagttcctcaccagcaacggaacaaagctggacggagaatgactttgac
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccaatgcgatcaactggaagaaagggaaacccat
ctcacgtgcagagacacacataggctcaaaataaaaggatggaggaagatctaccaagcc
aatggaaaacaaaaaaaggcaggggttgcaatcctagtctctgacaaaacagactttaaa
ccaacaaagatcaaaagagacaaagaaggccattacttaatggtaaagggatcaattcaa
caagaagagctaactatcctaaatatatatgcacccaatacaggagcacccagattcata
aagcaagtcctgagtgacctacaaagagacttagactcccacacattaataatgggagac
tttaacaacccactgtcaacattagacagatcaatgagacagaaagtcaacaagaatacc
caggaattgaactcagctctgcaccaagtggacctaatagacatctacagaactctccac
cccaaatcaacagaatatacatttttttcagcaccacagcacacctattccaaaattgac
cgcatagttcgaagtaaagctctcctcagcaaatgtaaaagaacagaaattataacaaac
tatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctcacacaaaac
cgctcaactacatggaaactgaacaacctgctcctgaatgactactgggtacataacgaa
atgaaggcagaaataaaaatgttctttgaagccaacgagaacaaagacacaacataccag
aatctctgggacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac
aagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaagaactagaa
aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaaatcagagca
gaactgaaggaaatagagacacaaaaaacccttcaaaaaattaacgaatccaggagctgg
ttttctgaaagcatcaacaaaactgatagaccactagcaagactaataaagaaaaaaaca
gagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccaccgatcccaca
gaaatacaaactaccatcagagaatactacaaacacctctacacaaataaactagaaaat
ctagaagaaatggataaattcctcgacacatacactctcccaagactaaaccaggaagca
gttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatcaatagctta
ctaacaaaaagattccaggaccagatggattcacagccaaattctaccagagtgttggaa
gttctggccagggcaattaggcaggagaaggaaataaagggtattcaactaggaaaagag
gaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtc
tcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatc
aatgtacaaaaatcacaagcattcctatacaccaacaacagacaaacagagagccaaatc
atgaggaaactcccattcacaattgcttcaaagagaataaaatacctaggaacccaactt
acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatg
aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca
atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
gcccacattgccaagtcaatcctgagccaaaagaacaaagctggaggcatcacactacct
gacttcaaactatactacaaggctacaataaccaaaacagcatggtactggtaccaaaac
agagatatagatcaatggaacagaacagagccctcagaaataacaccacatatctacaac
tatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat
aaatggtgctgggaaagctggctaaccatatgtagaaagctgaaactggatcccttcctt
acaccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaacc
ataaaagccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac
ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagaaactaccaacagagtgaacaggcaacct
acaaaatgggagaaaattttcgcaacttactcatctgataaagggctaatatccagaatc
tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggca
aatgacatgaacagacacttctcaaaagaagacatttacgcagccaaaaaacacatgaaa
aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccat
tttacaccagttagaatggcaatcattaaaaagtcaggaaacaacaggaaacttggaatt
atggaggaaggcaaaggagaagcaggcaccttcttcacagtgcagcaggacagagtgagt
gaaagcaggggaaatgcaagacccttaaaaaccatcagattctatgagactcactcgcta
ccaccagaagagcatggggcaaaccacccccatgatccaatcatctccaactggtcccat
ccttga

>gi568815577f:18757779_18958273|GENSCAN_predicted_peptide_6|198_aa
MTGQLSDFNTQVKAVTFECESMHCVIHREMLASQNNSPELNSAFQLSTTLKSGDSVMNTL
HQGLQSDTQCCVESWQSSCSGTRGAPGALNISASQKKQLQLQQSSREPWGPGADLDSQHK
APTSWKSGQTVLHIGPGPHFSLSRAAEPGTPTQPLFLCLTTLIRGNPAVKEIPTRRHEKE
PTQELQQLKWPGCLMSSK

>gi568815577f:18757779_18958273|GENSCAN_predicted_CDS_6|597_bp
atgactggacagctttctgatttcaatactcaggtcaaagcggtcacttttgaatgcgag
tctatgcactgtgtcatccatagagaaatgctggctagccaaaacaattcacctgaactt
aacagtgcttttcagttgtcaaccacattaaagtcaggagattccgttatgaacacactc
caccagggtctgcagtctgacacgcagtgctgtgtggagtcttggcagagcagctgttca
ggcacacgtggagccccaggagctttaaatatctcggcttcccagaaaaagcagctgcaa
ctccagcaaagttccagagagccatggggaccaggggctgatctggactcccagcacaaa
gcacccacctcatggaaaagtggccagactgtgctccacataggtcctggtcctcacttc
tcactgagcagggctgctgagcctgggactccaacacaaccactcttcctctgcctgacc
actttaattagaggcaacccagcagtcaaagaaatacccacacgtagacatgagaaagaa
ccaacacaagaactccagcaactcaaatggccagggtgtcttatgtcctccaaatga

>gi568815577f:18757779_18958273|GENSCAN_predicted_peptide_7|245_aa
MEEDIDLSLLNVQSLRTELNGVIKIQKGFPHIRRGLIIVIYIFGHCSRICIRGHHEPSNV
VAPTDSKMSCLVLTSLVDTLWLCIATGWASAYCIKEEPLLVITARTIREVLRWNSTGETS
SCKKTSSGLLLILRDVHENLSSTKLVPDAKKVGDRCYRVNPLYGGNVLENSAIANLFSLC
CFYPDPYIELYTDLPMICFVEVVITDGRSIGLQMAYRGTSPCDRLVRFEPFIANYECGLD
TAIII

>gi568815577f:18757779_18958273|GENSCAN_predicted_CDS_7|738_bp
atggaggaggatatagatttaagtttgctgaatgtccaatccctgagaacagaattaaat
ggagtaataaaaatacagaaaggctttccacatattcgaaggggcttgattattgtgatc
tacatctttggtcactgcagccgtatctgcattagggggcaccacgaacccagtaatgtt
gtggctcctacagactcaaagatgtcctgccttgtcctgacttctcttgtggacacatta
tggctatgcattgccactggctgggcttctgcctactgcatcaaggaggagcctctactg
gttataactgcaaggacaatcagagaggtgctacgttggaattccacgggtgagacatct
agttgcaaaaaaacaagctcagggctcctactgattctacgtgatgtccatgaaaatttg
tcttccacaaaactggtccctgatgccaaaaaggttggggaccgctgctatagagtgaat
cctttgtatggaggtaatgtcctagaaaatagtgcaattgccaatctgttcagcctgtgc
tgcttctatcctgatccctacatcgaactatacacagaccttccaatgatctgttttgtg
gaagttgtcattactgatggaagaagtataggcttgcagatggcctatcgtgggacttca
ccttgtgatcgtctagttagattcgaaccctttattgccaactatgaatgtggcttggat
actgctatcattatttag