GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:50:10 Sequence gi568815593r:135434980_135635690 : 200711 bp : 47.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 673 782 110 0 2 79 85 103 0.770 7.25 1.02 Term + 2630 2765 136 2 1 104 49 41 0.441 -0.71 1.03 PlyA + 5629 5634 6 1.05 2.09 PlyA - 7004 6999 6 1.05 2.08 Term - 10916 10810 107 0 2 52 41 89 0.298 -0.83 2.07 Intr - 11551 11467 85 1 1 85 100 32 0.802 3.49 2.06 Intr - 13250 13133 118 1 1 84 64 84 0.848 6.07 2.05 Intr - 14970 14479 492 2 0 71 31 490 0.836 33.71 2.04 Intr - 26655 26512 144 2 0 57 86 84 0.740 4.50 2.03 Intr - 31505 31140 366 1 0 87 97 293 0.938 24.36 2.02 Intr - 31849 31696 154 2 1 71 -3 142 0.857 2.53 2.01 Init - 37372 37234 139 1 1 71 80 142 0.851 12.00 2.00 Prom - 44063 44024 40 -6.56 3.04 PlyA - 45961 45956 6 1.05 3.03 Term - 47533 47381 153 1 0 101 41 117 0.746 6.22 3.02 Intr - 51464 51391 74 0 2 79 -6 52 0.104 -5.97 3.01 Init - 52041 51999 43 0 1 81 102 49 0.656 6.18 3.00 Prom - 54660 54621 40 -4.46 4.05 PlyA - 55036 55031 6 -0.45 4.04 Term - 55794 55617 178 2 1 66 49 186 0.931 9.66 4.03 Intr - 57493 57389 105 0 0 48 57 107 0.788 2.93 4.02 Intr - 69392 69308 85 0 1 85 87 4 0.068 -1.12 4.01 Init - 75137 75062 76 2 1 90 48 101 0.476 7.55 4.00 Prom - 76434 76395 40 -3.16 5.04 PlyA - 79734 79729 6 1.05 5.03 Term - 82603 82569 35 2 2 116 48 19 0.383 -1.55 5.02 Intr - 93880 93773 108 2 0 39 119 102 0.369 8.66 5.01 Init - 97129 97057 73 1 1 58 36 88 0.288 1.83 5.00 Prom - 99143 99104 40 -5.56 6.02 PlyA - 99169 99164 6 1.05 6.01 Sngl - 100711 99998 714 1 0 85 47 978 0.664 89.63 6.00 Prom - 106577 106538 40 -5.86 7.03 PlyA - 107114 107109 6 1.05 7.02 Term - 112554 112376 179 1 2 100 47 111 0.874 6.05 7.01 Init - 115492 115351 142 1 1 88 93 9 0.200 1.76 7.00 Prom - 119908 119869 40 -5.56 8.07 PlyA - 119918 119913 6 1.05 8.06 Term - 120794 120691 104 0 2 49 45 126 0.432 2.84 8.05 Intr - 123800 123720 81 0 0 64 98 49 0.177 3.11 8.04 Intr - 137432 137350 83 1 2 128 80 -30 0.309 -0.52 8.03 Intr - 139706 139593 114 1 0 104 101 241 0.981 26.56 8.02 Intr - 143560 143455 106 2 1 80 73 236 0.998 20.57 8.01 Init - 143799 143736 64 0 1 88 113 240 0.681 25.81 8.00 Prom - 147004 146965 40 -7.76 9.08 PlyA - 147252 147247 6 1.05 9.07 Term - 153670 153430 241 0 1 95 44 133 0.763 5.10 9.06 Intr - 156642 156562 81 2 0 68 94 51 0.352 2.45 9.05 Intr - 159892 159792 101 1 2 95 41 41 0.247 -0.99 9.04 Intr - 170680 170589 92 2 2 34 121 18 0.129 -0.59 9.03 Intr - 171275 171216 60 0 0 50 121 40 0.016 2.21 9.02 Intr - 192055 191954 102 2 0 121 98 14 0.476 5.85 9.01 Init - 197242 197038 205 1 1 76 48 87 0.208 2.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 105407 105589 183 1 0 68 54 128 0.839 4.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:135434980_135635690|GENSCAN_predicted_peptide_1|81_aa MDTRSHSALGQNHPSLCVCSGLLVAPGPDLEISIALRNYETVTHTPFAFAAPLVAIGMVP SGTEPAVTHFPVPQAPASTIR >gi568815593r:135434980_135635690|GENSCAN_predicted_CDS_1|246_bp atggacacgagatcacacagcgcccttggccagaaccaccccagcctctgtgtctgcagt gggctgctggtggcaccagggcctgacctggagattagcatcgccctgaggaattatgag acagtaacgcacactccttttgcatttgcggctcctcttgtggccatcggtatggttcct tcagggactgagcctgctgtgactcacttcccagtcccccaggccccagcaagcaccatc agatag >gi568815593r:135434980_135635690|GENSCAN_predicted_peptide_2|534_aa MTACKIETTDLVELGPGVLEALIGKKARAGKETVKPMTGQFLQLSPEPRGATGKNKSPGN DPAAAIATAGAAATAGPGSPCSLQNALIYSHSFLEHHLKAEHPAHRTDPGTQQVLHKRLL NAGLEQCRNSSYSEIAGASRGVPHRRRRRIQGVEGAGERVQSAIYRQFVGLAGKALALRR DGAPQTALFATLLRIHSLSNRSAITSQSPVNFAAYLRRERSPLGQREEENAFYIMVVAMW KRHISLNIRFRMKTHVCKAYVKHVMHERTSSMEKPLTVLRVSLYHPTLGPSAFANVPPRL QHDTSPLLLGRGQDAHLQLQLPRLSRRHLSLEPYLEKGSALLAFCLKALSRKGCVWVNGL TLRYLEQVPLSTVNRVSFSGIQMLVRVEEGTSLEAFVCYFHVSPSPLIYRPEAEETDEWE GISQGQPPPGSGCQQFLFSAQKDDRFAPRASYKEVQLHSQALSIFSIRKTELEPSGWLQG LVFALDITIPHLLRQPEQAGILTLAFLCFSKNVNLAKLSVRDTGSPRGRLANAP >gi568815593r:135434980_135635690|GENSCAN_predicted_CDS_2|1605_bp atgacggcctgcaagattgaaacgactgaccttgttgaattgggacctggcgttttggag gctctgataggcaagaaggccagagcaggcaaggaaactgtgaagccaatgactggccag tttctacagctctcgccagagcctagaggtgcaaccggaaagaacaagtcccctggaaac gaccccgctgctgccatcgccactgctggtgctgctgctacggcaggccctggctctcca tgcagtctccagaatgccctcatctactcccattcatttcttgagcaccacctgaaggct gaacacccagcacacaggacagaccctggtactcagcaggtgctgcataagcgtctgtta aatgcaggactggagcagtgcagaaacagctcttactctgaaatcgcaggcgcttcccgg ggagtcccgcaccggcgcagacggcggatccagggcgtggagggggccggggaacgggtt cagagtgccatctaccggcagttcgtcggactggcaggaaaggccttggccctgcggcgg gatggagccccccagactgcgctgtttgctacgctgctccggatccattcactctccaac cgctctgcaatcacttcgcaatcaccagtaaactttgccgcctacttgagaagagaaaga tcccccctggggcagagggaggaggaaaatgctttttatattatggtagtggccatgtgg aaaagacatatttccctcaacattcgattccgaatgaaaacgcacgtttgtaaagcatat gtgaaacatgtcatgcacgaaaggacttcttccatggagaagcccctcaccgtcctgcga gtgagcctgtaccatcccacgctgggcccatctgcctttgccaatgtcccaccacggctg cagcatgataccagccctctgcttctcggacgggggcaggacgcccacctccagctgcag ctccctcgcctctcccgccgtcacctgtccctggagccctacctggagaaaggcagtgcc ctgctggccttctgcctcaaggccctgagccgcaagggctgtgtgtgggtcaatgggctg acgctgaggtacctggagcaggtccccctgagcaccgtcaacagggtctccttctcaggc atccagatgctggttcgcgtagaagaaggcacatccctggaggcttttgtctgctatttc catgtcagcccttcacccctgatttacagacctgaggctgaggaaactgacgaatgggaa ggcatctcccaggggcagcctccccctggttcaggctgtcaacaatttctgttttctgcc caaaaggatgatcgttttgcacccagggcaagctacaaggaggtccaacttcactcacag gccctgagcatcttctccatcaggaaaacagagctggaaccgtcaggctggcttcagggc ctggtcttcgcacttgatatcactatccctcacctgctcagacagccagagcaggcgggt attttaaccctcgcattcctctgcttttccaagaatgtgaatcttgccaagctgtccgtg agagacacgggctcccctcgagggagacttgcaaatgctccataa >gi568815593r:135434980_135635690|GENSCAN_predicted_peptide_3|89_aa MGVPSAPPPFFTDEDQLPSFDPPRWDSSTVLEEPVALTQLSRDDDSDDSNSHIDLGSLPL HPALPLKISVSDQTTSSHPEELITIAGPK >gi568815593r:135434980_135635690|GENSCAN_predicted_CDS_3|270_bp atgggtgttccttcagcccctccacccttcttcacggatgaagaccagctgccttccttt gacccaccaagatgggactcctccacagtcctggaagagcctgtggccttgacccaactg agcagagatgatgacagtgatgatagtaacagccacattgacctaggcagtcttcccctc cacccggccttgcctctgaagatctcagtatctgatcagacaaccagttctcaccctgaa gagctgatcacaattgcaggccccaagtga >gi568815593r:135434980_135635690|GENSCAN_predicted_peptide_4|147_aa MTVDTAEFFPSDPISPSNKDDHTAEGYSFFHPPSTGLLQLLLWKCPDSLSTSQLSSSPNQ QQVLPEHYFMLQPYPQEPSFINHTTMPDGPPIASRAPAACGDGKGPGPELEVEAAWPQLA KTAPTLASFTGASSSLSLESRGPDIVF >gi568815593r:135434980_135635690|GENSCAN_predicted_CDS_4|444_bp atgacggtggacacagctgagttcttcccatcagatcctatcagccccagtaacaaggat gaccatactgcagaagggtatagctttttccacccaccctcaacagggcttctccagctg ctgctctggaagtgtcctgacagcctcagcacatcacagctgtccagtagccctaaccag caacaagtgttgcctgaacactacttcatgctccagccctacccgcaggagcccagtttc atcaaccacacaaccatgccagatgggccgcccatcgcgtccagagcacctgctgcgtgc ggcgacgggaaaggccccggccctgaactggaggtggaggcggcttggccacagctggct aagaccgcgcctacgctggcatcttttactggagcttcttcctcgctcagcctggagtct agaggtcccgacatcgtcttctga >gi568815593r:135434980_135635690|GENSCAN_predicted_peptide_5|71_aa MYDPEHLSRTQSTFPKRNVFNNIVENMYAVAPTSAKQGIKAAKFNTKETNGEDFSTEDAP GPSSKYSHIRS >gi568815593r:135434980_135635690|GENSCAN_predicted_CDS_5|216_bp atgtatgacccagagcacctttccaggacccagagcacctttccaaaaagaaacgtcttt aacaacatcgttgaaaatatgtatgcagttgctccaacctcagccaagcagggaatcaag gctgcaaagttcaacaccaaagagaccaatggtgaagattttagcacagaagatgctcct ggcccgagctccaaatacagccacattaggagttag >gi568815593r:135434980_135635690|GENSCAN_predicted_peptide_6|237_aa MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPARRGAPNISRASE VPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLP SFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPA SDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH >gi568815593r:135434980_135635690|GENSCAN_predicted_CDS_6|714_bp atgccagcccgccttgagacctgcatctccgacctcgactgcgccagcagcagcggcagt gacctatccggcttcctcaccgacgaggaagactgtgccagactccaacaggcagcctcc gcttcggggccgcccgcgccggcccgcaggggcgcgcccaatatctcccgggcgtctgag gttccaggggcacaggacgacgagcaggagaggcggcggcgccgcggccggacgcgggtc cgctccgaggcgctgctgcactcgctgcgcaggagccggcgcgtcaaggccaacgatcgc gagcgcaaccgcatgcacaacttgaacgcggccctggacgcactgcgcagcgtgctgccc tcgttccccgacgacaccaagctcaccaaaatcgagacgctgcgcttcgcctacaactac atctgggctctggccgagacactgcgcctggcggatcaagggctgcccggaggcggtgcc cgggagcgcctcctgccgccgcagtgcgtcccctgcctgcccggtcccccaagccccgcc agcgacgcggagtcctggggctcaggtgccgccgccgcctccccgctctctgaccccagt agcccagccgcctccgaagacttcacctaccgccccggcgaccctgttttctccttccca agcctgcccaaagacttgctccacacaacgccctgtttcattccttaccactag >gi568815593r:135434980_135635690|GENSCAN_predicted_peptide_7|106_aa MEVRSQVAFLKASSRPLAADAVTQVCPALSHCSVPSHSEEALLFPGPGEEELCMCTHVSM HVSTHAGMNTWCPCTNPFPPQISLERQLYSKDMNDDGDDDEVKKRD >gi568815593r:135434980_135635690|GENSCAN_predicted_CDS_7|321_bp atggaggtgcgctcacaagtggctttccttaaagccagttccaggcctcttgctgctgat gctgtgacccaggtgtgccccgcactcagccactgctctgtgccctctcactctgaggag gccctcctcttccctggccctggtgaggaagagctatgtatgtgcacgcatgtatccatg catgtcagcacccatgcaggcatgaacacctggtgcccatgcacaaatccattccctcca caaatatcccttgagcgtcaactatactcaaaggatatgaatgatgatggtgatgatgat gaagtcaagaagagggactga >gi568815593r:135434980_135635690|GENSCAN_predicted_peptide_8|183_aa MRLLAAALLLLLLALYTARVDGSKCKCSRKGPKIRYSDVKKLEMKPKYPHCEEKMVIITT KSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRKQLLELAFFWRTCIIWAKYKEIYKP KSRVYPQGRLSVEEIDSCMEMFTEASFTTASYKIPDQYSSKLSQSSKTEKARNFDSQEEP EET >gi568815593r:135434980_135635690|GENSCAN_predicted_CDS_8|552_bp atgaggctcctggcggccgcgctgctcctgctgctgctggcgctgtacaccgcgcgtgtg gacgggtccaaatgcaagtgctcccggaagggacccaagatccgctacagcgacgtgaag aagctggaaatgaagccaaagtacccgcactgcgaggagaagatggttatcatcaccacc aagagcgtgtccaggtaccgaggtcaggagcactgcctgcaccccaagctgcagagcacc aagcgcttcatcaagtggtacaacgcctggaacgagaagcgcaggaagcagcttctagag ctagcattcttctggaggacatgcattatttgggcaaaatacaaagaaatatacaagcct aagtcaagagtctatcctcagggaagactttcagtggaagaaattgatagctgcatggag atgttcactgaagcatcatttaccacagcatcctacaaaatacctgaccagtactcctca aaactgtcacagtcatcaaaaacagagaaagccagaaacttcgacagccaagaggagcct gaggagacatga >gi568815593r:135434980_135635690|GENSCAN_predicted_peptide_9|293_aa MALKTKPTHCSTEDRIFYDPVILLTHDVTMDRASKGCCKRQSGHMCCHTEVSLTHLFHPH PQEAKNHIDGLLSPQAPTSCAKPGHKWGYFYGSVARRQPAGQDSLATQTTSLTMQTFYVE EKEWPNSQQESQAVVEVVKKMLNVLISEKQVPKEDKKAYLSLCVKVLFMLSNAMHRRDLG TLQAVMPLFEEKQWELRRLEQALQKKAGYMEHSRSISTTCFMKAFPIPNLDGTALQPPIW NPRIRNSFEVFSQPAGKGLSGLQTSCRASGPQPPQPCNPGTEQALSKGSTWCL >gi568815593r:135434980_135635690|GENSCAN_predicted_CDS_9|882_bp atggccctcaagacaaagcctacgcactgtagtacagaagacagaatcttctatgaccca gtcatcttgctaactcatgatgtcaccatggatcgggcctccaaaggatgctgcaaaaga caatcaggccacatgtgttgtcacactgaagttagccttacacacctctttcatcctcat cctcaggaggccaagaaccacatagatgggctcctttctccccaggctcccacctcctgt gccaagccaggccataagtggggctacttctatgggtctgttgcaagaaggcaaccagca ggccaagatagccttgcaacccagaccacctccctgacaatgcagacattttatgtggaa gaaaaagaatggccaaattcccagcaagaaagtcaggcagtggtagaggtagtgaagaag atgttaaatgttctcatttctgaaaagcaagtacctaaggaagacaaaaaggcttacctg agcctgtgtgtgaaggtgctcttcatgctgtcaaatgctatgcataggagggatctgggc accctccaagcagtgatgccgctgtttgaggaaaagcaatgggagctgcgtaggctggag caagcgctgcagaagaaagcaggatacatggaacactccaggtctatttccactacctgc ttcatgaaagcattcccaatccccaacctggatgggactgccttacagccccccatctgg aatcctcgcatccgtaactcatttgaagtattctctcagcctgctggcaagggcctctct ggcctgcagacatcctgccgagcctctggcccgcagcccccgcagccctgcaaccccggc acagagcaggcactcagcaagggctcaacttggtgtttatag