GENSCAN 1.0 Date run: 7-Nov-116 Time: 04:03:37 Sequence gi568815593r:135349457_135549939 : 200483 bp : 46.15% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1349 1463 115 2 1 46 42 161 0.809 5.34 1.02 PlyA + 1733 1738 6 1.05 2.09 PlyA - 1910 1905 6 1.05 2.08 Term - 2775 2714 62 1 2 24 48 66 0.419 -5.83 2.07 Intr - 3589 3490 100 1 1 109 59 103 0.817 9.18 2.06 Intr - 11151 11041 111 0 0 29 102 156 0.909 11.68 2.05 Intr - 20147 19950 198 2 0 48 95 202 0.995 16.45 2.04 Intr - 20686 20580 107 2 2 113 80 189 0.887 20.53 2.03 Intr - 39647 39466 182 1 2 75 109 262 0.592 26.51 2.02 Intr - 55907 55847 61 0 1 71 82 44 0.021 -0.11 2.01 Init - 65917 65785 133 1 1 78 47 61 0.257 1.30 2.00 Prom - 66041 66002 40 -2.46 3.00 Prom + 66981 67020 40 -1.66 3.01 Init + 78436 78533 98 0 2 102 77 24 0.259 2.74 3.02 Intr + 86255 86305 51 2 0 71 85 87 0.181 4.72 3.03 Term + 88153 88288 136 1 1 104 49 41 0.368 -0.71 3.04 PlyA + 91152 91157 6 1.05 4.09 PlyA - 92527 92522 6 1.05 4.08 Term - 96439 96333 107 2 2 52 41 89 0.285 -0.83 4.07 Intr - 97074 96990 85 0 1 85 100 32 0.808 3.49 4.06 Intr - 98773 98656 118 0 1 84 64 84 0.848 6.07 4.05 Intr - 100493 100002 492 1 0 71 31 490 0.837 33.71 4.04 Intr - 112178 112035 144 1 0 57 86 84 0.740 4.50 4.03 Intr - 117028 116663 366 0 0 87 97 293 0.938 24.36 4.02 Intr - 117372 117219 154 1 1 71 -3 142 0.857 2.53 4.01 Init - 122895 122757 139 0 1 71 80 142 0.851 12.00 4.00 Prom - 129586 129547 40 -6.56 5.04 PlyA - 131484 131479 6 1.05 5.03 Term - 133056 132904 153 0 0 101 41 117 0.746 6.22 5.02 Intr - 136987 136914 74 2 2 79 -6 52 0.104 -5.97 5.01 Init - 137564 137522 43 2 1 81 102 49 0.656 6.18 5.00 Prom - 140183 140144 40 -4.46 6.05 PlyA - 140559 140554 6 -0.45 6.04 Term - 141317 141140 178 1 1 66 49 186 0.931 9.66 6.03 Intr - 143016 142912 105 2 0 48 57 107 0.788 2.93 6.02 Intr - 154915 154831 85 2 1 85 87 4 0.068 -1.12 6.01 Init - 160660 160585 76 1 1 90 48 101 0.476 7.55 6.00 Prom - 161957 161918 40 -3.16 7.04 PlyA - 165257 165252 6 1.05 7.03 Term - 168126 168092 35 1 2 116 48 19 0.383 -1.55 7.02 Intr - 179403 179296 108 1 0 39 119 102 0.369 8.66 7.01 Init - 182652 182580 73 0 1 58 36 88 0.288 1.83 7.00 Prom - 184666 184627 40 -5.56 8.02 PlyA - 184692 184687 6 1.05 8.01 Sngl - 186234 185521 714 0 0 85 47 978 0.664 89.63 8.00 Prom - 192100 192061 40 -5.86 9.03 PlyA - 192637 192632 6 1.05 9.02 Term - 198077 197899 179 0 2 100 47 111 0.935 6.05 9.01 Init - 198420 198345 76 0 1 84 43 64 0.608 1.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 33439 33537 99 0 0 89 86 83 0.849 8.46 S.002 Term + 190930 191112 183 0 0 68 54 128 0.840 4.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:135349457_135549939|GENSCAN_predicted_peptide_1|38_aa XTRLGITYFTTDVEVSVCRVNDSITVDRGNVSLYNLQL >gi568815593r:135349457_135549939|GENSCAN_predicted_CDS_1|117_bp nncacccggctaggcattacctacttcaccaccgatgtagaagtcagtgtttgtcgggtg aacgacagcatcactgtcgatcgaggcaatgtcagcctgtacaacttgcaactataa >gi568815593r:135349457_135549939|GENSCAN_predicted_peptide_2|317_aa MEYYAAIKKDEFMSFVGTWMKLETIILSKLLQGQKTKDHMFSLIDSYVEALTSNVMVIGD GAFRRATAMSSRGGKKKSTKTSRSAKAGVIFPVGRMLRYIKKGHPKYRIGVGAPVYMAAV LEYLTAEILELAGNAARDNKKGRVTPRHILLAVANDEELNQLLKGVTIASGGVLPNIHPE LLAKKRGSKGKLEAIITPPPAKKAKSPSQKKPVSKKAGGKKGARKSKKKQGEVSKAASAD STTEGTPADGFTVLSTKSLFLGQKLNLIHSEISNLAGFEVEAIINPTNADIDLKDDLAQW HDRAALLLFDVEACQGK >gi568815593r:135349457_135549939|GENSCAN_predicted_CDS_2|954_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctttgtagggacatggatg aagctggaaaccatcattctgagcaaactattgcaaggacagaaaaccaaagaccacatg ttctcactcatagattcatatgttgaagctctaacctccaatgtgatggtaattggagat ggagcctttaggagggccaccgccatgtcgagccgcggtgggaagaagaagtccaccaag acgtccaggtctgccaaagcaggagtcatctttcccgtggggcggatgctgcggtacatc aagaaaggccaccccaagtacaggattggagtgggggcacccgtgtacatggccgccgtc ctggaatacctgacagcggagattctggagctggctggcaatgcagcgagagacaacaag aagggacgggtcacaccccggcacatcctgctggctgtggccaatgatgaagagctgaat cagctgctaaaaggagtcaccatagccagtgggggtgtgttacccaacatccaccccgag ttgctagcgaagaagcggggatccaaaggaaagttggaagccatcatcacaccaccccca gccaaaaaggccaagtctccatcccagaagaagcctgtatctaaaaaagcaggaggcaag aaaggggcccggaaatccaagaagaagcagggtgaagtcagtaaggcagccagcgccgac agcacaaccgagggcacacctgccgacggcttcacagtcctctccaccaagagcctcttc cttggccagaagctgaaccttattcacagtgaaatcagtaatttagccggctttgaggtg gaggccataatcaatcctaccaatgctgacattgaccttaaagatgacctagctcaatgg cacgacagggctgctctgctgctcttcgatgtagaggcttgccaggggaaatag >gi568815593r:135349457_135549939|GENSCAN_predicted_peptide_3|94_aa MALPQTVSTQLVVSLFGLVQVFVCSERLPELVSGLLVAPGPDLEISIALRNYETVTHTPF AFAAPLVAIGMVPSGTEPAVTHFPVPQAPASTIR >gi568815593r:135349457_135549939|GENSCAN_predicted_CDS_3|285_bp atggctctgccacaaacagtttccacccaactggtggtcagcctttttggactagtccag gtttttgtgtgctctgagaggctgccagaacttgtaagtgggctgctggtggcaccaggg cctgacctggagattagcatcgccctgaggaattatgagacagtaacgcacactcctttt gcatttgcggctcctcttgtggccatcggtatggttccttcagggactgagcctgctgtg actcacttcccagtcccccaggccccagcaagcaccatcagatag >gi568815593r:135349457_135549939|GENSCAN_predicted_peptide_4|534_aa MTACKIETTDLVELGPGVLEALIGKKARAGKETVKPMTGQFLQLSPEPRGATGKNKSPGN DPAAAIATAGAAATAGPGSPCSLQNALIYSHSFLEHHLKAEHPAHRTDPGTQQVLHKRLL NAGLEQCRNSSYSEIAGASRGVPHRRRRRIQGVEGAGERVQSAIYRQFVGLAGKALALRR DGAPQTALFATLLRIHSLSNRSAITSQSPVNFAAYLRRERSPLGQREEENAFYIMVVAMW KRHISLNIRFRMKTHVCKAYVKHVMHERTSSMEKPLTVLRVSLYHPTLGPSAFANVPPRL QHDTSPLLLGRGQDAHLQLQLPRLSRRHLSLEPYLEKGSALLAFCLKALSRKGCVWVNGL TLRYLEQVPLSTVNRVSFSGIQMLVRVEEGTSLEAFVCYFHVSPSPLIYRPEAEETDEWE GISQGQPPPGSGCQQFLFSAQKDDRFAPRASYKEVQLHSQALSIFSIRKTELEPSGWLQG LVFALDITIPHLLRQPEQAGILTLAFLCFSKNVNLAKLSVRDTGSPRGRLANAP >gi568815593r:135349457_135549939|GENSCAN_predicted_CDS_4|1605_bp atgacggcctgcaagattgaaacgactgaccttgttgaattgggacctggcgttttggag gctctgataggcaagaaggccagagcaggcaaggaaactgtgaagccaatgactggccag tttctacagctctcgccagagcctagaggtgcaaccggaaagaacaagtcccctggaaac gaccccgctgctgccatcgccactgctggtgctgctgctacggcaggccctggctctcca tgcagtctccagaatgccctcatctactcccattcatttcttgagcaccacctgaaggct gaacacccagcacacaggacagaccctggtactcagcaggtgctgcataagcgtctgtta aatgcaggactggagcagtgcagaaacagctcttactctgaaatcgcaggcgcttcccgg ggagtcccgcaccggcgcagacggcggatccagggcgtggagggggccggggaacgggtt cagagtgccatctaccggcagttcgtcggactggcaggaaaggccttggccctgcggcgg gatggagccccccagactgcgctgtttgctacgctgctccggatccattcactctccaac cgctctgcaatcacttcgcaatcaccagtaaactttgccgcctacttgagaagagaaaga tcccccctggggcagagggaggaggaaaatgctttttatattatggtagtggccatgtgg aaaagacatatttccctcaacattcgattccgaatgaaaacgcacgtttgtaaagcatat gtgaaacatgtcatgcacgaaaggacttcttccatggagaagcccctcaccgtcctgcga gtgagcctgtaccatcccacgctgggcccatctgcctttgccaatgtcccaccacggctg cagcatgataccagccctctgcttctcggacgggggcaggacgcccacctccagctgcag ctccctcgcctctcccgccgtcacctgtccctggagccctacctggagaaaggcagtgcc ctgctggccttctgcctcaaggccctgagccgcaagggctgtgtgtgggtcaatgggctg acgctgaggtacctggagcaggtccccctgagcaccgtcaacagggtctccttctcaggc atccagatgctggttcgcgtagaagaaggcacatccctggaggcttttgtctgctatttc catgtcagcccttcacccctgatttacagacctgaggctgaggaaactgacgaatgggaa ggcatctcccaggggcagcctccccctggttcaggctgtcaacaatttctgttttctgcc caaaaggatgatcgttttgcacccagggcaagctacaaggaggtccaacttcactcacag gccctgagcatcttctccatcaggaaaacagagctggaaccgtcaggctggcttcagggc ctggtcttcgcacttgatatcactatccctcacctgctcagacagccagagcaggcgggt attttaaccctcgcattcctctgcttttccaagaatgtgaatcttgccaagctgtccgtg agagacacgggctcccctcgagggagacttgcaaatgctccataa >gi568815593r:135349457_135549939|GENSCAN_predicted_peptide_5|89_aa MGVPSAPPPFFTDEDQLPSFDPPRWDSSTVLEEPVALTQLSRDDDSDDSNSHIDLGSLPL HPALPLKISVSDQTTSSHPEELITIAGPK >gi568815593r:135349457_135549939|GENSCAN_predicted_CDS_5|270_bp atgggtgttccttcagcccctccacccttcttcacggatgaagaccagctgccttccttt gacccaccaagatgggactcctccacagtcctggaagagcctgtggccttgacccaactg agcagagatgatgacagtgatgatagtaacagccacattgacctaggcagtcttcccctc cacccggccttgcctctgaagatctcagtatctgatcagacaaccagttctcaccctgaa gagctgatcacaattgcaggccccaagtga >gi568815593r:135349457_135549939|GENSCAN_predicted_peptide_6|147_aa MTVDTAEFFPSDPISPSNKDDHTAEGYSFFHPPSTGLLQLLLWKCPDSLSTSQLSSSPNQ QQVLPEHYFMLQPYPQEPSFINHTTMPDGPPIASRAPAACGDGKGPGPELEVEAAWPQLA KTAPTLASFTGASSSLSLESRGPDIVF >gi568815593r:135349457_135549939|GENSCAN_predicted_CDS_6|444_bp atgacggtggacacagctgagttcttcccatcagatcctatcagccccagtaacaaggat gaccatactgcagaagggtatagctttttccacccaccctcaacagggcttctccagctg ctgctctggaagtgtcctgacagcctcagcacatcacagctgtccagtagccctaaccag caacaagtgttgcctgaacactacttcatgctccagccctacccgcaggagcccagtttc atcaaccacacaaccatgccagatgggccgcccatcgcgtccagagcacctgctgcgtgc ggcgacgggaaaggccccggccctgaactggaggtggaggcggcttggccacagctggct aagaccgcgcctacgctggcatcttttactggagcttcttcctcgctcagcctggagtct agaggtcccgacatcgtcttctga >gi568815593r:135349457_135549939|GENSCAN_predicted_peptide_7|71_aa MYDPEHLSRTQSTFPKRNVFNNIVENMYAVAPTSAKQGIKAAKFNTKETNGEDFSTEDAP GPSSKYSHIRS >gi568815593r:135349457_135549939|GENSCAN_predicted_CDS_7|216_bp atgtatgacccagagcacctttccaggacccagagcacctttccaaaaagaaacgtcttt aacaacatcgttgaaaatatgtatgcagttgctccaacctcagccaagcagggaatcaag gctgcaaagttcaacaccaaagagaccaatggtgaagattttagcacagaagatgctcct ggcccgagctccaaatacagccacattaggagttag >gi568815593r:135349457_135549939|GENSCAN_predicted_peptide_8|237_aa MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGPPAPARRGAPNISRASE VPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVLP SFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSPA SDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH >gi568815593r:135349457_135549939|GENSCAN_predicted_CDS_8|714_bp atgccagcccgccttgagacctgcatctccgacctcgactgcgccagcagcagcggcagt gacctatccggcttcctcaccgacgaggaagactgtgccagactccaacaggcagcctcc gcttcggggccgcccgcgccggcccgcaggggcgcgcccaatatctcccgggcgtctgag gttccaggggcacaggacgacgagcaggagaggcggcggcgccgcggccggacgcgggtc cgctccgaggcgctgctgcactcgctgcgcaggagccggcgcgtcaaggccaacgatcgc gagcgcaaccgcatgcacaacttgaacgcggccctggacgcactgcgcagcgtgctgccc tcgttccccgacgacaccaagctcaccaaaatcgagacgctgcgcttcgcctacaactac atctgggctctggccgagacactgcgcctggcggatcaagggctgcccggaggcggtgcc cgggagcgcctcctgccgccgcagtgcgtcccctgcctgcccggtcccccaagccccgcc agcgacgcggagtcctggggctcaggtgccgccgccgcctccccgctctctgaccccagt agcccagccgcctccgaagacttcacctaccgccccggcgaccctgttttctccttccca agcctgcccaaagacttgctccacacaacgccctgtttcattccttaccactag >gi568815593r:135349457_135549939|GENSCAN_predicted_peptide_9|84_aa MGVIRSSKGVARVSICPCLTPTSSRGEEELCMCTHVSMHVSTHAGMNTWCPCTNPFPPQI SLERQLYSKDMNDDGDDDEVKKRD >gi568815593r:135349457_135549939|GENSCAN_predicted_CDS_9|255_bp atgggagtgatccgcagcagcaaaggcgtggccagggtttctatctgcccttgcctgacg cccaccagcagcaggggtgaggaagagctatgtatgtgcacgcatgtatccatgcatgtc agcacccatgcaggcatgaacacctggtgcccatgcacaaatccattccctccacaaata tcccttgagcgtcaactatactcaaaggatatgaatgatgatggtgatgatgatgaagtc aagaagagggactga