GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:12:44 Sequence gi568815597f:179202685_179453638 : 250954 bp : 40.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 361 356 6 1.05 1.04 Term - 26368 26217 152 2 2 99 36 75 0.664 0.49 1.03 Intr - 26836 26557 280 1 1 -9 101 150 0.073 2.73 1.02 Intr - 27001 26905 97 0 1 66 65 119 0.110 6.49 1.01 Init - 32486 32419 68 2 2 60 116 36 0.125 4.27 1.00 Prom - 34346 34307 40 -6.05 2.00 Prom + 40925 40964 40 -4.55 2.01 Init + 41339 41378 40 1 1 84 44 82 0.394 3.80 2.02 Term + 53060 53304 245 0 2 13 42 255 0.369 8.48 2.03 PlyA + 53468 53473 6 1.05 3.00 Prom + 56357 56396 40 -7.35 3.01 Init + 58415 58430 16 1 1 98 78 15 0.135 1.94 3.02 Intr + 78475 78603 129 2 0 66 92 62 0.326 4.15 3.03 Intr + 90689 90898 210 0 0 35 26 186 0.047 5.06 3.04 Intr + 99993 100118 126 1 0 63 77 145 0.284 10.63 3.05 Term + 120753 120832 80 1 2 111 43 58 0.122 0.55 3.06 PlyA + 121148 121153 6 1.05 4.00 Prom + 129433 129472 40 -3.85 4.01 Init + 132148 132189 42 0 0 108 64 -7 0.931 -0.63 4.02 Intr + 132822 132973 152 2 2 64 100 114 0.950 8.24 4.03 Intr + 136754 136861 108 2 0 51 116 48 0.506 2.38 4.04 Intr + 138344 138626 283 2 1 115 110 107 0.991 12.10 4.05 Intr + 139430 139508 79 1 1 67 55 27 0.932 -4.49 4.06 Intr + 140178 140259 82 1 1 84 95 62 0.951 4.28 4.07 Intr + 140906 140951 46 2 1 121 111 24 0.796 5.49 4.08 Intr + 142263 142392 130 2 1 102 92 44 0.663 5.55 4.09 Intr + 144916 145013 98 2 2 77 62 17 0.566 -3.19 4.10 Intr + 146160 146258 99 2 0 106 106 83 0.985 11.29 4.11 Intr + 147612 147747 136 2 1 80 100 147 0.952 14.32 4.12 Intr + 148633 148778 146 2 2 81 82 108 0.856 8.58 4.13 Intr + 166184 166288 105 1 0 96 76 49 0.084 3.99 4.14 Intr + 167291 167394 104 1 2 104 98 31 0.662 3.75 4.15 Intr + 175953 176073 121 0 1 88 94 73 0.982 7.48 4.16 Intr + 176713 176798 86 0 2 84 61 43 0.951 -1.10 4.17 Intr + 180758 180860 103 2 1 81 86 105 0.994 8.86 4.18 Intr + 182554 182675 122 0 2 93 92 106 0.997 9.87 4.19 Intr + 191219 191359 141 2 0 36 98 166 0.953 10.95 4.20 Term + 192414 192522 109 0 1 91 44 125 0.658 5.40 4.21 PlyA + 193493 193498 6 -0.45 5.05 PlyA - 194175 194170 6 1.05 5.04 Term - 196006 195782 225 1 0 49 48 144 0.189 2.30 5.03 Intr - 199708 199673 36 1 0 86 97 21 0.142 0.34 5.02 Intr - 214322 214132 191 0 2 33 76 188 0.271 10.48 5.01 Init - 215802 215703 100 0 1 52 -2 180 0.550 5.87 5.00 Prom - 225193 225154 40 -3.55 6.08 PlyA - 225958 225953 6 1.05 6.07 Term - 234980 234945 36 2 0 87 54 84 0.525 1.26 6.06 Intr - 237208 236912 297 1 0 60 39 190 0.552 7.45 6.05 Intr - 238943 238806 138 2 0 -3 81 207 0.975 10.64 6.04 Intr - 239517 239324 194 1 2 49 106 139 0.524 10.09 6.03 Intr - 239787 239547 241 0 1 61 -29 147 0.211 -3.50 6.02 Intr - 245362 244913 450 1 0 73 94 383 0.321 29.97 6.01 Init - 246196 246059 138 1 0 75 123 1 0.384 2.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 166145 166288 144 1 0 53 76 83 0.828 3.77 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:179202685_179453638|GENSCAN_predicted_peptide_1|198_aa MVRHAHSVIMYLLAQPVLLHEPSARWFKDGGGGGGGGGGGVGGAEPRQEEGKAESPLRAR AWAGRERPEQRQEPEAGAEEECDQGSAGAREYARAGMGQQVGRVGEAPGLQQPQPRGIRG SSAARPSGRRRDPAGRTTETGFNIFTQHAWLADPGGGEGKRSGFGDPGWSVLYSGTGLCG CRKKVNGSFRRGKDEGQY >gi568815597f:179202685_179453638|GENSCAN_predicted_CDS_1|597_bp atggtcagacatgcacattcagtcatcatgtaccttctagcccagccagtgctgttacat gaaccaagcgctcggtggtttaaagatggcggcggcggtggcggcggcggcggcggcggc gttgggggcgcggagccgcgacaggaggagggaaaagcagaatctccactcagggccagg gcctgggctgggagggagagaccggagcagcgccaggagcccgaggccggagccgaggag gaatgtgaccaggggtcggcgggggcgcgggagtacgcgagagcagggatggggcagcag gtgggccgcgtcggggaagctccggggctccagcagcctcagccccgcgggatccggggc agcagtgcagccaggccctccggccgcaggcgggacccggcggggcgcaccacagagacc ggcttcaatatcttcacccagcatgcctggctggcagatcccgggggcggagaaggaaag agaagtgggtttggggaccctggatggtctgtcctctactctgggacgggactatgcggc tgcagaaagaaagtgaatggcagcttcaggagagggaaggatgaagggcagtattag >gi568815597f:179202685_179453638|GENSCAN_predicted_peptide_2|94_aa MRTQQEDAIYEAKNALQTRALATQALRARNSNGVAMVYSMASGGGVLTDDKKAAGLEREL MMAARKGLDPYSIIPPKAASVTGNPNLVPTITNK >gi568815597f:179202685_179453638|GENSCAN_predicted_CDS_2|285_bp atgaggacacaacaagaagatgccatctatgaagccaagaatgctttgcagaccagagcg ctggccacacaggccctgagggcccgcaattccaatggagtggccatggtgtactccatg gcatctggaggtggtgttcttactgatgacaagaaggcggctgggttggagagggagctc atgatggctgcacggaagggactggacccatacagtataatacccccaaaggcagcttca gtcacagggaaccctaatttggtccccaccatcaccaataagtga >gi568815597f:179202685_179453638|GENSCAN_predicted_peptide_3|186_aa MEYAHGELFVVVANEESCGNKRVLRRSWHLNYIYKSELDFSRQRCKSKVLRCSWMVSSNC NDPQLQKRLLPVGLGAKRPCSSMLPRRVPFRACRELSQSRFGCTFYCWGGSVKQDVVADN TMVGEEKMSLRNRLSKSRENPEEDEDQRNPAKESLETPSNGRIDIKQLIAKKIKLTAEAE ASNSLF >gi568815597f:179202685_179453638|GENSCAN_predicted_CDS_3|561_bp atggagtatgcacatggtgagctctttgttgtggtagctaatgaagaatcttgtggaaat aaaagagttttgagaaggagttggcatttgaactacatctataaaagtgaattggatttc agcaggcagagatgtaagagtaaagttctacgttgttcctggatggtcagttcaaactgc aatgatccacagctccagaagcggcttcttccagtcggcctaggagccaaacgcccctgc agctccatgctaccacgcagggtgccgtttcgggcttgcagggagttgtcgcagtcccgt tttggctgcacattctactgctggggtggaagtgtcaagcaagatgttgtcgccgacaat acaatggtgggtgaagagaagatgtctctaagaaaccggctgtcaaagtccagggaaaat cctgaggaagatgaagaccagagaaaccctgcaaaggagtccctagagacacctagtaat ggtcgaattgacataaaacagttgatagcaaagaagataaagttgacagcagaggcagag gcaagtaactccctcttctag >gi568815597f:179202685_179453638|GENSCAN_predicted_peptide_4|763_aa MEKPCLYEKYKISQELKPFFMKEVGSHFDDFVTNLIEKSASLDNGGCALTTFSVLEGEKN NHRANELLEVDHIRTIYHMFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVV WTWWIMFLSTFSVPYFLFQHWATGYSKSSHPLIRSLFHGFLFMIFQIGVLGFGPTYVVLA YTLPPASRFIIIFEQIRFVMKAHSFVRENVPRVLNSAKEKSSTVPIPTVNQYLYFLFAPT LIYRDSYPRNPTVRWGYVAMKFAQVFGCFFYVYYIFERLCAPLFRNIKQEPFSARVLVLC VFNSILPGVLILFLTFFAFLHCWLNAFAEMLRFGDRMFYKDWWNSTSYSNYYRTWNVVVH DWLYYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYALAVCLSFFYPVLFVLFMFFGMA FNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYARQHCPLKNNEFIPKEVLLSLT YAANAGPCPENLLPPKKIKTPKGTLPRLVDHVWHHPVRRNKFKYLIDHPVSLTGAGRDIS FLYDVTYAKGQTREKAVCPPHLARSLQSHDGVIVPHKPKTLTDTLIPEEFHIVSSTGVSG LECYDEKPNKRVEVAQLNDVMDTMLERAGVENQEYTGPTKMHKLLHILKKEQTIYNMIFH ELIRQVSVDCADRGELLSKVRERYVQMLDQIARQMIDFYKDLVTQRVMDQRILEELYNFK HVIEELTRELCLVRAHDVKLTKETEKAHKDLAQALLNAEKNAK >gi568815597f:179202685_179453638|GENSCAN_predicted_CDS_4|2292_bp atggagaaaccctgtctctacgaaaaatacaaaattagccaggaattgaagccatttttt atgaaggaagttggcagtcactttgatgattttgtgaccaatctcattgaaaagtcagca tcattagataatggtgggtgcgctctcacaaccttttctgttcttgaaggagagaaaaac aaccatagagcgaatgaactgcttgaagtggaccacatcagaacaatatatcacatgttt attgccctcctcattctctttatcctcagcacacttgtagtagattacattgatgaagga aggctggtgcttgagttcagcctcctgtcttatgcttttggcaaatttcctaccgttgtt tggacctggtggatcatgttcctgtctacattttcagttccctattttctgtttcaacat tgggccactggctatagcaagagttctcatccgctgatccgttctctcttccatggcttt cttttcatgatcttccagattggagttctaggttttggaccaacatatgttgtgttagca tatacactgccaccagcttcccggttcatcattatattcgagcagattcgttttgtaatg aaggcccactcatttgtcagagagaacgtgcctcgggtactaaattcagctaaggagaaa tcaagcactgttccaatacctacagtcaaccagtatttgtacttcttatttgctcctacc cttatctaccgtgacagctatcccaggaatcccactgtaagatggggttatgtcgctatg aagtttgcacaggtctttggttgctttttctatgtgtactacatctttgaaaggctttgt gcccccttgtttcggaatatcaaacaggagcccttcagcgctcgtgttctggtcctatgt gtatttaactccatcttgccaggtgtgctgattctcttccttactttttttgcctttttg cactgctggctcaatgcctttgctgagatgttacgctttggtgacaggatgttctataag gattggtggaactccacgtcatactccaactattatagaacctggaatgtggtggtccat gactggctatattactatgcttacaaggactttctctggtttttctccaagagattcaaa tctgctgccatgttagctgtctttgctgtatctgctgtagtacacgaatatgccttggct gtttgcttgagctttttctatcccgtgctcttcgtgctcttcatgttctttggaatggct ttcaacttcattgtcaatgatagtcggaaaaagccgatttggaatgttctgatgtggact tctcttttcttgggcaatggagtcttactctgcttttattctcaagaatggtatgcacgt cagcactgtcctctgaaaaataatgagttcattcccaaagaagttcttctttctctgacc tatgcggccaatgctggtccttgtcctgaaaacttactacctcctaagaaaattaaaacc ccaaagggcactcttccacgccttgtagaccatgtctggcatcaccctgttcgaaggaat aaattcaaatacctgattgaccatcccgtctccctcacaggagctggaagggatatttct tttctgtacgatgtaacatatgccaaaggacagactagggagaaggcagtttgtccccca catttggcccgttcattacagtcacatgatggtgtcattgtgccccataagccaaagaca ctaacagatactttgattcctgaagaatttcatattgtgtcaagtacaggagtttcaggt ttggagtgttatgatgagaaacctaataaaagagtagaagtggcccagctgaatgatgtg atggatactatgctagagagggctggtgtggaaaatcaggaatatacaggaccaacgaag atgcacaaactactacatatattgaagaaggaacagaccatttacaacatgatatttcat gaacttattcgacaagtcagtgtggactgtgcagacagaggagaacttctgtctaaagtc agagagaggtatgtgcaaatgcttgaccagattgctcggcagatgattgatttctacaaa gacttggtaactcagcgagtgatggaccagcgcattttagaagaattgtataatttcaag catgttattgaagaactgaccagggaactgtgtctagttcgggcacatgatgtgaaatta acaaaggaaacagaaaaagcccacaaggatttggcacaagctcttttaaatgcggaaaag aatgccaagtga >gi568815597f:179202685_179453638|GENSCAN_predicted_peptide_5|183_aa MGHDDNGGFVEWKGGKGGEKIEKSDGCRVCVERSNTDACKDVEKGELFYKRAVYARKFPN AEGAEKPKTKADKSNLWVKGVLLGKLTEALSRVATRQKARSSAYKEGGKRMHIDLHLPEP CPHAKTTTSTTVHTVASRGPPTLFQLCCLHSVVTPTWKQAPQHLLQPMRMHPHNIATTAA ADT >gi568815597f:179202685_179453638|GENSCAN_predicted_CDS_5|552_bp atgggccatgatgacaatggcggttttgtggaatggaaagggggaaaaggtggggaaaag attgagaaatcggatggttgccgtgtctgtgtagaaagaagcaatacagatgcttgcaaa gatgtggagaaaggagaactgttctataaaagggccgtatatgcaaggaagtttccaaat gccgaaggagctgagaaaccaaaaaccaaggcagacaaatccaatttgtgggtaaagggt gttttattggggaaactaacagaagcattgtctagggtagccacaagacagaaagcaaga tcatcagcttataaagagggtggaaagagaatgcacatagacctgcatctgccagagcct tgcccccatgctaagaccaccaccagcacaactgtgcacacagttgccagcaggggcccc ccaaccctcttccagctgtgctgcctccactctgtggtgacacccacatggaagcaggca ccccagcacctgttgcagccaatgagaatgcacccccacaacattgccactactgctgct gctgacacatga >gi568815597f:179202685_179453638|GENSCAN_predicted_peptide_6|497_aa MDQIPLYKSKTIKLLEELNVNLHDFKFRKGFLDMTPKVQTKKKIKSNFSMSVTVPVKSPN ALSYTNPGSSLVSPSLAASSTLTDLNTLSPPQTILHRNVSPGAPQRPPSTGNAGGMLGTT DFTVPNGTGSSPVGNEFVNSRASPNLIGATGANSLGKVMPTKSPPPPGGSNLGMNSGKPD LRVVIPASSKGMTPPLAGTLDVENWDRAGEGLKQAHQKGLKVDSSVFSTWSLVRTVLLPL SPYYSAGQQAESKNLKESVVPPTAPIENKNRRGRIKKTSVRPRLVAEIETPIQRILRCAA IAGEPLGPCAFPSSVRPDPNNPQQVIHEHTPLEFKLLKELKEPYVQVLARLQEAVKRQIP HTVAAEMLTLTLAFENANADWKCALALEMEKPCGCPQGACGHGMGDWRNPGWPTMGPVPP VRAMSQLSLSAKTERRLTRVTMTSTLITWGQLKKTMQEAEKLLEHQGQAKTPDSIFLAML AIMSCASVKSDGPPIGK >gi568815597f:179202685_179453638|GENSCAN_predicted_CDS_6|1494_bp atggatcaaataccattatataagagcaaaactataaaactcttagaagaactaaacgta aatcttcatgactttaaatttaggaaaggattcttagatatgacaccaaaagtacaaaca aaaaagaaaataaaatcgaatttttcaatgtctgtcacagttccagtgaaaagccccaat gctttgtcctacactaacccagggagttcactggtgtcaccatctttggcagccagctca acgttaacagatttaaacacgctctctccacctcaaaccatattacatagaaatgtgtcc cctggagctcctcagagaccaccaagtactggcaatgcaggtgggatgttgggcactaca gacttcacagtgccaaatggaactggaagcagcccagtggggaatgaatttgtaaactca agagcttctccaaatttgattggagctactggtgcaaatagtttaggtaaagtcatgcct acaaagtctccccctccaccaggtggcagtaatcttggaatgaacagtggaaaaccagat cttcgagttgtcatccccgcttcaagcaagggcatgacacctccactagcaggcactctt gatgtggaaaattgggatagagcaggagaaggattaaaacaggctcatcaaaaaggtctt aaagttgattcttcagttttctccacgtggagtttagttcgtactgtacttctgccatta tctccttattattctgcaggacagcaggctgaatctaaaaatctgaaagaatctgtcgtc ccacccacagctccaattgaaaataaaaacaggagagggaggataaaaaaaacatctgta cggcctcgtttggtagcagaaatagaaaccccaatacaaagaattttacgctgtgctgcc atagctggagagcccttaggaccttgtgcttttcctagttccgtaaggcctgatccaaat aatccacagcaggttattcatgaacacactccactagagtttaagttgttaaaggaatta aaagaaccctatgtgcaagtacttgcaagattacaagaggcagtgaagcgtcagattcct cataccgtggccgcagaaatgctaaccttaactctagcttttgagaatgcaaatgcggat tggaaatgtgcactggcactggagatggagaaaccatgtgggtgccctcaaggtgcgtgt ggccatggaatgggagactggaggaacccagggtggccaaccatgggcccggtccctccg gtacgagccatgagccagctgagcctgagtgcaaagacggagagaaggctgacaagggtc acgatgacatcaaccctcataacctggggacaactcaagaaaaccatgcaggaagctgag aaactactggagcatcagggacaggcaaaaacccctgattccatattcttggccatgtta gccataatgtcctgtgcatctgtaaaatcggatggaccgccaattggaaaatga