GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:02:52 Sequence gi568815591r:65860900_66082183 : 221284 bp : 44.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12332 12666 335 1 2 53 68 436 0.372 34.87 1.02 Intr + 31188 31255 68 0 2 53 75 69 0.013 0.65 1.03 Term + 78406 78635 230 2 2 7 43 252 0.618 9.39 1.04 PlyA + 81166 81171 6 1.05 2.00 Prom + 90852 90891 40 -2.06 2.01 Init + 92420 92513 94 1 1 76 6 28 0.174 -6.06 2.02 Term + 93175 93401 227 2 2 131 48 329 0.865 30.14 2.03 PlyA + 93896 93901 6 1.05 3.14 PlyA - 94955 94950 6 1.05 3.13 Term - 95279 95159 121 1 1 101 45 82 0.538 3.15 3.12 Intr - 100164 100053 112 1 1 94 68 62 0.638 4.24 3.11 Intr - 103559 103424 136 2 1 71 84 144 0.968 12.44 3.10 Intr - 107008 106832 177 1 0 75 115 214 0.997 22.92 3.09 Intr - 109467 109383 85 2 1 92 72 52 0.984 3.82 3.08 Intr - 113542 113396 147 0 0 116 81 131 0.992 14.55 3.07 Intr - 113805 113627 179 0 2 95 78 395 0.999 37.92 3.06 Intr - 114172 114020 153 1 0 89 33 226 0.992 17.47 3.05 Intr - 115303 115116 188 2 2 147 80 157 0.976 20.21 3.04 Intr - 118642 118500 143 0 2 49 65 263 0.924 20.10 3.03 Intr - 119012 118828 185 2 2 96 39 256 0.917 20.09 3.02 Intr - 119510 119325 186 2 0 93 49 260 0.982 22.49 3.01 Init - 121284 121075 210 0 0 98 64 466 0.999 41.99 3.00 Prom - 128191 128152 40 -5.66 4.00 Prom + 129228 129267 40 -2.86 4.01 Init + 132523 134133 1611 0 0 39 38 433 0.741 25.77 4.02 Intr + 177626 177779 154 1 1 91 42 105 0.026 5.85 4.03 Term + 182939 182964 26 0 2 106 55 19 0.014 -1.21 4.04 PlyA + 184551 184556 6 1.05 5.02 PlyA - 184595 184590 6 1.05 5.01 Sngl - 210419 210066 354 2 0 68 49 286 0.811 18.75 5.00 Prom - 211458 211419 40 -8.06 6.00 Prom + 212771 212810 40 -10.15 6.01 Init + 214809 214957 149 2 2 69 105 145 0.998 11.86 6.02 Intr + 215140 215194 55 1 1 67 105 90 0.745 7.58 6.03 Intr + 220904 221098 195 1 0 56 75 237 0.775 18.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 183562 183496 67 1 1 87 52 93 0.910 4.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:65860900_66082183|GENSCAN_predicted_peptide_1|210_aa MGRGGGGGGGGGGGGGGGGGGGGWVGPRRAAAAEVEAEGGGGGGGGKMAAPVLLRVSVPR WERVARYAVCAAGILLSIYAYHVEREKERDPEHRALCDLGPWVKCSAALASSFIHIVEND GIAFILMAEQYCIVSKEVKGVLETGGGRGGDLVEVTSLVMVAFMPTMMVVDMVAVGMAIM DLVIMEAILDVVEVTVILAVTTVSLQILDL >gi568815591r:65860900_66082183|GENSCAN_predicted_CDS_1|633_bp atggggcgcggcggcggcggcggcggtggtggcggcggcggcggaggcggcggtggcggc ggtggcggctgggtcgggccccgacgggcggcggcggctgaggtggaggcggagggaggc ggcggcggcggcggcgggaagatggcggctcccgtcctgctaagagtgtcggtgccgcgg tgggagcgggtggcccggtatgcagtgtgcgctgccggaatcctgctctccatctacgcc taccacgtggagcgggagaaggagcgggaccccgagcaccgggccctctgcgacctgggg ccctgggtgaagtgctccgccgcccttgcctccagttttatccatattgttgaaaatgac gggattgcattcattctcatggctgaacagtactgcattgtgtccaaagaagtaaaaggg gtcctggaaactggtggtggtcgtggaggtgatttggtggaagtgacaagtttggtcatg gtggctttcatgccaaccatgatggtggtggatatggtggcagtggggatggctatcatg gatttggtaataatggaagcaattttggatgtggtggaagtgacagtgattttggcagtt acaacagtcagtcttcaaattttggacctataa >gi568815591r:65860900_66082183|GENSCAN_predicted_peptide_2|106_aa MEKQILLSTDGQDVVKVQRRFSPNVCSSEEKGMTASAVAALILMTSSIMSVVGSLYLAYI LYFVLKEFCIICIVTYVLNFLLLIINYKRLVYLNEAWKRQLQPKQD >gi568815591r:65860900_66082183|GENSCAN_predicted_CDS_2|321_bp atggaaaagcaaattttactttccacagatgggcaggatgtagtgaaagtccagagaagg ttctcaccaaatgtgtgcagctcggaagaaaaaggcatgacagcaagcgctgtggcggct ttgatcctcatgacgtcctccatcatgtcggtcgtggggtccctgtacctggcctacatt ctgtactttgtgctgaaggagttctgcatcatctgcatcgtcacgtacgtgctgaacttc cttcttctcattatcaactacaaacgactagtttacttgaacgaggcctggaagcggcag ctgcaacccaagcaggactga >gi568815591r:65860900_66082183|GENSCAN_predicted_peptide_3|673_aa MARGSAVAWAALGPLLWGCALGLQGGMLYPQESPSRECKELDGLWSFRADFSDNRRRGFE EQWYRRPLWESGPTVDMPVPSSFNDISQDWRLRHFVGWVWYEREVILPERWTQDLRTRVV LRIGSAHSYAIVWVNGVDTLEHEGGYLPFEADISNLVQVGPLPSRLRITIAINNTLTPTT LPPGTIQYLTDTSKYPKGYFVQNTYFDFFNYAGLQRSVLLYTTPTTYIDDITVTTSVEQD SGLVNYQISVKGSNLFKLEVRLLDAENKVVANGTGTQGQLKVPGVSLWWPYLMHERPAYL YSLEVQLTAQTSLGPVSDFYTLPVGIRTVAVTKSQFLINGKPFYFHGVNKHEDADIRGKG FDWPLLVKDFNLLRWLGANAFRTSHYPYAEEVMQMCDRYGIVVIDECPGVGLALPQFFNN VSLHHHMQVMEEVVRRDKNHPAVVMWSVANEPASHLESAGYYLKMVIAHTKSLDPSRPVT FVSNSNYAADKGAPYVDVICLNSYYSWYHDYGHLELIQLQLATQFENWYKKYQKPIIQSE YGAETIAGFHQDPPLMFTEEYQKSLLEQYHLGLDQKRRKYVVGELIWNFADFMTEQSPTR VLGNKKGIFTRQRQPKSAAFLLRERYWKIANETRHQMANQYFLEFFNYFRMQEQYIHTPT AIERFTFMGKHKS >gi568815591r:65860900_66082183|GENSCAN_predicted_CDS_3|2022_bp atggcccgggggtcggcggttgcctgggcggcgctcgggccgttgttgtggggctgcgcg ctggggctgcagggcgggatgctgtacccccaggagagcccgtcgcgggagtgcaaggag ctggacggcctctggagcttccgcgccgacttctctgacaaccgacgccggggcttcgag gagcagtggtaccggcggccgctgtgggagtcaggccccaccgtggacatgccagttccc tccagcttcaatgacatcagccaggactggcgtctgcggcattttgtcggctgggtgtgg tacgaacgggaggtgatcctgccggagcgatggacccaggacctgcgcacaagagtggtg ctgaggattggcagtgcccattcctatgccatcgtgtgggtgaatggggtcgacacgcta gagcatgaggggggctacctccccttcgaggccgacatcagcaacctggtccaggtgggg cccctgccctcccggctccgaatcactatcgccatcaacaacacactcacccccaccacc ctgccaccagggaccatccaatacctgactgacacctccaagtatcccaagggttacttt gtccagaacacatattttgactttttcaactacgctggactgcagcggtctgtacttctg tacacgacacccaccacctacatcgatgacatcaccgtcaccaccagcgtggagcaagac agtgggctggtgaattaccagatctctgtcaagggcagtaacctgttcaagttggaagtg cgtcttttggatgcagaaaacaaagtcgtggcgaatgggactgggacccagggccaactt aaggtgccaggtgtcagcctctggtggccgtacctgatgcacgaacgccctgcctatctg tattcattggaggtgcagctgactgcacagacgtcactggggcctgtgtctgacttctac acactccctgtggggatccgcactgtggctgtcaccaagagccagttcctcatcaatggg aaacctttctatttccacggtgtcaacaagcatgaggatgcggacatccgagggaagggc ttcgactggccgctgctggtgaaggacttcaacctgcttcgctggcttggtgccaacgct ttccgtaccagccactacccctatgcagaggaagtgatgcagatgtgtgaccgctatggg attgtggtcatcgatgagtgtcccggcgtgggcctggcgctgccgcagttcttcaacaac gtttctctgcatcaccacatgcaggtgatggaagaagtggtgcgtagggacaagaaccac cccgcggtcgtgatgtggtctgtggccaacgagcctgcgtcccacctagaatctgctggc tactacttgaagatggtgatcgctcacaccaaatccttggacccctcccggcctgtgacc tttgtgagcaactctaactatgcagcagacaagggggctccgtatgtggatgtgatctgt ttgaacagctactactcttggtatcacgactacgggcacctggagttgattcagctgcag ctggccacccagtttgagaactggtataagaagtatcagaagcccattattcagagcgag tatggagcagaaacgattgcagggtttcaccaggatccacctctgatgttcactgaagag taccagaaaagtctgctagagcagtaccatctgggtctggatcaaaaacgcagaaaatac gtggttggagagctcatttggaattttgccgatttcatgactgaacagtcaccgacgaga gtgctggggaataaaaaggggatcttcactcggcagagacaaccaaaaagtgcagcgttc cttttgcgagagagatactggaagattgccaatgaaaccagacaccagatggccaatcag tactttctagagttctttaactacttcagaatgcaagaacagtacatacatactccaaca gccattgaaagattcacattcatgggaaaacacaagtcttga >gi568815591r:65860900_66082183|GENSCAN_predicted_peptide_4|596_aa MNIDAKILNKILANQIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINIIQHINRTKGKNHM IISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKT GTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSA QNPLKLISNFSKVSGYKINVQKSQAFLYTNNRETESQIMSELPLTIASKIIKYLGIQLTR DVKDLFKENYKPLLNEIKENTNKWNNIPHSWIGRINIVKMAILPKVIYRFNAIPIKLPMT FFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKVAVTKTAWYWYQNRD TDQWNRTEPSEIIPHIYNYLIFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLTP YTKINSRWIKDLNVRHKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAEIDKRDLIK LKSFCTAKETTIRLNRQPTQWEKIFAIYSSDKGLISRIYKELKQIYKKKTNNPNNKWMWN LQIWRANCIYFLKSAYKEIHEVQARVVQGSTVSATQGSAKDPKEQTSKVLTVPIKL >gi568815591r:65860900_66082183|GENSCAN_predicted_CDS_4|1791_bp atgaatattgatgcaaaaatcctcaataaaatactggcaaaccaaatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac atacgcaaatcaataaacataatccagcatataaacagaaccaaaggcaaaaaccacatg attatctcaatagatgcagaaaaggcttttgacaaaattcaacagcccttcatgctaaaa actctcaataaattaggtattgatgggacgtatctcaaaataataagagctatttatgac aaacccacagccaatatcatactgaatgggcaaaaactggaggcattccctttgaaaact ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctcagcc caaaatccccttaagctgataagcaactttagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttatacaccaataacagagaaacagagagccaaatcatgagt gaactcccactcacaattgcttcaaagataataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagagaat acaaacaaatggaacaacattccacactcatggataggcagaatcaatatcgtgaaaatg gccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgc attgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttc aaactatactacaaggttgcagtaaccaaaacagcatggtactggtaccaaaacagagat acagaccaatggaacagaacagagccctcagaaataataccacacatctacaactatctg atctttgacaaacctgacaaaaacaagaaatggggaaaggactccctatttaacaaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacacct tatactaaaattaattcaagatggattaaagacttaaatgttagacataaaaccataaaa accctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatg tctaaaacaccaaaagcaatggcaacaaaagccgaaattgacaaaagggatctaattaaa ctaaagagcttctgcacagcaaaagaaactaccatcagactgaacaggcaacctacacaa tgggagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatctacaaa gaactcaaacaaatttacaagaaaaaaacaaacaaccccaacaacaagtggatgtggaac ctgcagatatggagggccaactgcatttattttttaaaatctgcgtataaggagatccat gaagttcaagctcgtgttgttcaagggtcaactgtatctgcaactcaaggttctgctaaa gatcctaaggagcagacttccaaagtgttgactgttcccatcaaactctga >gi568815591r:65860900_66082183|GENSCAN_predicted_peptide_5|117_aa MRLRQAPESRKVFIQRDYSSGTGCQFQTMFSMELENQIDRQQFEEIVQTLNNLYAEAEKL GGQSYLEGCLACLTAYTIFLCLETHYQKLLKKVSKCIQEQNEKIYVPQGLLLTDSIE >gi568815591r:65860900_66082183|GENSCAN_predicted_CDS_5|354_bp atgaggctgcggcaggcaccagagtccagaaaggtgttcattcagcgagactacagcagt ggcacaggctgccagttccagaccatgttctccatggagctggagaaccagattgatagg cagcagtttgaagaaatagttcaaactctaaataacctttatgcagaagcagagaagctt ggtggccaatcatatctcgaaggttgtttggcttgtttaacagcatataccatcttccta tgcttggaaactcattaccagaagcttctgaagaaagtctccaaatgcattcaagagcag aatgagaagatctatgttccacaaggccttctcctgacagactccattgagtaa >gi568815591r:65860900_66082183|GENSCAN_predicted_peptide_6|133_aa MEATPTPPGGLLLARPSPGVGTGPAAADAIPARKALASGGRDTIRAARRRPGGPKLPDDE EPPNMASESGKLWGGRFVGAVDPIMEKFNASIAYDRHLWEVDVQGSKAYSRGLEKAGLLT KAEMDQILHGLDK >gi568815591r:65860900_66082183|GENSCAN_predicted_CDS_6|399_bp atggaggcaacgcccaccccgccgggcggcctcctattggcgcggccgtcgccaggggtg gggacaggaccggcggctgctgacgccatcccggccagaaaagccctggccagtggcggg cgcgacactatccgtgcggccaggcggagacccggaggaccgaagcttccggacgacgag gaaccgcccaacatggcctcggagagtgggaagctttggggtggccggtttgtgggtgca gtggaccccatcatggagaagttcaacgcgtccattgcctacgaccggcacctttgggag gtggatgttcaaggcagcaaagcctacagcaggggcctggagaaggcagggctcctcacc aaggccgagatggaccagatactccatggcctagacaag