GENSCAN 1.0	Date run: 4-Nov-116	Time: 14:09:07

Sequence gi568815576f:35446810_35652029 : 205220 bp : 50.65% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1613   1860  248  2  2   95   21   173 0.252   8.40
 1.02 Term +   7616   7704   89  0  2   92   47    97 0.305   3.82
 1.03 PlyA +  10721  10726    6                               1.05

 2.00 Prom +  11362  11401   40                              -7.06
 2.01 Init +  25754  25815   62  1  2   57   93    90 0.005   5.12
 2.02 Intr +  29965  30128  164  1  2  106   73    39 0.001   3.82
 2.03 Intr +  31793  31931  139  0  1   63   99    48 0.001   2.92
 2.04 Intr +  43417  43526  110  1  2  144   96     7 0.678   7.03
 2.05 Intr +  44770  45236  467  2  2   78   69   333 0.265  23.25
 2.06 Intr +  51962  52054   93  1  0   42  121    13 0.335   0.16
 2.07 Intr +  52112  52199   88  1  1  142   68     8 0.510   3.74
 2.08 Intr +  73520  73592   73  0  1   83   73    35 0.063  -0.04
 2.09 Intr +  76155  76268  114  0  0   79  115    37 0.315   5.06
 2.10 Term +  80223  80349  127  0  1   77   46    79 0.204   0.36
 2.11 PlyA +  84655  84660    6                               1.05

 3.05 PlyA -  85205  85200    6                               1.05
 3.04 Term -  87248  86602  647  0  2   94   43  1937 0.999 184.09
 3.03 Intr -  93862  93809   54  2  0  130   33    27 0.499   0.35
 3.02 Intr -  94098  94011   88  0  1   79   -4    94 0.895  -1.16
 3.01 Init -  94484  94311  174  2  0   63   75   158 0.676   9.35
 3.00 Prom -  95001  94962   40                              -6.96

 4.00 Prom +  95057  95096   40                             -15.08
 4.01 Init +  95419  95559  141  0  0   93   40    73 0.621   3.22
 4.02 Intr +  99992 100271  280  1  1  122   94   503 0.928  51.35
 4.03 Term + 104694 105223  530  1  2  116   53  1142 0.952 107.92
 4.04 PlyA + 107168 107173    6                               1.05

 5.00 Prom + 117168 117207   40                              -0.66
 5.01 Init + 142045 142135   91  0  1   92  113    63 0.816   9.85
 5.02 Term + 147150 147226   77  1  2   46   44   108 0.537   0.20
 5.03 PlyA + 148499 148504    6                               1.05

 6.10 PlyA - 150390 150385    6                               1.05
 6.09 Term - 160634 160488  147  2  0  108   47   275 0.999  23.30
 6.08 Intr - 164297 164075  223  1  1  110   75   322 0.644  31.23
 6.07 Intr - 170456 170354  103  0  1  100   98   115 0.402  12.93
 6.06 Intr - 175724 175626   99  0  0   94   68    23 0.143   0.98
 6.05 Intr - 180850 180767   84  2  0   60   28   118 0.074   2.79
 6.04 Intr - 188272 188146  127  1  1   92   68    27 0.322   1.35
 6.03 Intr - 196498 196393  106  0  1  110   63    16 0.013   1.52
 6.02 Intr - 201448 201400   49  2  1   87   77    28 0.042  -0.66
 6.01 Init - 204949 204817  133  1  1   78   47    69 0.324   2.10
 6.00 Prom - 205073 205034   40                              -0.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:35446810_35652029|GENSCAN_predicted_peptide_1|112_aa
XSWSHLPNHMPDPPLLLNAPQNIIVFVIHSLSTHLKFRSAPGRASDPEDAMGNETDKGPG
SPGACIQPGVGEDSDEETEGTEGAGGGSRSFPGALMGWTRLDTKVFIGKRFL

>gi568815576f:35446810_35652029|GENSCAN_predicted_CDS_1|339_bp
nncagctggagtcatcttcccaaccacatgccagaccctccactcctgctaaacgcccca
caaaacatcatcgtattcgtcattcactcactcagcacacatctgaagttccgcagtgcc
ccaggccgggcatcagaccccgaggatgcaatggggaatgagacagacaagggtcctggc
tctccaggagcatgcatccagcctggtgtgggagaagacagcgatgaggagacagaggga
actgagggagcaggcgggggcagtaggagctttccaggggctctgatgggatggacaaga
ctggacaccaaggtcttcatcggcaagaggttcctgtga

>gi568815576f:35446810_35652029|GENSCAN_predicted_peptide_2|478_aa
MLLASSLLLAPQLRAAGSGAWYYVVSPIHRKAFEHRDACPPIFVSLKTSLKAAVFMRLSV
GGWRSTLKKQPISAGDGLGESPQLPLSWDILGVNEGRKRQLQLEACGSELRPPPHSLRWL
PRSRITSSPLPFITHLCCREARVLPIFSEQSGSLEQPQDVLPAPDVPPAPDVSPAPDVLP
APDVLPAPDVSPAPDVLPAPDVSPAPDVLPAPDVLPAPDVSPAPDVLPAPDVSPAPDVLP
APDVSPAPDVLPAPDVLPAPDVSPAPDVSPAPDVSPAPDVLPAPDVFPAPDVLPAPEVLP
VSSVLLVLVCSLPQTFAFRKLVLFLSLCLCGESVPPLCQGDLMVKLPTHTPAIVGCPELT
LGFPCLVLLFTTIPAENEVPPIPPLPKSKEGVSRLKSDIFYSVLGPGNTKKMDTDTGGQG
GTSWPSEALGLREDTGSAPQGPCTDRCESMYQNLLSQNTQGFGDQGPRVMSPDLRLLL

>gi568815576f:35446810_35652029|GENSCAN_predicted_CDS_2|1437_bp
atgctcttggcctcaagccttctcctcgctccccagctcagagctgcagggtcaggagcc
tggtattatgtcgtttctcccatacatcggaaggcatttgagcacagagatgcctgtccc
cccattttcgtctccttgaaaacttcacttaaggctgctgtgttcatgaggctttctgtg
ggtggatggaggtcaaccttgaagaagcagccgatttcagctggagacggccttggcgag
agcccccagctgcccttgagttgggacatcctgggggtgaatgagggtagaaaaaggcag
ctgcagttggaggcttgcggctcagagctaaggcctccaccccacagcctccgctggctc
cccaggtcgcgtatcacctcttctccattgccatttatcacccatctctgctgcagggag
gccagggtcctgccgattttttctgagcagtctgggagcctggagcagccccaggatgtg
ctccctgccccggatgtgccccctgccccggatgtgtcccctgccccggatgtgctccct
gccccggatgtgctccctgccccggatgtgtcccctgccccggatgtgctccctgccccg
gatgtgtcccctgccccggatgtgctccctgccccggatgtgctccctgccccggatgtg
tcccctgccccggatgtgctccctgccccggatgtgtcccctgccccggatgtgctccct
gccccggatgtgtcccctgccccggatgtgctccctgccccggatgtgctccctgccccg
gatgtgtcccctgccccggatgtgtcccctgccccagatgtgtcccctgccccggatgtg
ctccctgccccggatgtgttccctgccccggatgtgctccctgccccagaggtgctccct
gtctcaagtgtgctccttgtcctagtgtgctccctgccccagacttttgccttcagaaag
ctggtgctctttctttctctctgcttatgcggagaatctgtaccacctctttgccagggg
gacttaatggttaagctccccactcacacaccagccatcgtgggctgcccagaactcacc
ctgggctttccctgtcttgtgcttttgttcacaaccattcctgcagaaaatgaggtccct
cctattccacctcttcccaaatccaaggaaggtgtcagcaggctgaagtcagatatcttt
tactctgttctaggccctggaaatacaaagaagatggacacagacactggtggacaaggg
ggcacttcctggccttctgaagctttgggtctgagggaggacacaggcagtgctccccaa
gggccttgcacagaccgctgtgaaagcatgtaccagaatctgttgtcacaaaatacgcag
ggttttggagaccagggacctcgtgtcatgtctccggacctacggcttcttctgtaa

>gi568815576f:35446810_35652029|GENSCAN_predicted_peptide_3|320_aa
MGGLRGLGGARAGGGLRLAAAGPALPRAGGLLLSQVPGGLGTSDPGVPAWGSRAGGRELS
QRRRDGFFLPARRRQQLKEEQGALLCARRCPTVTRRAKRSLIPPKGLDCIPLSNVIITII
TITITIILIITIAFITIIITVIITIIITIITIITLIIIPVITIIIITITIITVIIIPIIT
IIIITIITIITVIIPIIAIVIITTITVIIFPIITIIIITITIITIITVIIIPIITIIIIT
VIITIITIIPIITIIIIITIIITTLTTITTIIIITITTIITIIITIITTIITITVSVIII
ATIIVVIIIISQAYGPFAPF

>gi568815576f:35446810_35652029|GENSCAN_predicted_CDS_3|963_bp
atgggtgggctccgagggctcggcggggcccgggcagggggcgggctccggctcgccgcc
gccggccctgccctgccccgagccgggggcctcctgctgagccaggtgcccggaggcttg
gggacgtccgatccgggggtccccgcctggggctctcgggctgggggcagagagctctcc
cagcgccggagggacggcttctttctccccgcgcggcgacgccagcagctgaaggaggag
cagggcgctctgctctgcgcgcgcagatgccccactgtcacccgcagggcgaagaggtcg
ctgatccctccgaaaggcttggactgcatccctctttctaatgtcatcatcaccatcatt
accatcactatcaccatcatcctcatcatcaccattgccttcatcaccataatcatcacc
gtcatcatcaccatcatcatcaccatcatcaccatcatcactctcatcatcatccccgtc
atcaccatcatcatcatcaccatcaccatcatcactgtgatcatcatccccatcatcacc
atcatcatcatcaccatcatcacaatcatcactgtcatcatccccatcatcgctatcgtc
atcatcaccaccatcactgtcatcatcttccccatcatcactatcatcatcatcaccatc
accatcatcaccatcatcactgtcatcatcatccccatcatcaccatcatcatcatcact
gtcatcatcaccatcatcaccatcattcccatcatcaccatcatcatcattatcaccatt
atcatcaccacccttaccaccatcaccaccatcatcatcatcaccatcaccaccatcatc
actatcattattaccatcatcaccaccatcatcaccatcaccgtcagcgtcatcatcata
gccaccatcattgttgtcatcatcatcatctctcaagcctatggaccttttgctcctttt
tag

>gi568815576f:35446810_35652029|GENSCAN_predicted_peptide_4|316_aa
MGSLQDLRLHSSPASPGCSRQPHDCQDKVPRRKEPSMCSGLLRVKSWPRAMMKTLSSGNC
TLSVPAKNSYRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLD
ILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEA
AELPMVICGNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVLFSMA
KLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAYGMVSPFARRPSVNSDLKYIKAK
VLREGQARERDKCTIQ

>gi568815576f:35446810_35652029|GENSCAN_predicted_CDS_4|951_bp
atggggtccctgcaagacctgcgcctgcattccagtccagcctcaccaggctgctctagg
cagccccacgactgccaagataaagtgccacgaagaaaagaacccagcatgtgttcaggt
ttgctgcgggtcaagagctggccccgagccatgatgaagactttgtccagcgggaactgc
acgctcagtgtgcccgccaaaaactcataccgcatggtggtgctgggtgcctctcgggtg
ggcaagagctccatcgtgtctcgcttcctcaatggccgctttgaggaccagtacacaccc
accatcgaggacttccaccgtaaggtatacaacatccgcggcgacatgtaccagctcgac
atcctggatacctctggcaaccaccccttccccgccatgcgcaggctgtccatcctcaca
ggggatgtcttcatcctggtgttcagcctggataaccgggagtccttcgatgaggtcaag
cgccttcagaagcagatcctggaggtcaagtcctgcctgaagaacaagaccaaggaggcg
gcggagctgcccatggtcatctgtggcaacaagaacgaccacggcgagctgtgccgccag
gtgcccaccaccgaggccgagctgctggtgtcgggcgacgagaactgcgcctacttcgag
gtgtcggccaagaagaacaccaacgtggacgagatgttctacgtgctcttcagcatggcc
aagctgccacacgagatgagccccgccctgcatcgcaagatctccgtgcagtacggtgac
gccttccaccccaggcccttctgcatgcgccgcgtcaaggagatggacgcctatggcatg
gtctcgcccttcgcccgccgccccagcgtcaacagtgacctcaagtacatcaaggccaag
gtccttcgggaaggccaggcccgtgagagggacaagtgcaccatccagtga

>gi568815576f:35446810_35652029|GENSCAN_predicted_peptide_5|55_aa
MGKVRVNPEDKGGHSVGVQSQGAGEGCQKQASSLGKTNLSYVTTKGSRNSGIQTL

>gi568815576f:35446810_35652029|GENSCAN_predicted_CDS_5|168_bp
atgggcaaggtgagggtgaatcctgaagacaaaggtggccactcggtgggtgttcagagc
cagggagcaggtgaaggttgtcagaaacaagcttcctcacttggaaaaacaaatctgagc
tacgtgaccaccaagggctcccggaactctggcattcagaccctgtga

>gi568815576f:35446810_35652029|GENSCAN_predicted_peptide_6|356_aa
MEYYAAIKKDEFVSFVGTWMKLETIILSKLSQGQKTKHRMFSLLGWTYMGYENSSCGLEL
WWAFSITERREFQVYYNSCPSSSWIGDARAFSPPDPTGLYPIYSQDTQTFRLWLNYNTDF
PGSPACKWQAMELLSPQSPFRVDRGEHLPFLVKGARYTLVPAGQEGALAAWLEALRGQLG
RRGAVVSMMDAEGLERSSPDCAMGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHP
ETLEKFDKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKH
KIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFRKDMASNYKELGFQG

>gi568815576f:35446810_35652029|GENSCAN_predicted_CDS_6|1071_bp
atggaatactatgcagccataaaaaaggatgagttcgtgtcctttgtagggacatggatg
aagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg
ttctcactcctaggatggacctatatggggtatgaaaatagctcctgtggcttagagcta
tggtgggctttttccataacagaaagaagagaatttcaagtctattacaattcatgtcct
tctagttcctggattggagatgcaagagcattttccccaccagatccaactgggctttac
cccatctactcccaggatactcagaccttcagactctggctgaattataacactgacttt
cctgggtctccagcttgcaaatggcaggctatggaacttctcagcccacaatcacccttc
agggtagaccgaggagagcacctccccttcctggtgaagggagcccgatacacgctggtg
ccggctggccaagaaggagccctggccgcttggctggaggctctgcgaggacagctgggg
agaaggggagctgtggtcagtatgatggatgctgaggggctggagaggagcagccctgac
tgcgccatggggctcagcgacggggaatggcagttggtgctgaacgtctgggggaaggtg
gaggctgacatcccaggccatgggcaggaagtcctcatcaggctctttaagggtcaccca
gagactctggagaagtttgacaagttcaagcacctgaagtcagaggacgagatgaaggcg
tctgaggacttaaagaagcatggtgccaccgtgctcaccgccctgggtggcatccttaag
aagaaggggcatcatgaggcagagattaagcccctggcacagtcgcatgccaccaagcac
aagatccccgtgaagtacctggagttcatctcggaatgcatcatccaggttctgcagagc
aagcatcccggggactttggtgctgatgcccagggggccatgaacaaggccctggagctg
ttccggaaggacatggcctccaactacaaggagctgggcttccagggctag