GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:05:30 Sequence gi568815597f:47233946_47476742 : 242797 bp : 44.81% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2597 2675 79 1 1 20 72 107 0.409 3.52 1.02 Term + 7453 7598 146 2 2 95 49 76 0.503 2.47 1.03 PlyA + 8602 8607 6 1.05 2.14 PlyA - 9447 9442 6 1.05 2.13 Term - 17977 17191 787 0 1 97 37 344 0.605 23.00 2.12 Intr - 26594 26344 251 2 2 66 93 58 0.554 0.34 2.11 Intr - 29168 28958 211 1 1 71 91 94 0.597 6.92 2.10 Intr - 35921 35690 232 0 1 73 80 209 0.862 15.43 2.09 Intr - 38248 38131 118 1 1 60 91 96 0.553 7.14 2.08 Intr - 47264 47001 264 2 0 88 69 94 0.962 5.21 2.07 Intr - 48514 48412 103 0 1 98 81 44 0.795 4.88 2.06 Intr - 53715 53606 110 0 2 70 83 75 0.969 4.28 2.05 Intr - 55640 55490 151 1 1 71 106 61 0.960 6.36 2.04 Intr - 66207 65960 248 0 2 47 106 17 0.098 -4.34 2.03 Intr - 67803 67616 188 1 2 96 86 89 0.807 8.91 2.02 Intr - 68401 68289 113 0 2 82 85 71 0.781 6.22 2.01 Init - 76374 76331 44 0 2 97 80 10 0.307 1.02 2.00 Prom - 77080 77041 40 -8.56 3.00 Prom + 78664 78703 40 -2.36 3.01 Init + 100001 100171 171 1 0 6 78 342 0.966 22.44 3.02 Intr + 134524 134670 147 0 0 94 86 129 0.991 13.73 3.03 Intr + 139010 139162 153 1 0 108 80 90 0.993 10.47 3.04 Intr + 140964 141040 77 2 2 100 65 12 0.987 -1.59 3.05 Intr + 141252 141348 97 0 1 68 100 98 0.986 9.01 3.06 Term + 142759 142800 42 0 0 130 31 75 0.993 3.16 3.07 PlyA + 143680 143685 6 1.05 4.00 Prom + 166089 166128 40 -3.26 4.01 Sngl + 182389 183330 942 0 0 92 52 1480 0.911 139.33 4.02 PlyA + 184831 184836 6 1.05 5.00 Prom + 189612 189651 40 -5.46 5.01 Init + 200594 200704 111 1 0 92 82 88 0.669 8.82 5.02 Term + 204074 205678 1605 1 0 59 47 2138 0.856 196.90 5.03 PlyA + 205929 205934 6 1.05 6.05 PlyA - 206451 206446 6 1.05 6.04 Term - 211160 210470 691 1 1 -28 44 274 0.333 4.46 6.03 Intr - 212237 212102 136 0 1 77 96 16 0.754 0.93 6.02 Intr - 215348 215209 140 1 2 99 100 38 0.777 6.21 6.01 Init - 233906 233851 56 2 2 59 73 62 0.040 2.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 100244 100354 111 1 0 57 37 91 0.925 -0.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:47233946_47476742|GENSCAN_predicted_peptide_1|74_aa MVSITEEPVRKANSQASPQAYYYYSEAPSPVSLKHAHNMVPTTLILVSVYPTKTLKNWDH IRGLSIPQHIMEAE >gi568815597f:47233946_47476742|GENSCAN_predicted_CDS_1|225_bp atggtcagcatcaccgaggaacctgttagaaaagcaaattctcaggcctcaccccaggcc tactactactactctgaggctccttctcctgtgtcccttaaacatgcccacaacatggtg ccaaccacacttatcttggtgtctgtgtaccccaccaagactctcaaaaactgggaccac atccgtggtttgtctataccccagcatatcatggaggcagagtag >gi568815597f:47233946_47476742|GENSCAN_predicted_peptide_2|939_aa MEPIYPFARPQMNTRNPKLVVTEKTIRLAYRHAKQNKKNSSCFLLGSLTADEDEEGVTLT VDRFDPGREVPECLEITPTASLPGDFLIPCKVHTQELCSREMIVHSVDDFSSALKALQCH ICSKDSLDCGKLLSLRVHITSRESLDSVEFDLHWAAVTLANNFKCTPVKPIPIIPTALAR NLSSNLNISQVQGTYKYGVFSESGNFIIVLYSMTHKEPEFYECFPCDGKIPDFRFQLLTS KETLHLFKNVEPPDKNPIRCELSAESQNAETEFFSKASKNFSIKRSSQKLSSGKMPIHDH DSGVEDEDFSPRPIPSPHPISKIQPSVPELSLVLDGNFIESNPLPTPLEMVNNENPPLIN HLEHLKPLQPQLYDEKHSPEVEAGEPSLRGIPNQLNQDKPALLRHCKTTAVEDTVQAGRQ MELVSVEAQSSPGLHMRKGVSIAVSTGASLFWNAAGEDQEPDSQMKQDDTKISSEDMNFS VDINNEVTSLPGSASSLKAVDIPSFEESNIAVEEEFNQPLSVSNSSLVVRKEPDVPVFFP SGQLAESVSMCLQTGPTGGASNNSETSEEPKIEHVMQPLLHQPSDNQKIYQDLLGQVNHL LNSSSKETEQPSTKAVIISHECTRTQNVYHTKKKTHHSRLVDKDCVLNATLKQLRSLGVK IDSPTKVKKNAHNVDHASVLACISPEAVISGLNCMSFANVGMSGLSPNGVDLSMEANAIA LKYLNENQLSQLSVTRSNQNNCDPFSLLHINTDRSTVGLSLISPNNMSFATKKYMKRYGL LQSSDNSEDEEEPPDNADSKSEYLLNQNLRSIPEQLGGQKEPSKNDHEIINCSNCESVGT NADTPVLRNITNEVLQTKAKQQLTEKPAFLVKNLKPSPAVNLRTGKAEFTQHPEKENEGD ITIFPESLQPSETLKQMNSMNSVGTFLDVKRLRQLPKLF >gi568815597f:47233946_47476742|GENSCAN_predicted_CDS_2|2820_bp atggagcctatatatccttttgcacggccccagatgaataccagaaatccaaagcttgtg gtgactgagaagaccatccgacttgcttatcgtcatgctaagcagaataaaaaaaattcg tcatgctttttacttggttctctgacagcagacgaagatgaagaaggtgtaacattgaca gtagatcgctttgatcctggtcgagaagtacctgaatgcctagaaataacccctactgct tctcttcctggggactttttgattccatgcaaagttcatactcaagaactttgttcaaga gaaatgatagttcacagtgtagatgacttcagttcagctttaaaggctctacagtgccat atatgtagcaaagattccttggactgtggtaagctgctttccctaagagttcatatcact tccagggagagtttggacagtgtggaatttgacttgcattgggcagcagtaactctagca aataactttaaatgcacacctgtgaagcccatccccattattccaacagctctggcaaga aacttgagcagtaatctgaatatttctcaagttcaagggacttataaatatggggttttt tcagaatctggaaatttcatcatagttctctattctatgacacataaggaacctgagttt tatgaatgcttcccttgtgatggcaagatacctgactttcggtttcagttgctaaccagt aaggaaacattacatcttttcaaaaatgttgaacctcctgacaaaaatccaatccgttgt gaactgagcgctgaaagccaaaatgcagaaacagagtttttcagtaaggcttccaagaat ttttcaattaagaggtcttcccaaaagttatcttctgggaagatgccaatacatgatcac gactctggtgttgaagatgaagatttttctccaagaccaattcctagtcctcatccaatt tctaagatccaaccatcagttcctgaactttcacttgtgttggatggcaatttcatagaa tcaaaccctctgcctactccattggaaatggtgaataatgaaaatcctcctttgattaac cacttggaacacttgaagccattgcaaccccagctttatgatgagaaacacagtccagaa gttgaagctggagagccttccttgagaggaataccaaatcagttaaaccaggataaacca gctcttttgagacactgcaaaacaactgctgttgaagacacagtgcaagctggaagacaa atggagttggtttctgtggaagcacagtcttcccctggcttgcacatgagaaaaggtgta agcattgctgtgagcacaggtgctagcttgttttggaatgcagcaggtgaggatcaagag cctgactctcaaatgaagcaagatgataccaaaatttccagtgaggacatgaatttttct gtcgatattaataatgaagtcacaagtcttccaggtagtgcatcttcattaaaagcagtt gatattcccagttttgaagagagcaacattgctgtggaagaagaatttaaccagccactt tctgtatccaactcttctctagttgtgagaaaagaacctgatgtacctgtgttctttcca agtggccagctggcagaaagtgtaagcatgtgtttacagactggaccaacagggggtgcc agtaacaattctgaaacatcagaggaaccaaaaattgagcatgtaatgcaacccttgctt catcaaccatcagataaccagaaaatttaccaggatttattgggtcaagtaaaccaccta ttaaatagttcctccaaggaaactgagcagccgtctaccaaagcagtaattatcagtcat gaatgcaccagaacccaaaacgtttaccatacaaagaaaaaaacacatcattcaagactg gtggacaaagattgtgtccttaatgcaactcttaagcaactaagaagccttggagtaaaa attgattctcccactaaagtgaagaaaaatgcacataacgtggatcacgccagtgtgttg gcatgcatcagcccagaagcagtgatctctggattaaactgcatgtcatttgctaatgtt ggcatgagcggcttaagccccaatggtgtggatttgagcatggaggcaaatgctatagct ctgaaatatttaaatgaaaatcagctgtcacaactgtctgtcactcgatcgaaccaaaat aattgtgacccattcagccttctccatattaatacagacagaagcacagtggggcttagt ttaatttcaccaaacaacatgtcatttgcaaccaaaaaatatatgaagagatatggactc ctacaaagcagtgacaatagtgaagatgaagaggaacctcccgacaatgcagatagcaag agtgaatatttattgaatcagaaccttaggtccatacctgaacagcttggtggtcagaaa gagccttctaagaatgaccatgaaataattaattgttctaactgtgaatctgtggggacc aacgcagatacgccagtattgagaaatattacaaatgaagttttgcagacaaaagcaaaa cagcagttgactgaaaagccagctttcttagtaaagaaccttaaaccaagtcctgcagtg aaccttcgaaccgggaaagcagagttcactcaacatcctgagaaagaaaatgaaggggac attacaatttttcctgaaagtttgcaaccttctgaaacgctaaagcagatgaatagcatg aattcagtaggcaccttcttagatgtaaaacgtctcagacagttaccaaaattattttaa >gi568815597f:47233946_47476742|GENSCAN_predicted_peptide_3|228_aa MLSRCRSGLLHVLGLSFLLQTRRPILLCSPRLMKPLVVFVLGGPGAGKGTQCARIVEKYG YTHLSAGELLRDERKNPDSQYGELIEKYIKEGKIVPVEITISLLKREMDQTMAANAQKNK FLIDGFPRNQDNLQGWNKTMDGKADVSFVLFFDCNNEICIERCLERGKSSGRSDDNRESL EKRIQTYLQSTKPIIDLYEEMGKVKKIDASKSVDEVFDEVVQIFDKEG >gi568815597f:47233946_47476742|GENSCAN_predicted_CDS_3|687_bp atgctgagccgctgccgcagcgggctgctccacgtcctgggccttagcttcctgctgcag acccgccggccgattctcctctgctctccacgtctcatgaagccgctggtcgtgttcgtc ctcggcggccccggcgccggcaaggggacccagtgcgcccgcatcgtcgagaaatatggc tacacacacctttctgcaggagagctgcttcgtgatgaaaggaagaacccagattcacag tatggtgaacttattgaaaagtacattaaagaaggaaagattgtaccagttgagataacc atcagtttattaaagagggaaatggatcagacaatggctgccaatgctcagaagaataaa ttcttgattgatgggtttccaagaaatcaagacaaccttcaaggatggaacaagaccatg gatgggaaggcagatgtatctttcgttctcttttttgactgtaataatgagatttgtatt gaacgatgtcttgagaggggaaagagtagtggtaggagtgatgacaacagagagagcttg gaaaagagaattcagacctaccttcagtcaacaaagccaattattgacttatatgaagaa atggggaaagtcaagaaaatagatgcttctaaatctgttgatgaagtttttgatgaagtt gtgcagatttttgacaaggaaggctaa >gi568815597f:47233946_47476742|GENSCAN_predicted_peptide_4|313_aa MDPPAAFSGFPALPAVAPSGPPPSPLAGAEPGREPEEAAAGRGEAAPTPAPGPGRRRRRP LQRGKPPYSYIALIAMALAHAPGRRLTLAAIYRFITERFAFYRDSPRKWQNSIRHNLTLN DCFVKVPREPGNPGKGNYWTLDPAAADMFDNGSFLRRRKRFKRAELPAHAAAAPGPPLPF PYAPYAPAPGPALLVPPPSAGPGPSPPARLFSVDSLVNLQPELAGLGAPEPPCCAAPDAA AAAFPPCAAAASPPLYSQVPDRLVLPATRPGPGPLPAEPLLALAGPAAALGPLSPGEAYL RQPGFASGLERYL >gi568815597f:47233946_47476742|GENSCAN_predicted_CDS_4|942_bp atggatccgcccgccgcgttctctggcttccctgccctgccagcggtcgcgccgtcgggg ccgccgccgtcgccgctcgcaggagccgagccagggcgggagccagaggaggcggcggct ggccgcggagaggcggcccccacgcccgcgcccggcccggggcggcggcggcggcggccc ctgcagcgcgggaagccgccctactcgtacatcgcgctcatcgccatggctctggcgcac gccccgggccgccgcctcacgctggccgccatctaccgcttcatcaccgaacgctttgcc ttctaccgcgacagcccgcgcaagtggcagaacagcatccgccacaatctcacgctcaac gactgcttcgtcaaggtgccccgcgagccgggcaacccgggcaagggcaactactggacg ctggaccccgcggccgcagacatgttcgacaacggcagcttcctgcggcgccgcaagcgc ttcaagcgcgccgagctgcccgcgcacgcggccgcggcgccagggccgccgctccccttc ccctacgcgccctacgcgcccgcgcccggccccgcgctgctggtgccgccgccttctgcc ggaccgggcccctcgccgcccgcgcgtctgttcagcgtcgacagcctggtgaacctgcag ccggagctagcggggctgggcgcccccgagccgccctgctgcgccgcgcccgacgccgca gccgcagccttcccgccctgcgctgccgccgcctccccgccactctactcgcaggtcccc gaccgcctggtactgcccgcgacgcgccccggccccggcccgctgcccgctgagcccctc ctggccttggccgggccggcagccgctctcggcccgctcagccctggggaggcctacctg aggcagccgggcttcgcgtcggggctggagcgctacctgtga >gi568815597f:47233946_47476742|GENSCAN_predicted_peptide_5|571_aa MVLEPKASGKESRFSRRERRTHAGSQALLQDAEQAEPSPAPSPPARKTHSPQRQRGRARA AAGAEPESGGGGGSGTMTLGSCCCEIMSSESSPAALSEADADIDVVGGGSGGGELPARSG PRAPRDVLPHGHEPPAEEAEADLAEDEEESGGCSDGEPRALASRGAAAAAGSPGPGAAAA RGAAGPGPGPPSGGAATRSPLVKPPYSYIALITMAILQSPKKRLTLSEICEFISGRFPYY REKFPAWQNSIRHNLSLNDCFVKIPREPGNPGKGNYWTLDPESADMFDNGSFLRRRKRFK RQPLPPPHPHPHPHPELLLRGGAAAAGDPGAFLPGFAAYGAYGYGYGLALPAYGAPPPGP APHPHPHPHAFAFAAAAAAAPCQLSVPPGRAAAPPPGPPTASVFAGAGSAPAPAPASGSG PGPGPAGLPAFLGAELGCAKAFYAASLSPPAAGTAAGLPTALLRQGLKTDAGGGAGGGGA GAGQRPSFSIDHIMGHGGGGAAPPGAGEGSPGPPFAAAAGPGGQAQVLAMLTAPALAPVA GHIRLSHPGDALLSSGSRFASKVAGLSGCHF >gi568815597f:47233946_47476742|GENSCAN_predicted_CDS_5|1716_bp atggtgctggagcccaaggcttctgggaaagagtccaggttttcgcggagggagcgacgg actcacgcgggatcgcaggcactgctgcaagacgccgaacaggctgagccgtcccccgcc ccctccccgcccgcgcgcaaaacgcactcgccccagaggcagcgcggccgagcccgagcc gctgccggagcggagccggagagtggcggcggcggcggcagcggcaccatgaccctgggc agctgctgctgcgagatcatgtcctccgagagctccccggccgcgctgtccgaggccgac gcagacatagacgtggtgggcggcggcagcggcgggggggagctcccagctcgctccggg ccccgcgccccccgggacgtgctcccccacggccacgagcctcccgcggaggaagccgag gcagacttagccgaggacgaggaggagtctggtggctgctcggacggcgagccccgcgct ctggcgtcccggggggcggcggccgcagcggggagcccggggccaggcgccgcggcggcc cgcggcgcagcggggcccgggccgggaccgccgtcggggggcgcggcgacgcggagcccg ctggtgaagccgccctactcgtacatcgcgctcatcaccatggccatcctgcagagcccc aagaagcggctgacgttgagcgagatctgcgagttcatcagcggccgcttcccctactac cgggagaagttccccgcctggcagaacagcatccgccacaacctctctctcaacgactgc ttcgtcaagatcccccgcgagccgggcaacccgggcaagggcaactactggacgctggac ccggagtcggccgacatgttcgacaacggcagcttcctgcggcgtcgcaagcgcttcaag cggcagcccctgccgccgccgcacccacacccgcaccctcacccggagctgctgctgcgt ggcggggccgcggcggcgggggatcccggcgctttcctgcccggcttcgctgcctacggc gcctacggctacggctacgggctggctctcccggcctacggcgcacccccgccggggccg gccccgcatccgcacccgcacccgcacgccttcgctttcgccgcggcagccgccgccgct ccttgccagctgtcggtacccccaggccgcgccgccgcgcctccacccggacctccgacg gcctcggtgttcgcaggcgcgggatcggccccagctcctgcgcctgcctcaggctcgggc ccgggcccgggccccgcaggcctgcccgccttcctgggcgcggagctgggctgcgccaaa gccttctacgcggcgtccctgagtcctcccgcagccggcaccgcggcgggtctgcccacc gcacttctgcgccagggcctcaagacggacgcgggcggtggtgcaggcggcgggggcgcc ggggcagggcagaggccttccttctctatagaccacatcatgggccacggtggcggcggg gcagcacccccgggcgccggcgagggctctccgggaccgccattcgcggcagccgcgggt cctgggggccaagcccaggtcttggccatgctgactgctccggccctggctcccgttgct ggccacattcgcctctcgcatcccggggacgcgctgctgtcctcagggtcccggtttgcc agcaaagtcgccggccttagtggctgccacttctga >gi568815597f:47233946_47476742|GENSCAN_predicted_peptide_6|340_aa MDEQQLIKIQRGECYARERRYIYANRQPKHTQRFGHPSMALPGKEKALGPTWKPLRSRKE VSQAPNHSWLGQPAPSELGPKLALGYISSYRSGPGLIFPAAQTICCVTPARRNAKSRTAS DFRSESAWSEKTALRRDPGLEAESRGSARNNPEGDGAECSTGALRRKPPSFPTSPAVRSL QASGAASASSGSSRSATEGGGARSALRRAAPTSPRKNPFPKLELETSVLSTGSLLVRVCA DLKEQSNSAPKNPINICFAHKRFRRMSYPARPLPIFSFLSTFLQSAKPQSVSHRDKCGLD VIFLVGPPERLPRPQPTQHFDARIGIKLGANPNWPRFSLA >gi568815597f:47233946_47476742|GENSCAN_predicted_CDS_6|1023_bp atggacgagcagcaattaattaaaattcagcgtggcgagtgctatgcaagagagaggagg tacatttatgcaaaccggcaaccaaaacatacacagagatttggccacccctccatggcc ctgccagggaaggaaaaagccctgggtcccacgtggaagcctcttagatccagaaaggaa gtgagccaagctcccaaccatagctggctggggcagccggcacccagtgaactgggacca aagctggccttaggatacatatcctcataccgttcgggtccagggctcatcttcccagct gcacaaactatctgctgtgtgacccccgccaggagaaatgcaaagtcgcggacagcgagc gacttccgctcagagtccgcgtggagcgaaaagacagcgctccgcagggacccaggactc gaggctgagagcagagggagcgccagaaacaacccagaaggtgacggtgcggagtgcagc accggggcgctgcggaggaaaccgccttccttccccactagtccagcagtccgaagcctc caggcctccggagcggcctcagcctcgagcggcagctctcgctccgctacggaaggaggc ggtgcccgctccgccctgcgccgcgccgcgcccactagcccgcggaaaaatccgttcccg aaactagagctggaaacctctgtcctgtctacaggttctctgctggtacgtgtctgcgcc gaccttaaagaacagagtaattccgctccaaaaaacccaatcaacatttgttttgctcac aaacgctttaggaggatgagttatcctgcccgaccactgcccatctttagttttctttca acgtttctccagagcgccaaacctcagagcgtcagccacagagacaaatgtgggctggat gtcatcttcctggttggccctcctgaacgcctccccaggccccagcctacacaacatttt gacgcgcgtatcggaattaagttaggcgcaaacccaaactggccccgcttttctttggct tga