GENSCAN 1.0 Date run: 3-Nov-116 Time: 09:39:45 Sequence gi568815594r:117475101_117676109 : 201009 bp : 34.78% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 735 730 6 1.05 1.05 Term - 4900 4872 29 2 2 118 38 19 0.002 -2.84 1.04 Intr - 24335 24166 170 1 2 75 82 76 0.106 4.37 1.03 Intr - 29064 28909 156 2 0 82 76 58 0.101 2.20 1.02 Intr - 41134 41044 91 2 1 67 53 84 0.052 0.93 1.01 Init - 44759 44597 163 2 1 70 72 66 0.650 3.14 1.00 Prom - 93633 93594 40 -4.25 2.03 PlyA - 93707 93702 6 1.05 2.02 Term - 100462 99998 465 1 0 -8 41 368 0.800 16.53 2.01 Init - 101009 100476 534 2 0 49 -29 557 0.489 33.00 2.00 Prom - 111808 111769 40 -5.95 3.00 Prom + 114243 114282 40 -3.55 3.01 Init + 121314 121385 72 2 0 70 93 19 0.194 1.72 3.02 Intr + 128443 128558 116 0 2 27 109 134 0.318 7.73 3.03 Intr + 176455 176575 121 1 1 93 11 133 0.151 5.68 3.04 Term + 176977 177090 114 0 0 75 48 164 0.326 8.69 3.05 PlyA + 177666 177671 6 1.05 4.02 PlyA - 177938 177933 6 1.05 4.01 Sngl - 183144 181036 2109 0 0 70 44 630 0.984 50.78 4.00 Prom - 183462 183423 40 -10.45 5.00 Prom + 183987 184026 40 -12.62 5.01 Sngl + 184243 184650 408 0 0 45 49 300 0.999 17.84 5.02 PlyA + 184850 184855 6 1.05 6.05 PlyA - 185907 185902 6 1.05 6.04 Term - 192449 192238 212 0 2 12 42 117 0.199 -4.03 6.03 Intr - 193829 193132 698 1 2 -45 86 295 0.132 6.20 6.02 Intr - 194484 194144 341 0 2 -3 77 243 0.102 7.45 6.01 Intr - 197422 197304 119 2 2 82 89 61 0.088 4.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:117475101_117676109|GENSCAN_predicted_peptide_1|202_aa MDKFLDTYTLPRLNQEEVDSLNRLKTGSEIEAIINSLPTKKVKDQTNSQLNSTRGYQEFS AELMELPSTRNAGLRSQCNKFMAGRFLGKINYSGHKDPLHKVAASTALITALSLTTTIDI VTIWAHQNKLRGTLHFLHKKKQDHVFCKDMDGAGGHYPQQTNTGCCRCGKEDSYSHGSKV LMRSDSQLTCEQIGCPLTLQAL >gi568815594r:117475101_117676109|GENSCAN_predicted_CDS_1|609_bp atggataaattcctcgacacatacaccctcccaagactaaatcaggaagaagttgattcc ctgaatagactaaaaacaggctctgaaattgaggcaataattaatagcctaccaaccaaa aaagtcaaagaccagacaaattcacagttgaattctaccagagggtatcaagagttcagt gcagaactaatggaactgccatcaaccagaaatgctggactcagatctcagtgcaataaa tttatggctgggagattcctgggaaagataaattactctggacacaaggaccccttgcat aaagtggctgcttcaactgctctgatcacagcactttcactgactacaactatagacata gtaactatttgggcacaccagaataaactaagaggcaccctccacttcctccataaaaag aaacaagatcatgtcttttgcaaggacatggatggagctggaggccattatcctcaacaa acaaacacaggatgctgtagatgcggcaaggaagacagctatagccatggctccaaagta cttatgagatcagactcacaattaacatgtgaacaaataggttgcccactcaccctgcag gccttgtga >gi568815594r:117475101_117676109|GENSCAN_predicted_peptide_2|332_aa MRARSMDRAAVARVGAVVSASVCALVAGVVLAQYIFTLKRKTGRKTKIIEMMPEFQKSSV RIKNLTRVEEMICGLIKGGAAKLQIITDFDMTLSRFSYKGKRCPTCHNSIDNCQLITDEC RKKLLQLKKKYYAIEVDPVLTVEEKYPYMVEWYTKSHGLLVQQALPKAKLKEIVAESDKD MKNFFDKLQQHSIPVFIFSAGISDVLEEVIRQAGVYHPNVKVVSNFMDFDETGVLKGFKG VLIHIFNKHDGALRNTEYFNQLKDNSNIILLGDSQGDLRMADGVANVEHILKVGYLNDRV DELLEKYMDSYDIVLVQDESLEVANSILQKIL >gi568815594r:117475101_117676109|GENSCAN_predicted_CDS_2|999_bp atgagggcccggtccatggaccgcgcggccgtggcgagggtgggcgcggtagtgagcgcc agcgtgtgcgccctggtggcgggggtggtgctggctcagtacatattcaccttgaagagg aagacggggcggaagaccaagatcatcgagatgatgccagaattccagaaaagttcagtt cgaatcaagaaccttacaagagtagaagaaatgatctgtggtcttatcaaaggaggagct gccaaacttcagataataacggactttgatatgacactcagtagattttcatacaaaggg aaaagatgcccaacatgtcataatagcattgacaactgtcagctgattacagatgaatgt agaaaaaagttattgcaactaaagaaaaaatattacgctattgaagttgatcctgttctt actgtagaagagaagtacccttatatggtggaatggtatactaaatcacatggtttgctt gttcaacaagctttaccaaaagctaaacttaaagaaattgtggcagaatctgacaaagat atgaagaatttctttgataagctccaacaacatagtatcccggtgttcatattttcggct ggaatcagcgatgtactagaggaggttattcgtcaagctggtgtttatcatccaaatgtc aaagttgtgtccaattttatggattttgatgaaactggggtgctcaaaggatttaaagga gttctaattcatatatttaacaaacatgatggtgccttgaggaatacagaatatttcaat caactaaaagacaatagtaacataattcttctgggagactcccaaggagacttaagaatg gcagatggagtggccaatgttgagcacattctgaaagttggatatctaaatgacagagtg gatgagcttttagaaaagtacatggactcttatgatattgttttagtacaagacgaatca ttagaagtagccaactctattttacagaagattctataa >gi568815594r:117475101_117676109|GENSCAN_predicted_peptide_3|140_aa MRLIWKPSFSNALQCSFVHLEHSTASEAIGQCQSSAAKPRRSGKESVRQPWARVPGALEV AASLGAVGVGTPVEAMSENLSVASGVSSVEKHGTMADGSNQPRPAGSSSRDHGSSNAAVL SVTFGSTIPEKCTAATSQSA >gi568815594r:117475101_117676109|GENSCAN_predicted_CDS_3|423_bp atgagactaatttggaaaccgtcattttcaaatgcacttcagtgcagttttgttcatttg gaacattccactgcttctgaggcgatcgggcagtgtcagtcttcagctgctaagccgaga agatctgggaaggagtcagtcagacagccttgggccagagttccaggggctctggaagta gctgccagcttgggggcagtaggagtagggacccctgtggaagcaatgtcagagaatttg tcagttgcctctggggtctccagtgtagagaaacatggaaccatggctgatgggagtaat caaccaaggcctgcaggcagtagcagcagggaccatggcagcagcaatgcagcagtcctg tcagttacctttgggagcaccatcccagagaaatgcacagctgccaccagccagagtgct tag >gi568815594r:117475101_117676109|GENSCAN_predicted_peptide_4|702_aa MDKFLDTYTLPRLNQEEVDSLNKPIRGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL VAFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKIVA NRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINIIQHINRTKDKNHMIISIDAEKAFDKI QQTFMLKTLNKLGIDGTYLKIITAIYDKPTANIILNAQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQKKEIKGIQLGKAEVKLSLFADEMIVYLENPIVSAQSLLKLISNFSKV SEYKINVQKSEGFLYTNNRQTESQIMTELPFTIASKRINYLGIQLTRDVKDLFKENYKPL LNEIKEDTKKWENIPCSWAGRINIVKRAILPKVIYTFNAIPIKLPMTFFTELEKTTLKFI WNQKRAHISESILSQKNKAGGTTLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIT THIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAIRRKLKLDPFLTPYTKINSRWIKDLN VRPKTIKTLEENLGITIQDTGIGKDFMSKTPKAMATKAKMDKRDLIKLKIFCTAKETTIR VNMQPTKWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRNFSKEDIYAA KRHMKKCSPSLAIREMQIKTTVRYHLTPVRMAIIKKSGNNRC >gi568815594r:117475101_117676109|GENSCAN_predicted_CDS_4|2109_bp atggataaattccttgacacatacaccctcccaagactaaaccaggaagaagttgactct ctgaataaaccaataagaggctctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg gtagcattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaaccaaaaaagag aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatagtggca aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaatatacgcaaatcaataaatataatccagcatataaacaga accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaaaccttcatgctaaaaactctcaataaattaggtattgatgggacatatctcaaa ataataacagctatctatgacaaacccacagccaatatcatactgaatgcgcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggcaattaggcagaagaaggaaataaagggtatt caattaggaaaagcagaagtcaaattgtccctgtttgcagatgagatgattgtatatcta gaaaaccccattgtctcagcccaaagtctccttaagctgataagcaacttcagcaaagtc tcagaatacaaaatcaatgtacaaaaatcagaaggattcttatacaccaataacagacaa acagagagccaaatcatgactgaactcccattcacaattgcttcaaagagaataaactac ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg ctcaatgaaataaaagaggatacaaagaagtgggagaacattccatgctcatgggcagga agaatcaatatcgtgaaaagggccatactgcccaaggtaatttatactttcaatgccatc cccattaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata tggaaccaaaaaagagcccacatctctgagtcaatcctaagccaaaagaacaaagctgga ggcaccacgctacctgacttcaaactatattacaaggctacagtaaccaaaacagcatgg tactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacg acacatatctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaag gattccctatttaataaatggtgctgggaaaactggctagccatacgtagaaagctgaaa ctggatcccttccttacaccttatacaaaaatcaattcaagatggattaaagacttaaac gttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacaca ggcataggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaatg gacaaacgggatctaattaaactaaagatcttctgcacagcaaaagaaactaccatcaga gtaaacatgcaacctacaaaatgggagaaaattttcacaacctactcatctgacaaaggg ctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaacccc atcaaaaagtgggcaaaggatatgaacagaaacttctcaaaagaagacatttatgcagcc aaaagacacatgaaaaaatgctcaccatcactggccatcagagaaatgcaaatcaaaacc acagtgagataccatctcacaccggttagaatggcaatcattaaaaagtcaggaaataac aggtgctag >gi568815594r:117475101_117676109|GENSCAN_predicted_peptide_5|135_aa MDLKAKARELREECRSLRSRCDQLEERVSVMEDEMNEMKREGKFREKRIKRNEQSLQGIW DYVKRPNLHLIGVPESDGENGTKLENTLQDIIQENFPNLERQANIQIQEIQRMPQRYSSR RATPKHIIVRFTKLK >gi568815594r:117475101_117676109|GENSCAN_predicted_CDS_5|408_bp atggacctgaaagccaaggctcgagaactacgtgaagaatgcagaagcctcaggagcaga tgcgatcaactggaagaaagggtatcagtgatggaagatgaaatgaatgaaatgaagcga gaagggaagtttagagaaaaaagaataaaaagaaatgaacaaagcctccaaggaatatgg gactatgtgaaaagaccaaatctacatctgattggtgtacctgaaagtgacggggagaat ggaaccaagttggaaaacactctgcaggatattatccaggagaacttccccaacctagaa aggcaggccaacattcagattcaggaaatacagagaatgccacaaagatactcctcgaga agagcaactccaaaacacataattgtcagattcacaaagttgaaatga >gi568815594r:117475101_117676109|GENSCAN_predicted_peptide_6|456_aa XYWFVFSSTAELKMSTAAPQSECYKQCSSSSASRSSSHYQINKIDRPLARLIKKKGEKNQ IDAIKKDKGDITTNATDIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESL NRPITGAEIVAIINGLSTKKSPGPDGFTAEFYQRTKDKNHMIISIDAEKAFDKIQQPFIV KTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKAGTRQGCPLSPFLFNIVLEV LARAIRQEKEIKGIQLGKEEAKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKIN VQKSQAFLYTNNRQTQSQIMSELPFTIASKRINYLGIQLTRDVKDLFKENYKPLLNEIKE DTKKWENIPCSWVGRFNYRENGHNAQGLISRIYNELKQIYKEKTNNPIKKWAKDMNSHFS EEDIYAAKKYMKKCSPSLAVREMQIKTTMRYHLTPV >gi568815594r:117475101_117676109|GENSCAN_predicted_CDS_6|1371_bp ngttattggtttgtcttttcatccacagcagagctcaaaatgagcacagctgccccacag agtgagtgctataagcaatgtagcagttcatctgcttcaagatcttcttcccactaccag atcaacaaaattgatagaccactagcaagactaataaagaaaaaaggagagaagaatcaa atagatgcaataaaaaaggataaaggggatatcaccaccaacgccacagatatacaaact accatcagagaatactacaaacacctctacgcaaataaactagaaaatctagaagaaatg gataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatctctg aatagaccaataacaggagctgaaattgtggcaataatcaatggcttatcaaccaaaaaa agtccaggaccagatggattcacagccgaattctaccagagaaccaaagacaaaaaccac atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatagta aaaactctcaataaattaggtattgatgggacgtatctcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa gctggcacaagacagggatgccctctctcaccattcctattcaacatagtgttggaagtt ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa gccaaattgtccctgtttgcggacgacatgattgtatatctagaaaaccccattgtctca gcccaaaatctcctgaagttgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaataacagacaaacacagagtcaaatcatg agtgaactcccattcacaattgcttcaaagagaataaactacctaggaatccaacttaca agggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagag gatacaaagaaatgggagaacattccatgctcatgggtaggaagattcaattatcgtgaa aatggccataatgcccaagggctaatatccagaatctacaatgaactcaaacaaatttac aaggaaaaaacgaacaaccccatcaaaaagtgggcgaaggatatgaacagccacttctca gaagaagacatttatgcagccaaaaaatacatgaaaaaatgctcaccatcactcgccgtc agagaaatgcaaatcaaaaccacaatgagatatcatctcacaccagtgtga