GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:48:46 Sequence gi568815586r:66037963_66269951 : 231989 bp : 39.93% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 705 700 6 1.05 1.04 Term - 2804 2680 125 0 2 102 43 52 0.409 -0.33 1.03 Intr - 6588 6439 150 1 0 72 37 93 0.427 1.81 1.02 Intr - 7008 6833 176 2 2 37 108 206 0.388 16.16 1.01 Init - 15612 15605 8 0 2 59 86 0 0.018 -2.66 1.00 Prom - 31692 31653 40 -2.95 2.00 Prom + 32410 32449 40 -2.95 2.01 Init + 38033 38087 55 1 1 29 91 50 0.561 0.90 2.02 Intr + 38203 38309 107 2 2 137 100 126 0.820 17.71 2.03 Term + 56042 56140 99 1 0 107 47 52 0.006 0.15 2.04 PlyA + 56229 56234 6 1.05 3.02 PlyA - 56440 56435 6 1.05 3.01 Sngl - 59226 57964 1263 0 0 70 48 430 0.965 32.82 3.00 Prom - 60479 60440 40 -6.15 4.00 Prom + 67613 67652 40 -1.65 4.01 Init + 89085 89157 73 2 1 82 75 73 0.944 6.68 4.02 Intr + 92099 92235 137 0 2 103 68 47 0.615 3.57 4.03 Intr + 92474 92724 251 1 2 -23 93 212 0.417 5.91 4.04 Intr + 92957 93013 57 2 0 49 96 94 0.643 3.38 4.05 Term + 121328 121472 145 2 1 42 42 113 0.009 -1.50 4.06 PlyA + 121602 121607 6 1.05 5.00 Prom + 122359 122398 40 -7.55 5.01 Init + 125540 125714 175 1 1 72 100 69 0.754 6.06 5.02 Intr + 131096 131297 202 0 1 86 10 184 0.553 7.82 5.03 Term + 131797 132211 415 1 1 26 45 306 0.942 13.85 5.04 PlyA + 133939 133944 6 1.05 6.02 PlyA - 137661 137656 6 1.05 6.01 Sngl - 151546 151181 366 1 0 91 48 457 0.930 35.84 6.00 Prom - 152396 152357 40 -7.55 7.00 Prom + 153494 153533 40 -3.15 7.01 Init + 153727 153780 54 0 0 86 91 48 0.283 6.24 7.02 Intr + 163814 163936 123 1 0 42 60 78 0.074 0.26 7.03 Intr + 172185 172239 55 2 1 70 86 83 0.418 3.93 7.04 Intr + 173484 173635 152 1 2 86 80 51 0.527 3.06 7.05 Intr + 190339 190462 124 0 1 41 84 81 0.400 2.24 7.06 Term + 192299 192477 179 0 2 120 48 76 0.749 3.67 7.07 PlyA + 193391 193396 6 -0.45 8.02 PlyA - 193410 193405 6 1.05 8.01 Sngl - 197325 196111 1215 0 0 83 36 1139 0.984 103.75 8.00 Prom - 199475 199436 40 -9.15 9.00 Prom + 199858 199897 40 -8.55 9.01 Init + 200566 200711 146 0 2 32 81 164 0.192 9.74 9.02 Intr + 206524 206722 199 1 1 43 68 138 0.211 5.73 9.03 Intr + 206986 207048 63 0 0 93 99 38 0.610 3.40 9.04 Intr + 209733 210092 360 2 0 109 94 170 0.540 14.19 9.05 Term + 210556 210648 93 0 0 73 28 120 0.524 1.35 9.06 PlyA + 212144 212149 6 1.05 10.00 Prom + 213612 213651 40 -7.55 10.01 Init + 213999 214150 152 2 2 67 74 152 0.956 11.26 10.02 Intr + 214635 214707 73 0 1 63 92 14 0.738 -2.21 10.03 Intr + 219782 219877 96 1 0 111 72 21 0.732 2.09 10.04 Intr + 220173 220334 162 2 0 78 14 111 0.493 1.95 10.05 Term + 222231 222290 60 2 0 95 42 108 0.447 3.73 10.06 PlyA + 222392 222397 6 1.05 11.03 PlyA - 223496 223491 6 1.05 11.02 Term - 229240 228980 261 1 0 96 42 96 0.329 0.34 11.01 Intr - 231193 231074 120 1 0 51 75 114 0.540 6.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_1|152_aa MVSFDETHRKALISGLDRQLFSHSPAGSSAQGVEDESRPLMLAATSGTLVAAKSKALCKT PIWAEKHPYGSMAVKMIQARACPAVVGMSFLGPEASCCSAADGYHFASAPTVLNHITPEA SNSGPGCRHQPHIFPVPSSLSFVVQQHQIAVF >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_1|459_bp atggtaagttttgatgaaacccacaggaaagctctcatcagcggcttggacagacagctc ttctctcacagccctgccggatcatctgcacagggagtggaggatgaatcacgtcctctt atgcttgcagctaccagcggtactctggtggcagcaaagtccaaggctctctgtaagact ccaatatgggctgagaaacacccttatgggagtatggctgttaaaatgatacaggcaagg gcctgcccagcagttgttggaatgtctttcctggggccagaagcatcttgttgttctgct gcagatggttaccactttgcctcagcccccacagttcttaatcacataaccccagaagcc tcaaattctggccctggatgccggcatcagcctcatatttttccagtcccttccagcctt agctttgtggtgcagcaacaccaaattgctgttttctga >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_2|86_aa MTLKKPLPTTAATCRTKAGLTMDLALAHSGPETSEEGDSWKCVAEETGEAGVAKESKWYK ERISSSERHGFKLYLEGILLSKYWTI >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_2|261_bp atgactctcaaaaagccattaccaactacagcagccacttgcaggaccaaagcaggtctc accatggatctcgcactggcccactcaggcccagaaacaagtgaagagggagattcttgg aaatgtgttgcagaagagactggggaagctggggtggcaaaggaatctaaatggtataag gaaagaatatccagctctgaaagacatggattcaaactctatttagaagggatcctgctg tccaaatattggacaatttga >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_3|420_aa MDKFLDTYTLPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPEGFTVEFYQRYKEEL VPFFLKLFQSIEKEGILPNSFYEASIILIPKPDRDTTKKENFRPISLMNIDAKICNKILA NRIQQHIEKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTNDKNHMIISIDAEKAFDKI QWPFMPKTLNKLGIDGAYLKIIRAVYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQEKEIKSIQLGKEEVKLSLFADDVIVYLENPIVSAQNLLKLISNFSKV SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKCLGIQLTRDMKDLFKENYKPL LNEIKEDTNKWKNIPCSWIGRINIVKMAILPKVIYRFNDIPIKLPMTFFTELEKTTLKFI >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_3|1263_bp atggataaattcctcgacacatacaccctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa aaaagtccaggaccagagggattcacagtcgaattctaccagaggtacaaggaggagctg gtaccattctttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagcctgacagagacacaacaaaaaaagag aattttagaccaatatccctgatgaacatcgatgccaaaatctgcaataaaatactggca aaccgaatccagcagcacatcgagaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaacatacgcaaatcaataaatgtaatccagcatataaacaga accaatgacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caatggcccttcatgccaaaaactctcaataaattaggtattgatggggcgtatctcaaa ataataagagctgtttatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttaaaaactggcacaagacagggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggcaatcaggcaggagaaagaaataaagagtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagatgatgtgattgtatatcta gaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaa acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatgc ctaggaatccaacttacaagggatatgaaggacctcttcaaggagaactacaaaccactg ctcaatgaaataaaagaggacacaaacaaatggaagaacattccatgctcatggatagga agaatcaatattgtgaaaatggccatactgcccaaggtaatttatagattcaatgacatc cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata tga >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_4|220_aa MAIIRKWKTSISEDVKKLKYSNTAGTNYVCKESKLSLRQCLHICCPYAWQLLFSRLAPSF ATSERTPLTSESGRQTTLRAEVNFSGSERQGWAPLYPVHSEAGGPPPKRARGTLALNKAC ISGLELKYSPASVKKRNSPRVAESSPKSHVSLDRCRSRIWRMLTGVRSRLGPRGSQLPCH EDTEAALWRGSCGKELRPPVSSHGNEISQKQILQPSQAFM >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_4|663_bp atggctataatccgaaaatggaaaacaagtattagtgaggatgtgaagaaactcaaatac tcaaacactgcaggcaccaactacgtctgtaaagagtccaaactatctctacgtcagtgc cttcatatttgctgtccctatgcctggcaactgctcttctcaaggcttgctccttcattc gctacctcagaaaggactcctctgacctccgaaagcgggcggcaaacaaccttgagggcc gaagttaacttcagcgggagtgaacgacaggggtgggctccactttatccagtgcactcg gaagccggagggcccccaccaaaaagagcaaggggaaccctcgccctcaacaaggcctgc atctccggactggagctcaagtatagcccagcgagtgtcaagaaacgaaattctccaagg gtggcggaatcaagccccaagtcccatgtgtcactggaccggtgtagaagtcggatttgg agaatgctgacgggtgtccggtcaaggttaggccccagaggaagtcagctgccatgtcat gaggacactgaagcagccctatggagaggttcatgtggcaaggaactgaggcctccagtc agtagccatgggaatgaaatttcccaaaagcagatcctccagcccagtcaagccttcatg tga >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_5|263_aa MAERGNAKQKGERALIKPSDLVRTYYHKNSMKVTAPVIKLPPTWSLPQHVGMMGMTTQEQ VLGTKILSTSELSSSFLFQNKFQLPKQAESLGLFRRSNDQTRAGNGFHVDEAEHFKFKNF CPLATTRWPLSLGSRKSASRGRAVQTSGWEALERGQRTHSDVHGGGGHAAAIVEVVLDRG ARVPGVGVSHDGNSTPTGPGPLTATASSPLPRGRAQPQQAVRRRRANTERVDFLASKIPA GRRKRGWLRVRMEEKTVYIPGVI >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_5|792_bp atggcagaaaggggaaatgccaagcaaaagggggaaagggcccttataaaaccatcagat cttgtgagaacttactatcacaagaacagcatgaaggtaactgcccctgtgattaaatta cctcccacctggtccctcccacaacacgtggggatgatgggaatgacaactcaagaacaa gttctaggcactaagattcttagcacatcagaactatcttcatctttccttttccagaac aagttccagctgcctaaacaggctgaaagtctggggctgtttcggcgatcaaatgaccaa actagagcaggcaatggcttccacgtagatgaagctgagcattttaaattcaaaaatttc tgcccattggctactacgagatggccgctctccctcggaagccggaagtcggcttctaga ggccgagcagtgcagacaagcggctgggaggcactggagaggggacaacgtacccattcg gatgtgcacggtggcggaggccacgctgctgccatagttgaagtcgtcctcgatcgagga gcgagggtaccgggggtcggggtcagccatgatggcaacagcacccctaccggtcccggt ccactaaccgcaaccgcctcctccccacttccgcgaggaagagcccagccccagcaagcc gtgcggcgtcggcgggcgaacaccgagcgagtcgattttctcgcttcaaaaattccagct gggcggagaaagcggggatggctccgagttagaatggaagagaaaacggtttacattccc ggcgtcatctga >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_6|121_aa MQGDAERAPRRAGLPRRPRPVPADSPRPRQPSAPSQLSRTAQSSPSSAGGRSNSSVCADS APRAPQFPAMARLPEPRGRGGVQVPARLRQAVTQPRKSCFHDGSRALGTRRSPVTAGAVP T >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_6|366_bp atgcagggcgacgccgagcgggccccgcggcgcgctgggctcccccgccgcccccggccg gtccccgccgactcacccaggccgcgccagcccagcgcgccgtcgcagctgtccagaaca gcgcagagctctccgagcagcgcgggcggcaggtcgaacagcagcgtgtgcgccgacagc gcgccgcgggccccacagttccccgccatggctcggctgcccgagccccggggacgaggc ggagtccaggtccctgcacgcctgcgacaggccgttacacaaccgcggaaatcctgcttc cacgacggcagcagagccctaggaacacgccgttctccggtcactgcaggggcggtgccc acgtga >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_7|228_aa MPSFPVGTAKGERGEEVQILAFNCSKVFNGKTGGTKHHFSNLQQLNLTVLRMNNNHTEGE TANVTVDNVLIPEHNEKGILLKSSISFQNIIEGTRNFHKDFLIGEGEIFEVYRVEIQNLT YAVKLFKQEYPKPFTTCTTFNHARSSVAVYQGKLFQSFWLYLKASFILTCRSRKWAQKHR FTCRRFLREGSQGQQMKRGKEKGRIGQLERLSWDIIVIEASVHAMGKH >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_7|687_bp atgccgagttttcctgtgggcacagcgaagggagaaagaggggaagaggtgcagatatta gcttttaattgctcaaaagtgttcaatgggaagacagggggaacaaaacaccatttctca aaccttcaacagctgaatctaactgtattacgaatgaataataaccacactgaaggggaa acagccaatgtcaccgtggataatgttcttattcctgaacataatgaaaaaggaatactg cttaaatcttccatcagctttcaaaatatcatagaaggaactagaaatttccacaaagac ttcctaattggagaaggagagatttttgaggtatacagagtggagattcaaaacctaaca tatgctgtcaaattatttaaacaggaatatccaaagccattcactacctgcacaacgttc aaccatgctcggtcatctgtggcagtatatcaaggtaaattattccagagcttttggtta tacctgaaagcttcttttatcctcacatgtaggtccaggaagtgggctcagaagcacaga tttacctgcaggaggtttttaagggagggctctcagggacagcaaatgaagcgaggaaag gaaaaaggcaggattgggcagctggagaggttgagttgggatattattgtcatagaggcc tcagtccatgccatggggaagcactga >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_8|404_aa MIFPSSSGNPGGSSNCRTPYRKQQSLVPAHPMAPPSPSTTSSNNNSSSSSNSGWDQLSKT NLYIRGLPPHTTDQDLVKLCQPYGKIVSTNAILHKTTNKCKGYGFVDFDSPAAAQKAVSA LKASGVQAQMAKQQEQDPTNLYISNLPLSMDKQELENMLKPFGQVISTRILRDASGTSRG VGFARTESTEKCEAVTGHFNGKFIKTPPGVSAPTEPLLCKFSDGGQKKRQNPNKYIPNGR PWHREGEVRLTGMTLTYDPTTAAIQNGFYPSPYSIATNRMITQTSITPYIVSPVSAYQVQ SPSWTQPQPYILQHPGAVLTPSMEHTMSLQPASMISPLAQQMSHLSLGSTGTYMPATSAM QGAYLPQYEHMQTTAAPVEEASGQQQVAVETSNDHSPYTFQPNK >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_8|1215_bp atgatcttccccagcagcagcggcaaccccgggggcagcagcaactgccggacgccctat cgcaagcagcagtctctggtcccagcccaccccatggcccctcccagtcccagcaccacc agcagtaataacaacagtagcagcagcagcaactcaggatgggatcagctcagcaaaacg aacctctatatccgaggactgcctccccacaccaccgaccaggacctggtgaagctctgt caaccatatgggaaaatagtctccacaaacgcaattttgcataagacaacgaacaaatgc aaaggttatggttttgtcgactttgacagccctgcagcagctcaaaaagctgtgtctgcc ctgaaggccagtggggttcaagctcaaatggcaaagcaacaggaacaagatcctaccaac ctctacatttctaatttgccactctccatggataagcaagaactagaaaatatgctcaaa ccatttggacaagttatttctacaaggatactacgtgatgccagtggtacaagtcgtggt gttggctttgctaggacggaatcaacagaaaaatgtgaagctgttactggtcattttaat ggaaaatttattaagacaccaccaggagtttctgcccccacagaacctttattgtgtaag ttttctgatggaggacagaaaaagagacagaacccaaacaaatacatccctaatggaaga ccatggcatagagaaggagaggtgagacttactggaatgacacttacttacgacccaact acagctgctatacaaaacggattttatccttcaccatacagtattgctacaaaccgaatg atcactcaaacttctattacaccctatattgtatctcctgtatctgcctaccaggtgcaa agtccttcttggacgcaacctcaaccatatattctacagcaccccggtgccgtgttaact ccctcaatggagcacaccatgtcactacagcccgcatcaatgatcagccctctggcccag cagatgagtcatctgtcactaggcagcaccggaacatacatgcctgcaacgtcagctatg caaggagcctacttgccacagtatgaacatatgcagacgacagcggctcctgttgaggag gcaagtggtcaacagcaggtggctgtcgagacgtctaatgaccattctccatataccttt caacctaataagtaa >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_9|286_aa MEACWRHQQRQGNWEGEESIISVQKLGNADIGLLAEARGIEEEIGPGKRANILLDDQFQP KLTDFAMAHFRSHLEHQSCTINMTSSSSKHLWYMPEEYIRQGKLSIKTDVYSFGIVIMEV LTGCRVVLDDPKHIQLVLNTLESTQASLYFAEDPPTSLKSFRCPSPLFLENVPSIPVEDD ESQNNNLLPSDEGLRIDRMTQKTPFECSQSEVMFLSLDKKPESKRNEEACNMPSSSCEES WFPKYIVPSQDLRPYKCESSEPPSAKNIQLDSIVWFSLLARVVGNV >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_9|861_bp atggaagcatgttggcgtcatcagcagagacagggaaattgggaaggagaggagagcata attagtgtgcagaaacttggaaatgctgacattggactcctggcagaagcaagaggcatt gaagaggagatcgggcctggaaagcgtgcaaacatccttttggatgatcagtttcaaccc aaactaactgattttgccatggcacacttccggtcccacctagaacatcagagttgtacc ataaatatgaccagcagcagcagtaaacatctgtggtacatgccagaagagtacatcaga caggggaaactttccattaaaacagatgtctacagctttggaattgtaataatggaagtt ctaacaggatgtagagtagtgttagatgatccaaaacatatccagctggttttaaatact cttgaaagtactcaagccagcttgtattttgctgaagatcctcccacatcactaaagtcc ttcaggtgtccttctcctctattcctggagaatgtaccaagtattccagtggaagatgat gaaagccagaataacaatttactaccttctgatgaaggcctgaggatagacagaatgact cagaaaactccttttgaatgcagccagtctgaggttatgtttctgagcttggacaaaaag ccagagagcaagagaaatgaggaagcttgcaacatgcccagttcttcttgtgaagaaagt tggttcccaaagtatatagttccatcccaggacttaaggccctataagtgtgagtcctca gagcctccatctgccaagaacattcagttggattccatcgtttggtttagcttgcttgca cgggttgtaggaaatgtctaa >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_10|180_aa MKKASWRRFLKRDTELREIEKSTDEALRMAIQGIGDPQQLCTTIPNARKMGHYSRSWGYS ADGNQQSSLPSWSLYRQESPSVAQASLKLLESSNPPASASQNAGITGLRHLATVITTSDF DAEKPTHNFHPCCHGHLIPPTPDLMLLALNLEPSCFEKYCLICVIDHFGKAAFIIHEIVD >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_10|543_bp atgaagaaggcttcttggagaaggtttctgaaaagagatacggaattaagagaaatagaa aagagcactgacgaagcactcaggatggctattcaaggaattggggatccacagcagctg tgcacaaccatccctaatgccagaaaaatggggcactattctaggagctggggctacagt gcagatggaaaccagcaaagctcactgccctcatggagcttatatagacaggagtctccc tctgttgctcaggctagtctcaaactcctggaatcaagcaatccacctgcttcagcctct caaaatgctgggattacagggctcaggcacttggccacagtgataaccacatcagacttc gatgcagaaaagcccacacataactttcatccctgttgccatggccacctaataccccct acaccagacttaatgcttctggcactgaatctggaacccagttgttttgaaaagtattgt cttatttgtgtcattgaccactttggtaaagctgctttcatcatccacgaaattgttgat taa >gi568815586r:66037963_66269951|GENSCAN_predicted_peptide_11|126_aa QRRGKTHYLNDKNHNYFCTNLIEAQRMAASEETEEEEMKKTWGYLQRECRDTQWGLCPLT QCMAHWLLSSTSPEVLHPSWRQGDPPPHSHRPPPPPTAEGEESRSGNLHIRMPLGPGSVL ARVCLF >gi568815586r:66037963_66269951|GENSCAN_predicted_CDS_11|381_bp cagcggcgaggtaaaacccattaccttaatgacaaaaatcacaattacttctgcaccaac ctaatagaagcccagagaatggcagcctctgaggaaacagaggaggaagaaatgaagaag acatggggatatttgcagagggagtgcagagacacccagtgggggctctgtcccctgact cagtgcatggcccactggcttctgagttctacctcccctgaagtgctgcaccccagctgg agacagggagatccacctcctcattcccacaggccaccgcccccacccacagcagaaggg gaagaatcaaggagtggcaacctccacatcaggatgcccttgggacctggatcagtattg gcaagagtctgtctgttttga