GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:53:15 Sequence gi568815595r:55370095_55580920 : 210826 bp : 43.63% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1222 1227 6 0 0 95 92 5 0.348 2.26 1.02 Intr + 19476 19638 163 2 1 73 71 52 0.033 1.55 1.03 Intr + 19721 19997 277 0 1 62 56 218 0.022 12.58 1.04 Term + 20007 20670 664 0 1 -55 43 250 0.302 -0.46 1.05 PlyA + 20898 20903 6 1.05 2.00 Prom + 21067 21106 40 -4.96 2.01 Sngl + 22312 23421 1110 0 0 43 54 365 0.766 25.45 2.02 PlyA + 23503 23508 6 1.05 3.00 Prom + 57373 57412 40 -1.76 3.01 Init + 58356 58416 61 2 1 60 81 61 0.721 4.01 3.02 Intr + 62106 62188 83 1 2 81 94 48 0.278 4.06 3.03 Term + 69777 69887 111 2 0 82 53 58 0.147 0.26 3.04 PlyA + 72291 72296 6 1.05 4.00 Prom + 89209 89248 40 -3.66 4.01 Init + 91171 91224 54 0 0 83 64 87 0.910 7.08 4.02 Term + 95019 95084 66 2 0 68 40 58 0.616 -3.06 4.03 PlyA + 95122 95127 6 1.05 5.13 PlyA - 96566 96561 6 1.05 5.12 Term - 100456 99998 459 1 0 93 40 988 0.998 89.59 5.11 Intr - 104535 104243 293 1 2 119 96 653 0.998 66.05 5.10 Intr - 105989 105963 27 0 0 104 106 0 0.735 1.49 5.09 Intr - 109470 109220 251 2 2 119 63 155 0.962 13.18 5.08 Intr - 110824 110691 134 1 2 107 103 -1 0.689 2.74 5.07 Intr - 115037 114923 115 1 1 69 10 101 0.568 0.85 5.06 Intr - 115569 115455 115 1 1 76 29 103 0.010 2.71 5.05 Intr - 116974 116886 89 0 2 103 100 -8 0.011 1.51 5.04 Intr - 117659 117595 65 2 2 77 49 27 0.010 -4.78 5.03 Intr - 118132 117840 293 2 2 38 28 248 0.011 10.65 5.02 Intr - 119047 118838 210 2 0 -20 83 167 0.049 4.28 5.01 Init - 123580 123571 10 1 1 71 110 2 0.048 1.46 5.00 Prom - 135046 135007 40 -5.26 6.00 Prom + 136890 136929 40 -2.56 6.01 Init + 137288 137290 3 1 0 75 95 0 0.543 -0.60 6.02 Intr + 137518 137598 81 0 0 116 86 10 0.707 3.43 6.03 Intr + 142530 142688 159 2 0 112 82 -3 0.736 1.68 6.04 Term + 143022 143168 147 2 0 99 45 110 0.967 5.70 6.05 PlyA + 144349 144354 6 1.05 7.00 Prom + 145938 145977 40 -5.26 7.01 Init + 151290 151425 136 2 1 95 109 18 0.069 4.69 7.02 Intr + 163701 163828 128 1 2 106 64 26 0.144 2.40 7.03 Term + 166994 167188 195 1 0 19 43 142 0.163 0.21 7.04 PlyA + 168581 168586 6 1.05 8.00 Prom + 168637 168676 40 -3.76 8.01 Sngl + 173639 174034 396 1 0 96 37 212 0.907 11.16 8.02 PlyA + 174577 174582 6 -3.44 9.05 PlyA - 175044 175039 6 1.05 9.04 Term - 176419 176333 87 1 0 57 49 91 0.167 -0.24 9.03 Intr - 177214 177065 150 1 0 61 55 92 0.164 3.56 9.02 Intr - 186572 186312 261 2 0 80 72 109 0.318 6.18 9.01 Intr - 203847 203767 81 0 0 74 97 114 0.971 10.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 115237 115155 83 1 2 63 75 140 0.974 8.64 S.002 Intr + 117896 118031 136 0 1 135 13 89 0.909 5.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:55370095_55580920|GENSCAN_predicted_peptide_1|369_aa MAPPLLISRQTGSGVDLQQTPTDLQLRVLTVRRKTNKQKGHPHQNHICMSPSTKTKEEHS SSPATEQSWMENDFDELREEGFRRSNYSELKEEVRTHGKEVKNLEKKLDEWLTRITNAEK SLKDLMELKTTAREICDECTSLSSRCDQLVSVMEDQMNEMKREEKFREKRIQRNKQSHQE IWDYVKRPNLRLIGVPESDKENGTKLENTLQDIIQENFPNLARQANIHIQEIQRTPQRYS SRRATPRHIIVTFTKVEMKEKILRAARDKGRVTHKGKPIRLTADLLAETLQERREWGPIF NILKEKNFQPEFHIPKLSFRSEGEIKYFTDKQMLRDSVITRPALKGLVKEALNMERNNRY QPLQKHAKV >gi568815595r:55370095_55580920|GENSCAN_predicted_CDS_1|1110_bp atggcacctccgctgctgatatccaggcaaacagggtctggagtggacctccagcaaact ccaacagacctgcagctgagggtcctgactgttagaaggaaaactaacaaacagaaagga catccacaccaaaaccacatctgtatgtcaccatcaacaaagaccaaagaggaacacagc tcctcaccagcaacggaacaaagctggatggagaatgactttgatgagttgagagaagaa ggcttcagacgatcaaactactctgagctaaaggaggaagttcgaacccatggtaaagaa gttaaaaaccttgaaaaaaaattagatgaatggctaactagaataaccaatgcagagaag tccttaaaggacctgatggagctgaaaaccacagcaagagaaatatgtgacgaatgcaca agtctcagtagccgatgtgatcaactggtttcagtgatggaagatcaaatgaatgaaatg aaacgagaagagaagtttagagaaaaaagaatacaaagaaacaaacaaagccaccaagaa atatgggactatgtgaaaagaccaaatctacgtctaatcggtgtacctgaaagtgacaag gagaatggaaccaagttggaaaacactctgcaggatattatccaggagaacttccccaat ctagcaaggcaggccaacattcacattcaggaaatacagagaacgccacaaagatactcc tcgagaagagcaactccaagacacataattgtcacattcaccaaagttgaaatgaaggaa aaaatattaagggcagccagagacaaaggtcgggttacccacaaagggaagcccatcaga ctaacagctgatctcttggcagaaactctacaagagagaagagagtgggggccaatattc aacattcttaaagaaaagaattttcaaccagaatttcatatccccaaactaagcttcaga agtgaaggagaaataaaatactttacagacaagcaaatgttgagagattctgtcatcacc aggcctgccctaaaagggctcgtgaaggaagcactaaacatggaaaggaacaaccggtac cagccactgcaaaaacatgccaaagtgtaa >gi568815595r:55370095_55580920|GENSCAN_predicted_peptide_2|369_aa MNIDAKILNKILANRIQQHIKKLIHHNQVGFIPGMQGWFNICKSINIIQHINRNNGKNHM ITSIDAEKAFDKIQQRFMLKTLNKLGIDRTYLKIIRAIYDKPTANIILNGQKLEAFPLKT GTRHGCPLSPLLFNIVLEVLARAIRQEKEIKGIQSGKEEVKLSLFADDMIVYLENPIVSA QNLLKLIGNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFIIASKRIKYLGIQLTR DVKDLFKENYKPLLKEIKEDTNKRKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMT FFRELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWHQNRH IDQWNRTEP >gi568815595r:55370095_55580920|GENSCAN_predicted_CDS_2|1110_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccataatcaagtgggcttcatccctgggatgcaaggctggttcaac atatgcaaatcaataaacataatccagcatataaacagaaacaacggcaaaaaccacatg attacctcaatagatgcagaaaaggcctttgacaaaattcaacaacgcttcatgctaaaa actctcaataaattaggtattgataggacgtatctcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacacggatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaattaggcaggagaaggaaataaagggtattcaatcaggaaaagaagaagtc aaactgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctcagcc caaaatctccttaagctgataggcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt gaactcccattcataattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggat acaaacaaacggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg gccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgact ttcttcagagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccac attgccaaatcaatcctaagccaaaagaacaaagctggaggcatcactctacctgacttc aaactatactacaaggctacagtaaccaaaacagcatggtactggcaccaaaacagacat atagaccaatggaacagaacagagccctga >gi568815595r:55370095_55580920|GENSCAN_predicted_peptide_3|84_aa MFTAPSFFYGPSPPKGISNAEHQEDMMYTHLKSIRECMFGPEDSRSLEAMGLPWLAAADA LRKPLLFQQKECLMTCIGPSDKHL >gi568815595r:55370095_55580920|GENSCAN_predicted_CDS_3|255_bp atgtttactgcccccagcttcttctacgggccttcacctcccaaaggaatctccaatgct gagcatcaagaagacatgatgtatacccacctgaaatccataagggagtgcatgtttggg cctgaagacagcaggtcactagaggccatggggcttccatggctggcagcagcagatgct ctaaggaagcccctcctcttccaacaaaaggagtgcctgatgacatgcatcgggccatct gacaaacatttgtga >gi568815595r:55370095_55580920|GENSCAN_predicted_peptide_4|39_aa MAEGKGEAGTFFTGQQDGGTFGMFVDVFDCHKWKDVIGI >gi568815595r:55370095_55580920|GENSCAN_predicted_CDS_4|120_bp atggcagaaggcaaaggagaagcaggcaccttcttcacagggcagcaggatggaggcaca tttggcatgtttgtagatgtttttgattgtcacaaatggaaggatgttattggcatctag >gi568815595r:55370095_55580920|GENSCAN_predicted_peptide_5|686_aa MATVPKTTGIWKRLEIPVPGTRFCQTSQSPHLRAPAKISVAGPRIAPHALGRSQMSNHAS HYPSPYYSGDFAGEPGSVRQLPSEPFRRRRCARDRPPPAGPLSRPSDPGAKRGGGQLRRP WTAGAESPPRSSELSPFPSLAPAVHSLLFALRPGKTGKREINPKRERTGYVRHLPEKKPC APDCSVAPKLQRGLNPVAPLGFLGCARSGGDFLPAPPPPRHEATFAHSLPNQPDLVSRQT LPGEFRGGNAESAQRTAGRQMGPRRPPEDQFRCAAGRCNTEGATGPAQQMGLGQLDTGLK SIGILSPGVALGMAGSAMSSKFFLVALAIFFSFAQVVIEANSWWSLGMNNPVQMSEVYII GAQPLCSQLAGLSQGQKKLCHLYQDHMQYIGEGAKTGIKECQYQFRHRRWNCSTVDNTSV FGRVMQIEKRLLASSQSSRETAFTYAVSAAGVVNAMSRACREGELSTCGCSRAARPKDLP RDWLWGGCGDNIDYGYRFAKEFVDARERERIHAKGSYESARILMNLHNNEAGRRTVYNLA DVACKCHGVSGSCSLKTCWLQLADFRKVGDALKEKYDSAAAMRLNSRGKLVQVNSRFNSP TTQDLVYIDPSPDYCVRNESTGSLGTQGRLCNKTSEGMDGCELMCCGRGYDQFKTVQTER CHCKFHWCCYVKCKKCTEIVDQFVCK >gi568815595r:55370095_55580920|GENSCAN_predicted_CDS_5|2061_bp atggccacagtccccaaaaccacgggcatctggaagcgtctagaaattccggtgcccgga acccggttttgccaaacttcgcaatctcctcacctgagggcacccgccaagatctccgtg gcaggaccacgcatcgcgcctcatgctctgggccggagccagatgagcaatcatgcatcc cactacccaagtccctactactctggggactttgcaggagagccggggtccgtgcggcag ctgcccagcgagcccttccgccgccgccgttgtgcccgggaccgcccgcctcccgccggc ccgctctcccgcccctcggacccgggcgccaaacgaggaggtgggcagcttcgccggcca tggaccgccggagctgagtcgccgccgcggagctccgagctctcgcccttccccagcctc gcgcccgcggtgcactcgctgctcttcgcccttcgccctggaaaaacggggaaaagagag atcaacccgaagagagaaagaacgggatacgtgcggcatctcccggagaaaaagccatgc gcgcctgattgttctgtggccccaaagctgcagcgggggcttaacccggtcgctccgctc ggattcctcggctgcgctcgctcgggtggcgacttcctccccgcgccccctccccctcgc catgaagccaccttcgcacattcactgccgaaccagccagatttagtttctagacagact ctgccaggggagtttcgagggggcaatgcggagagtgcgcagcgaaccgcgggccgacaa atgggccctcggaggcccccagaggaccagtttcgctgcgccgccggccgctgcaatacg gaaggggccacgggtccagcccagcaaatgggactcggccagctggacacagggttgaag tccattggaatattaagcccaggagttgctttggggatggctggaagtgcaatgtcttcc aagttcttcctagtggctttggccatatttttctccttcgcccaggttgtaattgaagcc aattcttggtggtcgctaggtatgaataaccctgttcagatgtcagaagtatatattata ggagcacagcctctctgcagccaactggcaggactttctcaaggacagaagaaactgtgc cacttgtatcaggaccacatgcagtacatcggagaaggcgcgaagacaggcatcaaagaa tgccagtatcaattccgacatcgaaggtggaactgcagcactgtggataacacctctgtt tttggcagggtgatgcagatagaaaaaagattgttagcaagctctcaaagcagccgcgag acggccttcacatacgcggtgagcgcagcaggggtggtgaacgccatgagccgggcgtgc cgcgagggcgagctgtccacctgcggctgcagccgcgccgcgcgccccaaggacctgccg cgggactggctctggggcggctgcggcgacaacatcgactatggctaccgctttgccaag gagttcgtggacgcccgcgagcgggagcgcatccacgccaagggctcctacgagagtgct cgcatcctcatgaacctgcacaacaacgaggccggccgcaggacggtgtacaacctggct gatgtggcctgcaagtgccatggggtgtccggctcatgtagcctgaagacatgctggctg cagctggcagacttccgcaaggtgggtgatgccctgaaggagaagtacgacagcgcggcg gccatgcggctcaacagccggggcaagttggtacaggtcaacagccgcttcaactcgccc accacacaagacctggtctacatcgaccccagccctgactactgcgtgcgcaatgagagc accggctcgctgggcacgcagggccgcctgtgcaacaagacgtcggagggcatggatggc tgcgagctcatgtgctgcggccgtggctacgaccagttcaagaccgtgcagacggagcgc tgccactgcaagttccactggtgctgctacgtcaagtgcaagaagtgcacggagatcgtg gaccagtttgtgtgcaagtag >gi568815595r:55370095_55580920|GENSCAN_predicted_peptide_6|129_aa MNRIHFPLLETAPPTPTSEAVNHMALKKMKNNYSECLMSLMFFQDKVAKNSHLFSCEKDF QLGSRSQGRIQCPTIAANCFWHPTLRFQRSSQQQRHFNGDIQSIVHSRLQFTTPVYNPVY NPVALRMQD >gi568815595r:55370095_55580920|GENSCAN_predicted_CDS_6|390_bp atgaatagaatccattttcccttgttggaaactgctccccccacccccaccagtgaggct gtcaaccacatggccttgaaaaagatgaaaaataattacagtgaatgtctgatgtccttg atgtttttccaagataaagtagcaaagaacagtcatttattcagttgtgagaaagatttc cagctgggcagcaggtcacaaggcagaatacaatgccccacaattgcagcaaactgcttt tggcacccaactctgcgctttcaacgcagcagccaacaacagcgccatttcaatggtgac atccagtccatagtgcattcccgtttacagtttacaactcctgtttacaaccctgtttat aatcccgttgccctgagaatgcaggactga >gi568815595r:55370095_55580920|GENSCAN_predicted_peptide_7|152_aa MEAFDLVSQAGLKACLWCKETTLEVKCYAAYFLGEEEREKDSTFPGQLCIFSLPDAEVPY NISFEFKISYYFDDAETVKPSEGQVTQGSLLNTLASFASMLGGRMGSTDTGNSLQTSGAG RPHFHKKSPLTAMLGATSNQCLQMTSLLKEQS >gi568815595r:55370095_55580920|GENSCAN_predicted_CDS_7|459_bp atggaggcatttgaccttgtttctcaggcagggctgaaggcttgcctgtggtgcaaagaa accaccttggaggtgaaatgctatgcagcatactttcttggagaagaagagagagagaaa gactccacttttccaggacagctctgcattttctcccttccagatgctgaggttccttat aacatttccttcgaattcaagatttcatattattttgatgatgcagagacagtgaagccc agcgaggggcaggtcacccagggatctctgctgaacaccttggcgtcctttgccagcatg ctgggaggccgcatgggctccaccgacaccggtaactcactgcagaccagcggggctggc cggccacatttccacaagaagtctcccctcacagccatgctgggtgccacaagcaatcaa tgtctgcaaatgacaagtttgctgaaagaacagtcttag >gi568815595r:55370095_55580920|GENSCAN_predicted_peptide_8|131_aa MDLLVPAALSLCAICFSLRPSSLVPRNYSLHVMAALATLPLSLEALPLTTNQLHLPTILN SDSQEQEPAWPISSVQGGRFTGVHPDDEVAAVGPSAQPSSNQPWLGQGSMGLSKAAQAGE ATDGDESDVGL >gi568815595r:55370095_55580920|GENSCAN_predicted_CDS_8|396_bp atggatctcctcgttcctgctgcgctgtctctgtgtgccatttgcttctctctacgtccc tcttctctggttcctcggaactacagcttgcatgtcatggctgccctggccactctccca ctctctctcgaagccttacctctcaccactaatcaactccacctccccacgatcctcaat tcagattctcaagaacaagaacctgcctggcctatttcatccgttcagggaggccgattc acaggagtccatccagatgatgaggtggctgctgttggtcccagtgctcagccctcatcc aatcaaccatggctggggcagggcagcatgggactaagcaaggctgcccaggcaggagag gctaccgatggggatgagagtgatgtgggcctttag >gi568815595r:55370095_55580920|GENSCAN_predicted_peptide_9|192_aa LFGDEEKRYIKCPHQIKAMFSIQKNLKALLVANGCQPSSFQIQIHKQRAGAFPAVVQQNS QLLSLDLIESLQARECDAVIGQVESYALCLELMGVVGASGQKVEEVGTWSVFLERLNYPD LADVEDERSQGCFLTPGDSVVVGPAGASRGQPWASAECQDSSSQNLDSNVEMDSEYDKQG NEAACQKMMSVT >gi568815595r:55370095_55580920|GENSCAN_predicted_CDS_9|579_bp ctgtttggtgatgaggagaagcgctacatcaagtgccctcatcaaataaaagctatgttc tcaattcagaaaaacttaaaggctctgttggtagcaaatggctgccagccatcatccttc cagattcagatccacaagcaaagagcaggtgcctttcctgcagtagttcagcaaaattct cagctcctctcactggatttgattgaatcactgcaggccagagaatgtgatgctgtgatt ggccaagttgagtcatatgctctatgtctggagcttatgggggtggtgggagcttctgga cagaaagtggaggaggtgggaacttggtctgtcttcctggaaaggctgaactatccagac ctggccgatgtagaggatgaaaggagccaggggtgcttcctcaccccgggagattcagtg gtggtgggcccggctggggcaagtcgaggacagccctgggcatcagcagagtgccaagac agcagctcacagaacctggattctaatgtggagatggacagtgaatatgataaacaagga aatgaggctgcatgtcagaaaatgatgtcagtcacttga