GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:01:54 Sequence gi568815596f:206059980_206262880 : 202901 bp : 42.84% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 3037 2820 218 1 2 43 82 195 0.241 12.61 1.00 Prom - 10906 10867 40 -3.45 2.05 PlyA - 11458 11453 6 1.05 2.04 Term - 24052 23999 54 1 0 104 44 56 0.083 -0.62 2.03 Intr - 25356 25340 17 1 2 96 111 9 0.124 -1.66 2.02 Intr - 26063 25922 142 2 1 92 121 87 0.194 11.41 2.01 Init - 52447 52346 102 1 0 28 86 76 0.037 1.69 2.00 Prom - 53236 53197 40 -7.35 3.17 PlyA - 53368 53363 6 1.05 3.16 Term - 56300 56191 110 0 2 82 36 132 0.484 5.09 3.15 Intr - 66632 66560 73 2 1 89 91 70 0.954 5.46 3.14 Intr - 66865 66731 135 1 0 29 87 153 0.977 9.14 3.13 Intr - 67993 67818 176 2 2 86 60 136 0.848 9.34 3.12 Intr - 70263 70109 155 2 2 72 115 99 0.999 9.79 3.11 Intr - 73126 72966 161 1 2 84 58 147 0.999 9.16 3.10 Intr - 78635 78506 130 1 1 74 47 120 0.999 6.28 3.09 Intr - 82090 81962 129 0 0 51 73 118 0.978 5.49 3.08 Intr - 82852 82707 146 1 2 84 72 200 0.767 16.16 3.07 Intr - 84153 84039 115 2 1 77 53 79 0.987 2.83 3.06 Intr - 85047 84913 135 2 0 50 76 156 0.998 9.36 3.05 Intr - 87109 86924 186 0 0 109 65 126 0.998 10.28 3.04 Intr - 87682 87552 131 1 2 46 93 111 0.986 5.97 3.03 Intr - 89946 89839 108 0 0 108 75 108 0.947 11.06 3.02 Intr - 92531 92440 92 0 2 21 32 183 0.431 4.79 3.01 Init - 93699 93639 61 0 1 32 89 21 0.334 -1.94 3.00 Prom - 94591 94552 40 -7.85 4.00 Prom + 97645 97684 40 -7.85 4.01 Init + 97899 97931 33 2 0 95 76 -19 0.676 -2.46 4.02 Intr + 99093 99302 210 2 0 66 -42 181 0.571 1.09 4.03 Intr + 99355 99567 213 0 0 50 41 187 0.840 8.19 4.04 Intr + 99980 100080 101 1 2 82 72 199 0.484 15.69 4.05 Intr + 100609 100731 123 1 0 89 98 97 0.998 9.58 4.06 Intr + 101367 101493 127 0 1 -13 64 236 0.933 10.86 4.07 Intr + 102059 102125 67 1 1 88 56 103 0.994 4.66 4.08 Intr + 102510 102635 126 1 0 78 84 98 0.954 8.13 4.09 Term + 102750 102904 155 1 2 66 38 198 0.906 9.70 4.10 PlyA + 102930 102935 6 1.05 5.09 PlyA - 104368 104363 6 1.05 5.08 Term - 110603 110389 215 0 2 58 49 131 0.310 2.61 5.07 Intr - 114905 114818 88 2 1 53 92 62 0.016 1.72 5.06 Intr - 117296 116264 1033 1 1 82 79 559 0.012 44.01 5.05 Intr - 142916 142783 134 2 2 -37 121 67 0.036 -3.98 5.04 Intr - 143742 143631 112 2 1 37 66 147 0.820 6.86 5.03 Intr - 143969 143915 55 0 1 56 103 92 0.331 4.62 5.02 Intr - 157931 157830 102 0 0 55 105 138 0.945 11.43 5.01 Init - 176505 176376 130 0 1 61 90 78 0.366 5.66 5.00 Prom - 179426 179387 40 -7.65 6.05 PlyA - 179558 179553 6 1.05 6.04 Term - 180294 180115 180 0 0 114 33 106 0.541 4.33 6.03 Intr - 193655 193499 157 1 1 34 49 113 0.027 1.09 6.02 Intr - 200025 199877 149 0 2 28 81 117 0.230 3.11 6.01 Init - 202188 202120 69 0 0 77 76 49 0.815 3.70 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:206059980_206262880|GENSCAN_predicted_peptide_1|73_aa MYEGKHIHFSEVDNKPLCSYSPKLCKQRRLNGYAFCIRHVLEDKTAPFKQCEYVAKYNSQ RCTNPIPKSEDRS >gi568815596f:206059980_206262880|GENSCAN_predicted_CDS_1|219_bp atgtatgaagggaaacatatacacttctctgaggttgacaataagcccttgtgctcatat agccccaaactgtgcaagcagaggcgactcaacggctacgccttctgtatcagacacgtt ctggaggacaagactgcccccttcaagcaatgtgaatatgtggccaagtataacagccaa cgctgcaccaaccccatccccaaatcagaggatcgtagn >gi568815596f:206059980_206262880|GENSCAN_predicted_peptide_2|104_aa MGLPSRKDLPVAYSASQPLVSWTSYRVAQGSKGKGKALAGAQGGAVAAAAELPSPLPAPL TLICQPSRGNRGEGEKKKQLKDGPQIPGQDVSRGYAYLLVVLEC >gi568815596f:206059980_206262880|GENSCAN_predicted_CDS_2|315_bp atgggattgccttccagaaaggatcttcctgtggcctattcagcaagtcagcctctggtt agttggacttcttatagggtagctcaaggctccaagggcaagggaaaggctctcgccgga gcccaggggggagcagtagcagcggcagccgagcttcctagtccattgccagcgcctctc actctgatctgtcagcccagccgagggaacaggggggaaggagaaaaaaaaaagcagctg aaagacggcccccagattccgggacaggatgtttcccgtggctatgcttacctccttgtt gttcttgagtgttga >gi568815596f:206059980_206262880|GENSCAN_predicted_peptide_3|680_aa MLRIPVRKALVGLSKSPKGCVRTTATAASNLIEVFVDGQSVMVEPGTTVLQACEKVGMQI PRFCYHERLSVAGNCRMCLVEIEKAPKDQSMMFGNDRSRFLEGKRAVEDKNIGPLVKTIM TRCIQCTRCIRFASEIAGVDDLGTTGRGNDMQVGTYIEKMFMSELSGNIIDICPVGALTS KPYAFTARPWETRKTESIDVMDAVGSNIVVSTRTGEVMRILPRMHEDINEEWISDKTRFA YDGLKRQRLTEPMVRNEKGLLTYTSWEDALSRVAGMLQSFQGKDVAAIAGGLVDAEALVA LKDLLNRVDSDTLCTEEVFPTAGAGTDLRSNYLLNTTIAGVEEADVVLLVGTNPRFEAPL FNARIRKSWLHNDLKVALIGSPVDLTYTYDHLGDSPKILQDIASGSHPFSQVLKEAKKPM VVLGSSALQRNDGAAILAAVSSIAQKIRMTSGVTGDWKVMNILHRIASQVAALDLGYKPG VEAIRKNPPKVLFLLGADGGCITRQDLPKDCFIIYQGHHGDVGAPIADVILPGAAYTEKS ATYVNTEGRAQQTKVAVTPPGLAREDWKIIRALSEIAGMTLPYDTLDQVRNRLEEVSPNL VRYDDIEGANYFQQANELSKLVNQQLLADPLVPPQLTIKDFYMTGEVTEINEALAENPGL VNKSRYEDGWLIKMTLSNPS >gi568815596f:206059980_206262880|GENSCAN_predicted_CDS_3|2043_bp atgttaaggatacctgtaagaaaggccttagtaggcctttctaagtctcctaaaggatgt gttcgaacaactgccacagcagcaagcaacttgattgaagtatttgttgatggtcagtct gtcatggtggaaccgggaacgaccgtcctccaagcttgtgagaaggttggcatgcagatc cctcgattctgttatcatgaaaggttgtctgttgctggaaactgcaggatgtgccttgtt gaaattgagaaagcccctaaggaccagtccatgatgtttggaaatgataggagccgattt ttagaggggaagcgtgctgtggaagacaagaacattgggccattggtaaagaccatcatg acaagatgtatacagtgtactcgctgcatcaggtttgcaagtgagattgcaggagtagat gatttgggaacaacaggcagaggaaatgatatgcaagttggcacatacattgaaaagatg ttcatgtctgaactgtctgggaatatcattgatatctgccctgtaggtgccctaacctct aagccctatgcctttactgcccggccttgggaaacaagaaagacagaatccattgatgta atggatgcggttggaagtaatattgtggttagcacaagaactggagaagtgatgaggatt ttgccacgtatgcatgaggacatcaatgaagagtggatctctgataaaaccagatttgcc tatgatgggctaaaacgtcaaagacttaccgagccaatggtcagaaatgaaaaagggctt ttaacctatacttcttgggaggatgcgctctctcgcgtagctggaatgttgcagagtttt caaggcaaagatgtggcagcaattgcaggtggcttggtggatgctgaagccctggtagct ctcaaagatttgcttaatagagtggactctgacaccttatgcactgaagaggtcttcccc actgcaggagctggcacagatttgcgttccaattatcttcttaatactacaattgctggt gtggaagaggcagatgttgttcttctggttggtacaaacccacgttttgaggcaccactg tttaatgctagaattcgaaagagctggctgcataatgacttaaaagtggcccttataggc agtccagtggacctcacttacacatatgaccacctgggagactcccccaaaattcttcaa gacattgcttcgggaagccatccatttagccaggtcctaaaggaagctaaaaaaccaatg gtggttttaggcagttctgcactccaaagaaatgatggagcagcaattcttgcagctgtt tctagcattgcacaaaagattcggatgactagtggtgttactggtgattggaaagttatg aatatccttcataggattgcaagtcaagtagctgctttggaccttggctataagcctggg gtggaagcaattcggaagaaccctcccaaggtgctgtttctcctgggagcagatggaggt tgtatcacacgacaggatttgccaaaggattgtttcattatttatcaaggacatcatggt gatgttggggctcccatagctgatgttattctcccaggagctgcttacacagagaagtct gctacatatgtcaacactgagggtagagctcagcagactaaggtagcagtgacacctcct ggcttggcaagagaagactggaaaattataagagcactctctgagattgctggaatgact cttccatatgatactctggatcaagtaaggaacagattggaagaagtctctcctaatctt gttcgatatgatgatattgaaggggctaattacttccagcaagcaaatgagctctcaaag ctagtgaaccagcagcttcttgctgacccacttgttccacctcagctaactataaaagac ttctacatgacaggagaagtaaccgaaattaatgaagctcttgcagaaaatccaggactt gtaaacaaatctcgttatgaagatggttggctgatcaagatgacactgagtaacccttca taa >gi568815596f:206059980_206262880|GENSCAN_predicted_peptide_4|384_aa MDPSIYFLPSKREGHLQAYIRDSVPEDPLILIFFSASSDQRPWLKAPHLPALRGFRGGGA GGGAQSRTTEDYVAWAKGNSPTSPSPRSRGGCSAKLSGPRRPPRRPGRLFNMAASANSVS RAWRTESPEGLESLHSTSLSAISVSRPEGGAELSDTADTMGFGDLKSPAGLQVLNDYLAD KSYIEGYVPSQADVAVFEAVSSPPPADLCHALRWYNHIKSYEKEKASLPGVKKALGKYGP ADVEDTTGSGATDSKDDDDIDLFGSDDEEESEEAKRLREERLAQYESKKAKKPALVAKSS ILLDVKPWDDETDMAKLEECVRSIQADGLVWGSSKLVPVGYGIKKLQIQCVVEDDKVGTD MLEEQITAFEDYVQSMDVAAFNKI >gi568815596f:206059980_206262880|GENSCAN_predicted_CDS_4|1155_bp atggatccatccatttacttcttgccttccaagagggaaggtcacctgcaagcctacatt cgtgacagcgttcccgaggaccccctgatcctcatcttcttttctgcttcatcagaccag agaccgtggctaaaagccccccatctgccggctctcaggggcttccgaggcggcggggcg ggaggcggcgcccagtcaaggacaacagaagactacgtcgcgtgggccaaaggaaacagt cccacctcaccttctccccggagccgcggaggctgttctgctaaactgtctggaccacga cgaccccctaggaggccgggtcgcttattcaatatggcggcctcggctaactctgtcagc cgggcctggagaacggaaagcccggagggactagaatccttgcattcgacaagtttgtcg gcaatttccgtcagccgaccagagggcggagctgagctctcggatacagccgacaccatg ggtttcggagacctgaaaagccctgccggcctccaggtgctcaacgattacctggcggac aagagctacatcgaggggtatgtgccatcacaagcagatgtggcagtatttgaagccgtg tccagcccaccgcctgccgacttgtgtcatgccctacgttggtataatcacatcaagtct tacgaaaaggaaaaggccagcctgccaggagtgaagaaagctttgggcaaatatggtcct gccgatgtggaagacactacaggaagtggagctacagatagtaaagatgatgatgacatt gacctctttggatctgatgatgaggaggaaagtgaagaagcaaagaggctaagggaagaa cgtcttgcacaatatgaatcaaagaaagccaaaaaacctgcacttgttgccaagtcttcc atcttactagatgtgaaaccttgggatgatgagacagatatggcgaaattagaggagtgc gtcagaagcattcaagcagacggcttagtctggggctcatctaaactagttccagtggga tacggaattaagaaacttcaaatacagtgtgtagttgaagatgataaagttggaacagat atgctggaggagcagatcactgcttttgaggactatgtgcagtccatggatgtggctgct ttcaacaagatctaa >gi568815596f:206059980_206262880|GENSCAN_predicted_peptide_5|622_aa MSVHRGEVPCTGTTASPLEEATLSELKTVLKSFLSQSQVLKLEEKSLNHKDSDRAQSPEN FGHALSRSSHRHRQLQRVPYIIDEGTGADCTKTLICSLDQIGELGKMAYLQDCRFTTRAR TQEEPDGSDARDKERKRAGVGTVDGGVTGFEGLAAAEVHSATCTETLRGAEFSSFVVRFL HSPFSKVMEDLEETLFEEFENYSYDLDYYSLESDLEEKVQLGVVHWVSLVLYCLAFVLGI PGNAIVIWFTGFKWKKTVTTLWFLNLAIADFIFLLFLPLYISYVAMNFHWPFGIWLCKAN SFTAQLNMFASVFFLTVISLDHYIHLIHPVLSHRHRTLKNSLIVIIFIWLLASLIGGPAL YFRDTVEFNNHTLCYNNFQKHDPDLTLIRHHVLTWVKFIIGYLFPLLTMSICYLCLIFKV KKRSILISSRHFWTILVVVVAFVVCWTPYHLFSIWELTIHHNSYSHHVMQAGIPLSTGLA FLNSCLNPILYVLISKKFQARFRSSVAEILKYTLWEVSCSGTGRQTSNVATQGSKRPRWK FLVLLKAMTGTAMWNCESNSTSFFCKLPSLRYVFISSAKTDKYSKLAPVEWVTVEKVPEN VEVTLELGNRQRLEQFGGLRRR >gi568815596f:206059980_206262880|GENSCAN_predicted_CDS_5|1869_bp atgagtgtccaccgtggagaggtaccttgcacagggaccactgcatctcctttagaggaa gccacactctctgaattaaaaacagtcctgaagagcttcctaagtcaaagccaagtattg aaattggaggagaaatcattaaaccacaaggattcagacagagcccagagccctgaaaac tttggccacgcactttcccgcagcagccacaggcaccggcaacttcagagagttccctat atcatcgatgaaggaacaggagctgattgtaccaaaacacttatatgttcgctagatcag atcggagaactcgggaaaatggcgtacttacaagactgcaggtttactacaagggccaga actcaggaagagccggatggaagcgatgcacgggacaaggaaaggaaacgcgctggagtg ggtacagtggatggaggggtgacaggctttgaggggctggcagcagctgaggtccacagt gccacgtgtaccgagacactgagaggagcagagttctcctcttttgtagtaagatttctt cattctccatttagcaaggtcatggaagatttggaggaaacattatttgaagaatttgaa aactattcctatgacctagactattactctctggagtctgatttggaggagaaagtccag ctgggagttgttcactgggtctccctggtgttatattgtttggcttttgttctgggaatt ccaggaaatgccatcgtcatttggttcacggggttcaagtggaagaagacagtcaccact ctgtggttcctcaatctagccattgcggatttcatttttcttctctttctgcccctgtac atctcctatgtggccatgaatttccactggccctttggcatctggctgtgcaaagccaat tccttcactgcccagttgaacatgtttgccagtgtttttttcctgacagtgatcagcctg gaccactatatccacttgatccatcctgtcttatctcatcggcatcgaaccctcaagaac tctctgattgtcattatattcatctggcttttggcttctctaattggcggtcctgccctg tacttccgggacactgtggagttcaataatcatactctttgctataacaattttcagaag catgatcctgacctcactttgatcaggcaccatgttctgacttgggtgaaatttatcatt ggctatctcttccctttgctaacaatgagtatttgctacttgtgtctcatcttcaaggtg aagaagcgaagcatcctgatctccagtaggcatttctggacaattctggttgtggttgtg gcctttgtggtttgctggactccttatcacctgtttagcatttgggagctcaccattcac cacaatagctattcccaccatgtgatgcaggctggaatccccctctccactggtttggca ttcctcaatagttgcttgaaccccatcctttatgtcctaattagtaagaagttccaagct cgcttccggtcctcagttgctgagatactcaagtacacactgtgggaagtcagctgttct ggcacaggtcgccagacttctaatgtggcaactcagggctccaagaggccaaggtggaaa tttctagtcctgttaaaggctatgactggcacagctatgtggaactgtgagtccaactca acctctttcttttgtaaattgcccagtctcaggtatgtctttatcagcagcgcgaaaaca gacaaatacagcaaattggcaccagtagagtgggtcactgttgaaaaggtacctgaaaat gtggaagtgacattggaactgggtaacaggcagaggttggaacaatttggagggctcaga agaagatag >gi568815596f:206059980_206262880|GENSCAN_predicted_peptide_6|184_aa MYELQTLVQQYKGSISKESENTKWPGSTASRSLSVLEVCRHQYPQTPSLLPRCPRASSPQ LKGQRRKSRSEDCLKMGNWKSLSPQGATVESAAYGSAGGSKSCRVPPAVLHAIECCERFQ DRKTGGCGRAAVPQSPRRTTDQSIALTLGKGLFMGDRKVQILVEEPAVVPGIEPISQAWR NKRK >gi568815596f:206059980_206262880|GENSCAN_predicted_CDS_6|555_bp atgtatgagcttcagacattagtccaacagtataaaggtagcatttccaaagaatcggaa aacacaaagtggccaggcagcaccgcttcaagaagtctctccgtgcttgaagtctgcagg caccagtaccctcagacgccctccctcctgccaagatgccctagagcaagttccccgcag ctgaaggggcagcgaaggaagagccgaagtgaagactgcctaaagatggggaactggaaa agtttgagtccccagggagctaccgtggagagtgctgcatacggtagcgcaggcggctcc aagtcgtgtagagtgccgccggcggtattacatgcaatagagtgctgcgaacggttccaa gacaggaagaccgggggatgtgggagggcagcagtcccccaatccccaagacggaccaca gaccaaagcattgctctgaccctaggaaagggcctgtttatgggtgacaggaaggttcag atcctagtggaggaaccagcagtagtgcctggaatagagcccatatcccaggcatggagg aacaagaggaagtag