GENSCAN 1.0 Date run: 8-Nov-116 Time: 17:39:42

Sequence gi568815583f:41232958_41480183 : 247226 bp : 44.98% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    471    532   62  2  2   51   70    65 0.217   1.72
 1.02 Intr +  10592  10648   57  2  0   67   47    94 0.597   1.20
 1.03 Intr +  10720  10782   63  1  0   73   96   104 0.794   7.53
 1.04 Intr +  23953  24033   81  1  0   47   84   135 0.659   7.75
 1.05 Intr +  29799  29926  128  0  2   36   65   129 0.945   5.72
 1.06 Intr +  37600  37661   62  2  2  116   91   100 0.978  11.55
 1.07 Intr +  45810  45932  123  2  0  107   81   132 0.998  15.18
 1.08 Term +  46379  46432   54  1  0  119   43    91 0.997   5.26
 1.09 PlyA +  48905  48910    6                               1.05

 2.00 Prom +  54503  54542   40                              -3.66
 2.01 Init +  58876  59048  173  0  2   62   63   133 0.852   7.12
 2.02 Intr +  99199  99573  375  1  0  131   66   194 0.022  15.63
 2.03 Intr + 108734 108785   52  2  1   41   74    26 0.023  -4.59
 2.04 Intr + 109429 109497   69  0  0   65  109    53 0.745   4.48
 2.05 Intr + 116141 116284  144  1  0   62   68   103 0.903   6.18
 2.06 Intr + 118076 118172   97  1  1   19   97    77 0.320   1.28
 2.07 Intr + 125192 125301  110  0  2   34   95    75 0.591   2.80
 2.08 Intr + 132445 132632  188  0  2   60  103    21 0.557  -0.71
 2.09 Intr + 138570 138727  158  0  2   86   58   106 0.909   7.05
 2.10 Intr + 142755 142871  117  1  0   61   95    25 0.467   0.94
 2.11 Term + 149188 149309  122  2  2   49   55    55 0.171  -3.06
 2.12 PlyA + 151072 151077    6                               1.05

 3.06 PlyA - 151517 151512    6                               1.05
 3.05 Term - 154636 154487  150  1  0   45   43    93 0.601  -1.59
 3.04 Intr - 155565 155491   75  0  0   72   87    23 0.386   0.31
 3.03 Intr - 162087 161902  186  0  0   50   87   137 0.833   9.69
 3.02 Intr - 164058 163530  529  2  1   47   73   236 0.686  10.94
 3.01 Init - 171045 170972   74  0  2   88  116    45 0.917   7.84
 3.00 Prom - 179819 179780   40                              -3.46

 4.00 Prom + 182474 182513   40                              -6.76
 4.01 Init + 184159 184356  198  0  0   54   94   392 0.996  33.30
 4.02 Intr + 205364 205474  111  1  0   76   90    92 0.913   8.78
 4.03 Intr + 219944 220091  148  1  1  101  105   100 0.836  12.81
 4.04 Intr + 224715 224919  205  1  1   80  116   208 0.984  21.06
 4.05 Intr + 231814 231928  115  1  1   40   91   116 0.914   7.55
 4.06 Intr + 233184 233295  112  2  1  108  115   173 0.999  21.95
 4.07 Intr + 237300 237435  136  1  1   98   57   261 0.981  23.63
 4.08 Intr + 238215 238392  178  0  1   89   11   106 0.752   2.92
 4.09 Intr + 241663 241745   83  0  2   94   70    30 0.567   0.24
 4.10 Intr + 242568 242655   88  0  1   34   88    65 0.694   1.07
 4.11 Intr + 242755 242862  108  0  0   75   94    58 0.915   5.58
 4.12 Intr + 243489 243566   78  2  0   85  116    66 0.992   8.85
 4.13 Intr + 244208 244329  122  1  2   25   89   188 0.995  11.89
 4.14 Intr + 244501 244558   58  1  1  117   83   117 0.994  12.99
 4.15 Intr + 245591 245668   78  1  0   76   82    58 0.918   3.75
 4.16 Term + 246146 246289  144  1  0   61   50    96 0.855   1.01
 4.17 PlyA + 246750 246755    6                              -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:41232958_41480183|GENSCAN_predicted_peptide_1|209_aa
MWLVDKQLSAHGKAVFPIAKEDKSFDIPVLLNFEAFDSEGQITRLYSRFTSLDKGENGTL
SREDFQRIPELAINPLGDRIINAFFPEGEDQVNFRGFMRTLAHFRPIEDNEKSKDVNGPE
PLNSRSNKLHFAFRLYDLDKDEKISRDELLQVLRMMVGVNISDEQLGSIADRTIQEADQD
GDSAISFTEFVKVLEKVDVEQKMSIRFLH
>gi568815583f:41232958_41480183|GENSCAN_predicted_CDS_1|630_bp
atgtggcttgtggacaaacaactaagtgctcatggaaaagctgttttccccattgccaaa
gaagataagagtttcgacatcccagtcctactgaattttgaagcatttgacagcgagggt
caaatcactcgcctctacagccggttcaccagcctggacaaaggagagaatgggactctc
agccgggaagatttccagaggattccagaacttgccatcaacccactgggggaccggatc
atcaatgccttctttccagagggagaggaccaggtaaacttccgtggattcatgcgaact
ttggctcatttccgccccattgaggataatgaaaagagcaaagatgtgaatggacccgaa
ccactcaacagccgaagcaacaaactgcactttgcttttcgactatatgatttggataaa
gatgaaaagatctcccgtgatgagctgttacaggtgctacgcatgatggtcggagtaaat
atctcagatgagcagctgggcagcatcgcagacaggaccattcaggaggctgatcaggat
ggggacagtgccatatctttcacagaatttgttaaggttttggagaaggtggatgtagaa
cagaaaatgagcatccgatttcttcactaa
>gi568815583f:41232958_41480183|GENSCAN_predicted_peptide_2|534_aa
MVLNKTVLTSDASCTSEVPPATCIFDQVATDLESPMTLLEPAFDFVGFLIVFSSFKVRWF
SATRWRKREHPLGLASAFPERFTALTGEDHGPEGPRQVPGEVHRVGEHCVTLCALEHSAP
LRLQPRRQLRAGGWLLSPQPCGAERRGPLHHLRIPLHGGRKRSLVNRPLSATTKVPPGRR
CTTPQHFWLIPIKCILHIVQATKLLKALKGYIKHEARKGNENQDESQTSASSCDETEIQI
SNQEEAERQPLGHVTKTRRRCKTVRVDPDSQNHEKQESQDLRATAKVPSPPDEHQEAENA
VSSDFKKLHEAHFKEMESIDQYIERKKKHFEEHNSMNELKQQPINKGGVRTPVPPRGRLS
VASTPISQRRSQGRSCGPASQSTLGLKGSLKRSAISAAKTGVRFSAATKDNEHKRSLTKT
PARKSAHVTVSGGTPKGEAVLGTHKLKTITGNSAAVITPFKLTTEATQTPVSNKKPVFDL
KASLSRPLNYEPHKGSEERLFPAAITSRSGERLCPAAHRLRCGERLCPAAPSGM
>gi568815583f:41232958_41480183|GENSCAN_predicted_CDS_2|1605_bp
atggtcctcaacaagactgtcctcacttcagatgccagctgtacttcggaggtaccccca
gccacctgcattttcgaccaagtggctacagatttggagagtcccatgaccctcttagaa
ccagcttttgatttcgttggttttcttattgtttttagttcctttaaagttaggtggttc
tccgccacccggtggagaaagcgggaacaccctctcgggctagcctctgcctttcccgaa
cgcttcactgcactcactggagaagaccacggccccgagggaccgcgacaggtcccaggc
gaggtgcaccgagtcggcgagcactgcgtgacactgtgcgcactggaacacagcgcacct
ctcaggctgcagccaagacggcagctgcgggccggcggctggctcctcagcccccagccc
tgcggggccgagcggcgaggaccccttcaccacctgcgtatcccactccatggaggtcgt
aaaagaagcttggtcaatcgccctctcagtgccaccacaaaagtccccccggggcggcgt
tgcacaacgccacagcatttctggctcatccccatcaaatgcatcttgcacattgtgcag
gcaaccaagttgttaaaagccttgaaaggctacattaaacatgaggcaagaaaaggaaat
gagaatcaggatgaaagtcaaacttctgcatcctcttgtgatgagactgagatacagatc
agcaaccaggaagaagctgagagacagccacttggccatgtcaccaaaacaaggagaagg
tgcaagactgtccgtgtggaccctgactcacagaatcatgaaaagcaggaaagccaggat
ctcagagctactgcaaaagttccttctccaccagacgagcaccaagaagctgagaatgct
gtttcctcagactttaagaagcttcatgaagctcattttaaggaaatggagtccattgat
caatatattgagagaaaaaagaaacattttgaagaacacaattccatgaatgaactgaag
cagcagcccatcaataagggaggggtcaggactccagtacctccaagaggaagactctct
gtggcttctactcccatcagccaacgacgctcgcaaggccggtcttgtggccctgcaagt
cagagtaccttgggtctgaaggggtcactcaagcgctctgctatctctgcagctaaaacg
ggtgtcaggttttcagctgctactaaagataatgagcataagcgttcactgaccaagact
ccagccagaaagtctgcacatgtgaccgtgtctgggggcaccccaaaaggcgaggctgtg
cttgggacacacaaattaaagaccatcacggggaattctgctgctgttattaccccattc
aagttgacaactgaggcaacgcagactccagtctccaataagaaaccagtgtttgatctt
aaagcaagtttgtctcgtcccctcaactatgaaccacacaaaggaagtgaggagcgcctc
ttcccagccgccatcacatctaggagtggggagcgtctctgcccggccgcccatcgtctg
agatgtggggagcgcctctgccccgccgccccatctgggatgtga
>gi568815583f:41232958_41480183|GENSCAN_predicted_peptide_3|337_aa
MGHGETARIKGCGKRIEESVMGAERKFSKPTSALYPFLGIRFAEYSSSLQKPVASPGKAS
SQRKTEGDLQGDHQKEVALDITSSEEKPDVSFDKAIRDEAIYHFRLLKDEIVDHWRGPEG
HPLHEVLLEQAKVVWQFRGKEDLDKWTVTSDKTIGGRSEVFLKMGKNNQSALLYGTLSSE
APQDGESTRSGYCAMISRIPRGAFERKMSYDWSQFNTLYLRVRGDGRPWMVNIKEDTDFF
QRTNQMYSYFMFTRGGPYWQEVKIPFSKFFFSNRGRIRDVQHELPLDKISSIGFTLADKV
DGPFFLEIDFIGVFTDPAHTEEFAYENSPELNPRLFK
>gi568815583f:41232958_41480183|GENSCAN_predicted_CDS_3|1014_bp
atgggccatggagagacagcccgcattaaaggttgtgggaagagaatagaagagtcagtg
atgggggcagaaagaaaattctctaagccaacttctgccttgtatccatttttgggtatt
cgctttgcagagtattccagtagtcttcagaaaccagtggcttctcctggcaaagcctcc
tcacagaggaagactgaaggggatttgcaaggagatcaccagaaagaagttgctttggat
ataacttcttctgaggagaagcctgatgttagtttcgataaagcaattagagatgaagca
atataccattttaggcttttgaaggatgaaattgtggatcattggagaggaccggaaggc
caccctctgcatgaggtcttgctggaacaagccaaggttgtctggcaattccgggggaaa
gaagatttggataagtggacagtgacttctgataagacgattggaggcagaagtgaagtg
tttttgaaaatgggcaagaataaccaaagtgcactgctatatggaactctgagctctgag
gcgcctcaggacggggagtctacccgaagtgggtactgtgcaatgatatccaggattcca
aggggtgcttttgagaggaagatgtcttacgattggtcccagttcaatactctgtatctc
cgtgtacgtggggatggtcggccttggatggtgaatatcaaggaggacacagatttcttc
cagaggacgaatcagatgtatagttacttcatgttcacccgcgggggaccctactggcag
gaggtcaagattcctttttccaaatttttcttctctaatcgaggaagaatccgggatgtt
cagcatgagcttccgcttgataagatctcttctataggattcaccttggctgataaagtg
gatggtccattcttcctggagatagattttattggcgtgtttactgatccagctcataca
gaagaatttgcctatgaaaattctccagagcttaacccaaggctttttaaataa
>gi568815583f:41232958_41480183|GENSCAN_predicted_peptide_4|653_aa
MRGRLCVGRAAAAAAAVAVPLAGGQEGSPGGGRRGSRGTTMVKKRKGRVVIDSDTEDSGS
DENLDQELLSLAKRKRSDSEEKEPPVSQPAASSDSETSDSDDEWTFGSNKNKKKGKARKI
EKKGTMKKQANKTASSGSSDKDSSAESSAPEEGEVSDSDSNSSSSSSDSDSSSEDEEFHD
GYGEDLMGDEEDRARLEQMTEKEREQELFNRIEKREVLKRRFEIKKKLKTAKKKEKKEKK
KKQEEEQEKKKLTQIQESQVTSHNKERRSKRDEKLDKKSQAMEELKAEREKRKNRTAELL
AKKQPLKTSEVYSDDEEEEEDDKSSEKSDRSSRTSSSDEEEEKEEIPPKSQPVSLPEELN
RVRLSRHKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVAEITGVVETAKVYQLGGT
RTNKGLQLRHGNDQRVFRLEFVSNQEFTESEFMKWKEAMFSAGMQLPTLDEINKKELSIK
EALNYKFNDQDIEEIVKEKERFRKAPPNYAMKKTQLLKEKAMAEDLGDQDKAKQIQDQLN
ELEERAEALDRQRTKNISAISYINQRNREWNIVESEKALVAESHNMKNQQMDPFTRRQCK
PTIVSNSRDPAVQAAILAQLNAKYGSGVLPDAPKEMSKASVVPPCCLLSEAAG
>gi568815583f:41232958_41480183|GENSCAN_predicted_CDS_4|1962_bp
atgcgcggtcgcctttgtgtgggtcgagcagcggcggcggcggcggcagtggcggtccca
ctggcaggcgggcaagaggggagtccgggcggcggccggcgtgggagccgggggaccacc
atggtaaagaagcggaaaggccgcgtcgtgatcgactcggacacagaggacagcggcagc
gacgagaacctggatcaggagctcttgtccctggcaaagcgaaagcgcagtgactctgag
gagaaggagccgcctgtgagtcagcctgcagcctcgtcagactcggagacgtctgacagt
gacgatgagtggacatttgggagcaataaaaataagaagaaaggaaaagccagaaaaata
gagaagaaaggaaccatgaagaaacaggccaacaaaactgcctcctcaggcagttcagac
aaagacagttcagctgagagctcagcccctgaggaaggtgaagtgtcagactctgacagc
aacagctcctcttccagttcagattcagactcttcctcagaagatgaagagttccatgat
ggctatggagaagacctcatgggagatgaggaagacagggcccgtctggaacagatgaca
gagaaagagagagagcaagaactgttcaatcgcatagagaagagggaggtgttgaaaaga
agatttgaaatcaagaaaaaactaaaaacagccaaaaagaaagaaaagaaagaaaagaag
aaaaagcaagaagaggagcaagaaaagaaaaaactgacacagattcaagaatctcaggta
acatcccacaacaaggaacggcgttccaagcgggatgagaaactagacaagaaatctcaa
gccatggaggagctaaaagcagagcgagaaaaacgaaagaacagaacagctgagctcctt
gccaaaaaacagccattaaaaaccagtgaggtctactctgatgatgaagaggaggaagag
gatgacaaatccagtgaaaagtcagaccgctcatcacgaacatcatcgtctgatgaagaa
gaggagaaagaagagatccctcccaaatcccaaccagtttccttacctgaagaattgaat
cgggttcgattatcacggcataagctagaacgctggtgtcacatgcccttctttgctaaa
actgtcacaggatgttttgtgcggattggcatcggaaaccacaacagcaaaccagtttac
cgggtcgctgagattacgggtgttgtggaaactgccaaagtttaccaactaggtggcacc
agaacaaacaaagggctgcaactacggcatggcaatgaccaacgcgtgttccgtttagag
tttgtctcaaaccaagaattcaccgaaagtgagtttatgaagtggaaagaagcgatgttc
tctgctggcatgcagttgcccactctagatgaaatcaataaaaaggaattatctattaaa
gaagctcttaattataaattcaatgatcaggacattgaagagattgtaaaagagaaagaa
aggttcagaaaagctccacccaactacgctatgaagaagactcagctactgaaggaaaag
gccatggctgaggacctgggggatcaggacaaggccaaacaaatccaagatcaactgaat
gagctggaggaacgggcagaggccctggaccgccagcggaccaagaacatatccgctatc
agttacatcaaccagcggaaccgggagtggaacattgtagagtctgagaaggcccttgtg
gctgaaagtcacaacatgaaaaaccaacagatggatccctttactcggcggcagtgcaag
cctaccatcgtttctaattccagagacccagctgttcaagctgccatcttggcccagctg
aatgcaaaatacggttctggagtgttaccagatgctccaaaggaaatgagcaaggcaagt
gtggtaccaccctgttgcctgctgagtgaggctgctggctga