GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:22:16 Sequence gi568815589r:30731884_30932225 : 200342 bp : 34.39% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 12061 12266 206 2 2 7 54 214 0.727 6.45 1.02 PlyA + 13215 13220 6 1.05 2.00 Prom + 22600 22639 40 -4.45 2.01 Init + 25945 26038 94 0 1 39 84 74 0.828 1.03 2.02 Intr + 26107 26328 222 2 0 27 13 203 0.444 3.88 2.03 Intr + 26571 26885 315 1 0 85 103 101 0.708 6.41 2.04 Intr + 27439 27558 120 2 0 50 52 110 0.178 3.15 2.05 Intr + 41853 42214 362 1 2 73 22 413 0.323 27.21 2.06 Intr + 42305 42532 228 1 0 -11 34 335 0.442 15.34 2.07 Term + 49188 49403 216 2 0 24 38 148 0.073 -0.44 2.08 PlyA + 49579 49584 6 1.05 3.03 PlyA - 49974 49969 6 1.05 3.02 Term - 55996 55002 995 2 2 47 37 392 0.113 21.24 3.01 Init - 58517 58397 121 2 1 74 -4 154 0.497 5.20 3.00 Prom - 61464 61425 40 -4.85 4.05 PlyA - 63016 63011 6 1.05 4.04 Term - 67950 67745 206 1 2 -20 50 252 0.689 7.15 4.03 Intr - 68410 68176 235 1 1 77 -29 477 0.758 31.24 4.02 Intr - 68805 68504 302 1 2 64 71 406 0.245 32.03 4.01 Init - 80899 80887 13 1 1 78 69 17 0.003 -0.87 4.00 Prom - 87165 87126 40 -4.85 5.00 Prom + 87898 87937 40 -7.15 5.01 Init + 88497 88634 138 2 0 70 -5 143 0.400 3.39 5.02 Term + 89250 89642 393 2 0 62 41 218 0.669 8.55 5.03 PlyA + 92248 92253 6 1.05 6.03 PlyA - 93380 93375 6 1.05 6.02 Term - 100366 99995 372 1 0 74 49 463 0.086 34.71 6.01 Init - 114840 114811 30 0 0 85 107 6 0.884 2.01 6.00 Prom - 115055 115016 40 -5.45 7.00 Prom + 120588 120627 40 -7.65 7.01 Init + 122421 122606 186 2 0 56 115 119 0.917 10.50 7.02 Intr + 123497 123676 180 1 0 55 65 100 0.786 3.54 7.03 Term + 126107 126334 228 1 0 86 48 124 0.867 3.85 7.04 PlyA + 126426 126431 6 1.05 8.03 PlyA - 127466 127461 6 1.05 8.02 Term - 151676 151540 137 0 2 47 42 179 0.914 6.40 8.01 Init - 168583 168568 16 1 1 81 121 10 0.033 4.04 8.00 Prom - 176050 176011 40 -4.75 9.00 Prom + 182721 182760 40 -4.55 9.01 Sngl + 193842 194090 249 2 0 60 50 192 0.847 7.43 9.02 PlyA + 194410 194415 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 55928 55002 927 2 0 70 37 355 0.829 24.75 S.002 Intr + 80299 80356 58 2 1 94 101 51 0.818 4.02 S.003 Sngl - 100342 99995 348 1 0 66 49 440 0.834 33.69 S.004 Term - 112513 112394 120 1 0 119 43 63 0.841 2.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:30731884_30932225|GENSCAN_predicted_peptide_1|68_aa XSSVVDRKGLSQGAFKSFETLELVAAAPGRQLSDFGKHEFRPPPLVAVVVAAITVIVEAV GGTRFQSM >gi568815589r:30731884_30932225|GENSCAN_predicted_CDS_1|207_bp ntgtcatcagtggtggacaggaaaggcctatctcagggtgcattcaagtcctttgaaacc ctggagttggttgcagcagcaccaggaaggcaactctcagactttgggaagcatgagttt cggcctccaccattggtggcagtggtagtggcagcaataacagtcatagtggaagcagtt ggaggaacccgttttcagagcatgtga >gi568815589r:30731884_30932225|GENSCAN_predicted_peptide_2|518_aa MLLVLAGLVLPTPPGRLCSTHATGPYPTPAKARHASSCGVGGKLQAQAHVPAPWEALLDQ VYRFQLPLRAPVSQQGEYGGNQKFGDSRNGRAPKRVSQPWLLEPIGRSSGRVCSFQWRRD PQWVAPFRRQVVPTSVQLSEERRPGRDSSFVRAGCPNESRRPKEGSSFLQLLILMFVQVW RSPGVFISSEGRQLHEDWSMGSRGQAWKKHRPHRLEVERSPGCTSSAAAIVFAAASPDES SLPSLGHQTPERGASRPRTRGWRANLGCTWRRRDPKSETGAITIQDLRAQIFTNTMDNAH IVLQVDNAGLAADDFRVKYETGTVMRRSVEREIHGLHKIIDDTNVTWLQLKTQIEAFKEE LLFMKKNHEEETSGPNNEELAQKNREDLEKYWSQQIEESTTVVTMQSAEVGAAEVTLMEL THTVQSLEIDLDLMRNNKANWENSPREVESQCEEGWNLQKELIFLLYCRNGHRLAPWSDY VMTPVPCSSSLLDINDQIKRSISVELRDGVDTDLKKCR >gi568815589r:30731884_30932225|GENSCAN_predicted_CDS_2|1557_bp atgttgctggtgcttgctgggctcgttctgcccaccccacctggcaggctgtgctcaact catgctaccggcccatatcccacacctgccaaggcaaggcatgccagcagctgtggtgtc ggtgggaagctccaggcccaagcacacgtgccggctccatgggaggctttgctggaccag gtataccgctttcagcttccactgagggcacctgtgtctcaacaaggggaatatggtggc aaccagaagtttggagactccaggaatggcagagccccaaagagggtgtcacagccctgg ctcctggagcccataggcaggtcgtccggaagagtgtgcagctttcagtggagaagagac ccacagtgggtagctcctttccgcaggcaggttgtcccgacaagtgtccagctctcagag gagaggagacctggcagggatagctcctttgtgcgggcaggttgtcccaatgagtcaagg agacccaaagagggtagttcctttttgcagctgctaatcctgatgtttgtccaagtctgg cggagtccaggggtttttataagctcagaagggaggcagttgcatgaagattggtccatg ggcagccgtggacaggcctggaaaaagcaccgcccccaccggctcgaagtagagcgcagc cccggctgcacttcctctgctgcagccatcgtctttgcagcagccagtccagatgagtcg tcactgccatcacttggacatcagactccagagcgaggagcctcgagaccaagaaccaga gggtggagagcaaacttggggtgcacctggagaagaagagaccctaagtcagagactggg gccattaccattcaggacctgagggctcagatcttcacaaatactatggacaatgcccac attgttctgcaggttgacaatgctggtcttgctgctgatgactttagagtcaagtatgag acagggacagtcatgcgccggtctgtggagagagaaatccatgggctccacaagatcatt gatgacaccaatgtcacttggctgcagctgaagacacagatcgaggctttcaaggaggag ctgctcttcatgaagaagaaccatgaagaggaaacatccgggcccaataacgaagagctg gctcagaagaaccgagaggatctagaaaagtactggtctcagcagattgaggagagcacc acagtggtcaccatgcagtccgctgaggttggagctgccgaagtgacgctcatggaactg acacatacagtccagtccttggagatcgacctggacttgatgagaaataacaaggccaat tgggagaacagcccgagggaggtagaaagccaatgcgaggaggggtggaacttacagaag gaacttatattccttctgtactgcaggaatgggcacagactggctccctggtctgactac gtaatgacaccagttccatgttcatcgtcacttttagatattaatgatcagataaaacgc tcaatttcagtggaattaagagatggagtggatactgatctcaagaagtgtcggtaa >gi568815589r:30731884_30932225|GENSCAN_predicted_peptide_3|371_aa MRQQHSWITKIRGSADTAADTQAKRVWSGPLANSNRPAAEEIQTTIREYYKHLYSNKLEN LEEMDKFLDTYTLPRLNQEEVVSLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYK EELVPFLLKLFQSIEKEGILPNSFYEASIILISKPGTDTTKKENFRPISLMNIDAKILNK ILANRIQQHIKKLIHHDQVGFIHGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAF DKIQQRFMLKILNKLGIDGMYLKIIRALYDKPTANIILNGQKLEVLPLKTGRRQGCSLSP LLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNF NKVSGYKINVQ >gi568815589r:30731884_30932225|GENSCAN_predicted_CDS_3|1116_bp atgagacagcagcattcgtggatcacgaaaatccgtggttctgcagacacggctgctgat acccaagcaaaaagggtctggagtggacctctagcaaactccaacagacctgcagctgag gaaatacaaactaccatcagagaatactacaaacacctctactcaaataaactagaaaat ctagaagaaatggataaattcctcgacacatacactctcccaagactaaaccaggaagaa gttgtatctctgaatagaccaataacaggctctgaaattgtggcaataatcaatagctta ccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccagaggtacaag gaggaactggtaccattccttctgaaactattccaatcaatagaaaaagaaggaatcctc cctaactcattttatgaggccagcatcatcctgatatcaaagccaggcacagacacaacc aaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaa atactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc ttcatccatgggatgcaaggctggttcaatatatgcaaatcaataaatgtaatccagcat ataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggccttt gacaaaattcaacaacgcttcatgctaaaaattctcaataaattaggtattgatgggatg tatctcaaaataataagagctctctatgacaaacccacagccaatatcatactgaatggg caaaaactggaagtactccctttgaaaaccggcagaagacagggatgctctctctcacca ctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaata aagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgatt gtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttc aacaaagtctcaggatacaaaatcaatgtacaataa >gi568815589r:30731884_30932225|GENSCAN_predicted_peptide_4|251_aa MTEEGAQLRRPAASSVASVYAGKGSASWISVSRSTSFLGGMGSGDLAAGMARGLAGMGGI QNKKETMQDLNDGLASYLDRVRSLETNKRKLESKIQEHLEKKGPQMDNAGLAADDFRVKY ETELVMRQSVENDIHGLCKVMDGTNVTRMQLETQFEALKEELLLMKKNHEEQVKGRQAQI AALDRPGLDEKSEGQLGEQPEGGGSPLRPADGTAQRDPVAPGVRAGTDLGRGVAPGAGVR VPAEHQGQAGG >gi568815589r:30731884_30932225|GENSCAN_predicted_CDS_4|756_bp atgactgaagaaggtgcccagctacggcgcccagctgccagcagcgtggccagcgtctat gcaggcaagggctctgcttcctggatctccgtgtcccgctccaccagcttcctgggcggc atggggtccggggacctggccgcggggatggccaggggtctggcaggaatgggaggcatc cagaacaagaaggagaccatgcaagacttgaacgacggcctggcctcctacctggacaga gtaaggagcctggagaccaacaagaggaagctggagagcaaaatccaggagcacctggag aagaagggaccccagatggacaatgcaggacttgctgctgatgactttagagtcaagtac gagacagagctggtcatgcgccagtctgtggagaacgacatccatgggctctgcaaggtc atggatggcaccaatgtcactcggatgcagctggagacacagttcgaggctctcaaggag gagctgctcttaatgaagaagaaccacgaagagcaagtgaaaggccggcaagcccagatt gcagctctggatagacctggactcgatgagaaatctgaaggccagcttggagaacagcct gagggaggtggaagcccgctacgccctgcagatggaacagctcaacgggatcctgttgca cctggagtcagagctggcacagacctgggcagaggggtagcgccaggcgcaggagtacga gtccctgctgaacatcagggtcaagctggaggctga >gi568815589r:30731884_30932225|GENSCAN_predicted_peptide_5|176_aa MDEYSKLIPVQQHVAEKIPENVELTLELGNRQRLEQFGRLRRTQENLWLKGANVQLGLWL QRMEALSLDSFHVVLSLTGAQKSRTEVWEPPPRFQQMYRNDWLPSQKFAAGMGLSWRTSA RAVQKGNVGSEPPHRVPTRALLRGAVRRGSSSSRPHNGRSTDSLHCMPGKAADTQC >gi568815589r:30731884_30932225|GENSCAN_predicted_CDS_5|531_bp atggacgaatacagtaaattgataccagtacagcagcatgttgctgaaaagatacctgaa aatgtggaattgactttggaactcggtaacaggcagaggttggaacaatttggaaggctc agaagaacacaagaaaatctatggctgaaaggggccaatgtacagcttggtctgtggctt cagaggatggaagccctaagcctggacagcttccacgtggtgttgagccttacaggtgca cagaagtcaagaactgaggtttgggaacctccacctagatttcagcagatgtatagaaat gactggttacccagtcaaaagtttgctgcagggatggggctgtcatggagaacctctgct agggcagtacagaagggaaatgtggggtcagaacccccacacagagttcctactagggca ctgctcagaggagctgtgagaagagggtcatcgtcttccagaccccacaatggtagatcc actgacagcttgcactgtatgcctggaaaagccgcagacactcaatgctag >gi568815589r:30731884_30932225|GENSCAN_predicted_peptide_6|133_aa MGLFKVAPKELSPVRASKMTKKRRNNGRAKKGRGHVQPIRCTNCARCVPKDKAIKKLVIR NIVEATAVRDISEASVFDAYVLPKLYVKLHYCVSCAIHSKVVRNRSHEARKDRTPPPRFR PAGAAPRPPPKPL >gi568815589r:30731884_30932225|GENSCAN_predicted_CDS_6|402_bp atggggctttttaaggtagctccaaaagagctctctccggtccgtgcctcgaagatgaca aagaaaagaaggaacaacggtcgtgccaaaaagggccgcggccacgtgcagcctattcgc tgcactaactgtgcccgatgtgtgcccaaggacaaggccattaagaaattggtcattcga aacatagtggaggccacagcagtcagggacatttctgaagcaagcgtcttcgatgcctac gtgcttcccaagctgtatgtgaagctgcattactgtgtgagttgtgcaatccacagcaaa gtagtcaggaatcgatctcatgaagcccgcaaggaccgaacacccccaccccgatttaga cctgctggtgctgccccacgtcccccaccaaagcccttgtaa >gi568815589r:30731884_30932225|GENSCAN_predicted_peptide_7|197_aa MEEDSIPQLHLSIQLMIRSPRHPPVPQFLPKLNSKLQVEALGKKTRSKGPRSRQQSLKGI VQEAVKTVFILILILRAVRCASLEGGDKQPAGTRSPEKLQPACTLGGMCTGVEAQRFTPF AVGFYGFRSNINRTEDKNHIVISIDAEKAFDKIQQCFMLKTLNKVGIDEMYIKIIRAIYD KPTVNIILNGQKLEAFP >gi568815589r:30731884_30932225|GENSCAN_predicted_CDS_7|594_bp atggaggaggactcaattccacagcttcaccttagcattcagcttatgataaggagtcca cgacaccccccagtgccacagtttcttcccaaactcaattccaagcttcaggttgaagcc ctaggaaagaaaactagatctaagggacctagaagcagacaacagagtttaaaaggcata gtgcaggaagcagttaaaacagtcttcatccttatccttattctaagagcagttagatgt gcctctttagaagggggcgataaacaaccagctgggacgaggtccccggaaaaactccaa ccagcctgcacactgggaggaatgtgcactggggtggaggcacagaggttcacaccattt gcagtgggtttttatggttttaggtctaacataaacagaaccgaagacaaaaaccatata gttatctcaatagatgcagaaaaggcctttgacaaaattcaacagtgcttcatgctaaaa actctcaataaagtaggtattgatgagatgtatatcaaaataataagagctatttatgac aaacccacagtcaatatcatactgaatgggcaaaaactggaagcattcccttga >gi568815589r:30731884_30932225|GENSCAN_predicted_peptide_8|50_aa MTVVKVLWLASPLEYWDQWTRNILAPPVQQVLNFKGPGDKAGDLVPVRQS >gi568815589r:30731884_30932225|GENSCAN_predicted_CDS_8|153_bp atgacagtagtgaaagtgctttggctggcatccccactggagtattgggaccagtggacc aggaacatcttggcccctccagtgcagcaggttcttaacttcaagggtccaggggacaaa gctggggacctcgtgccagtccgccagagctag >gi568815589r:30731884_30932225|GENSCAN_predicted_peptide_9|82_aa MSLASGDALGHFYSWQKVKQEQASHMANEKAKEMEALHSFKHQDLTKTESKNSLSVQGQH QAIHEESAPMIQTPPTMPYLQY >gi568815589r:30731884_30932225|GENSCAN_predicted_CDS_9|249_bp atgtctttggcttctggtgatgccttagggcatttttactcatggcagaaggtgaagcag gagcaggcaagtcacatggctaatgagaaagcgaaagaaatggaggcgctacactctttt aaacatcaagatcttacaaaaactgaaagcaaaaactcactctcagtgcaaggacagcac caagctattcatgaggaatctgcccccatgatccaaacacctcccaccatgccctacctc caatattag