GENSCAN 1.0 Date run: 2-Nov-116 Time: 21:51:45 Sequence gi568815596r:25642032_25978222 : 336191 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 2155 2150 6 1.05 1.08 Term - 9840 9818 23 1 2 86 38 30 0.362 -3.73 1.07 Intr - 10630 10563 68 0 2 107 66 101 0.444 8.35 1.06 Intr - 30582 30479 104 0 2 91 59 50 0.021 1.37 1.05 Intr - 31833 31708 126 0 0 37 68 101 0.377 3.88 1.04 Intr - 48509 48445 65 0 2 110 82 52 0.096 5.24 1.03 Intr - 58317 58289 29 2 2 76 86 3 0.148 -3.34 1.02 Intr - 59997 59885 113 0 2 73 61 96 0.335 4.58 1.01 Init - 64240 64163 78 1 0 68 84 13 0.350 -0.03 1.00 Prom - 65840 65801 40 -4.16 2.00 Prom + 67626 67665 40 -3.36 2.01 Init + 75335 75398 64 1 1 99 94 53 0.879 6.31 2.02 Term + 76902 76978 77 1 2 83 41 54 0.465 -1.80 2.03 PlyA + 77770 77775 6 1.05 3.10 PlyA - 78636 78631 6 1.05 3.09 Term - 102400 99998 2403 1 0 98 39 886 0.365 70.08 3.08 Intr - 108382 107665 718 0 1 79 116 358 0.930 29.14 3.07 Intr - 111608 111503 106 0 1 66 75 105 0.988 6.27 3.06 Intr - 114083 113987 97 2 1 69 115 18 0.908 2.18 3.05 Intr - 117614 117451 164 0 2 103 96 105 0.958 12.49 3.04 Intr - 126837 126711 127 0 1 86 70 146 0.999 12.85 3.03 Intr - 129509 129409 101 0 2 92 115 29 0.941 5.83 3.02 Intr - 157504 157354 151 1 1 76 94 129 0.783 12.04 3.01 Init - 168625 168350 276 1 0 116 2 243 0.066 15.49 3.00 Prom - 169713 169674 40 -5.86 4.05 PlyA - 169727 169722 6 1.05 4.04 Term - 172552 172502 51 1 0 89 43 70 0.045 0.03 4.03 Intr - 174483 174348 136 2 1 57 37 44 0.000 -3.23 4.02 Intr - 203532 203450 83 0 2 92 109 48 0.600 5.74 4.01 Init - 214908 214672 237 0 0 58 85 357 0.832 30.41 4.00 Prom - 233111 233072 40 -2.96 5.00 Prom + 243035 243074 40 -4.86 5.01 Init + 244165 244333 169 0 1 92 107 134 0.623 15.40 5.02 Term + 247102 247145 44 2 2 133 36 28 0.597 -0.58 5.03 PlyA + 247175 247180 6 1.05 6.13 PlyA - 247276 247271 6 1.05 6.12 Term - 269592 269489 104 1 2 75 37 158 0.315 7.84 6.11 Intr - 273581 273445 137 1 2 67 15 48 0.222 -4.39 6.10 Intr - 273898 273646 253 2 1 47 97 138 0.317 7.09 6.09 Intr - 275969 275861 109 2 1 51 76 64 0.132 1.36 6.08 Intr - 288032 287924 109 1 1 43 90 146 0.020 10.59 6.07 Intr - 309874 309758 117 0 0 99 67 231 0.111 21.68 6.06 Intr - 312354 312236 119 0 2 54 30 273 0.998 17.36 6.05 Intr - 313632 313510 123 0 0 69 100 249 0.999 24.98 6.04 Intr - 314413 314312 102 1 0 105 91 168 0.987 19.17 6.03 Intr - 315646 315594 53 2 2 77 96 40 0.888 2.33 6.02 Intr - 317790 317763 28 0 1 90 79 35 0.193 0.49 6.01 Init - 331770 331684 87 0 0 92 42 151 0.448 9.54 6.00 Prom - 332533 332494 40 -3.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 168625 168290 336 1 0 116 41 287 0.902 22.83 S.002 Term - 309874 309754 121 0 1 99 47 247 0.886 19.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:25642032_25978222|GENSCAN_predicted_peptide_1|201_aa MTLKEHAAFKHLFNKAHLALPLIHLTLGPTAGSETSMIPSARMWDTALPQKPSLFCRATG EPTSEPVSPLEGKGQLEGPLLASPGPREVAAINDKSRRAGHPADGYLRMERHPARRQRAG EAWMEPASRLPGTRRRPYWEIGAINCVVLGIKQKGQKKPMGEIWESFEGSYGMIEESGNK RKTMAEKRQLFIEMLKSYKDP >gi568815596r:25642032_25978222|GENSCAN_predicted_CDS_1|606_bp atgactcttaaggagcatgctgccttcaagcatctgtttaacaaagcacatcttgcactg cccttaatccatttaaccctgggcccaactgccggctcagagacctccatgataccatct gctcgaatgtgggacacagccttgccacagaagccgtcattattctgccgggccactggg gagcccaccagtgagcctgtttcaccactagaaggcaaaggtcagctggaaggtcctctg ctggcatcaccagggcccagagaagtggctgcaattaacgataagtcacgtcgggccggg caccccgccgacggatacctgcggatggaaagacacccggcccgacgccagcgagcggga gaagcctggatggagccagcttcccgcctgcccggaacccgccggcgaccgtattgggaa attggagccataaattgcgtggttttggggataaagcagaaaggtcaaaagaagcctatg ggagaaatttgggagagttttgaaggctcctacgggatgattgaggaaagtgggaacaag cggaagaccatggcagagaagaggcagctgttcatagaaatgctgaaatcttacaaggac ccttaa >gi568815596r:25642032_25978222|GENSCAN_predicted_peptide_2|46_aa MAGGRAQWLMPVIPALWEAEAELRTEFDMKVMSTQICILAGQCITS >gi568815596r:25642032_25978222|GENSCAN_predicted_CDS_2|141_bp atggccggtggccgggcgcagtggctcatgcctgtaatcccagcgctttgggaggctgag gcagaattaagaacagaatttgacatgaaggttatgagtacacagatttgcatcttggca ggccagtgtatcacctcttaa >gi568815596r:25642032_25978222|GENSCAN_predicted_peptide_3|1380_aa MAGITTIEAVKHNIQVLQQQADDAEERAEHLQREAEGERRAWEQAEAEVASLNHRIQLVE EELDCAQEHVATALQKPEEAEKATDQSERDMKKDVPDGVKELSEGSEESSDGQSDSQSSE NSSSSSDGGSNKEGKKSRWKRKVSSSSPQSGCPSPTIPAGKVISPSQKHSKKALKQALKQ QQQKKQQQQCRPSISISSNQHLSLKTVKAASDSVPAKPGQMKRTKCADIDVETPDSILVN TNLRALINKHTFSVLPGDCQQRLLLLLPEVDRQVGPDGLMKLNGSALNNEFFTSAAQGWK ERLSEGEFTPEMQVRIRQEIEKEKKVEPWKEQFFESYYGQSSGLSLEDSKKLTASPSDPK VKKTPAEQPKSMPVSEASLIRIVPVVSQSECKEEALQMSSPGRKEECESQGEVQPNFSTS SEPLLSSALNTHELSSILPIKCPKDEDLLEQKPVTSAEQESEKNHLTTASNYNKSESQES LVTSPSKPKSPGVEKPIVKPTAGAGPQETNMKEPLATLVDQSPESLKRKSSLTQEEAPVS WEKRPRVTENRQHQQPFQVSPQPFLNRGDRIQVRKVPPLKVSPRARFPVSITSPNRTGAR TLADIKAKAQLVKAQRAAAAAAAAAAAAASVGGTIPGPGPGGGQGPGEGGEGQTARGGSP GSDRVSETGKGPTLELAGTGSRGGTRELLPCGPETQPQSETKTTPSQAQPHSVSGAQLQQ TPPVPPTPAVSGACTSVPSPAHIEKLDNEKLNPTRATATVASVSHPQGPSSCRQEKAPSP TGPALISGASPVHCAADGTVELKAGPSKNIPNPSASSKTDASVPVAVTPSPLTSLLTTAT LEKLPVPQVSATTAPAGSAPPSSTLPAASSLKTPGTSLNMNGPTLRPTSSIPANNPLVTQ LLQGKDVPMEQILPKPLTKVEMKTVPLTAKEERGMGALIATNTTENSTREEVNERQSHPA TQQQLGKTLQSKQLPQVPRPLQLFSAKELRDSSIDTHQYHEGLSKATQDQILQTLIQRVR RQNLLSVVPPSQFNFAHSGFQLEDISTSQRFMLGFAGRRTSKPAMAGHYLLNISTYGRGS ESFRRTHSVNPEDRFCLSSPTEALKMGYTDCKNATGESSSSKEDDTDEESTGDEQESVTV KEEPQVSQSAGKGDTSSGPHSRETLSTSDCLASKNVKAEIPLNEQTTLSKENYLFTRGQT FDEKTLARDLIQAAQKQMAHAVRGKAIRSSPELFSSTVLPLPADSPTHQPLLLPPLQTPK LYGSPTQIGPSYRGMINVSTSSDMDHNSAVPGSQVSSNVGDVMSFSVTVTTIPASQAMNP SSHGQTIPVQAFSEENSIEGTPSKCYCRLKAMIMCKGCGAFCHDDCIGPSKLCVSCLVVR >gi568815596r:25642032_25978222|GENSCAN_predicted_CDS_3|4143_bp atggctgggatcaccactattgaggcagtgaagcacaatatccaggttctgcagcagcag gcagatgatgcagaggagagagctgagcacctccagcgagaagctgaaggagaaaggcgc gcctgggaacaggctgaggctgaggtggcctccttgaaccataggatccagctggttgaa gaggagctggactgtgctcaggagcacgtggccactgccctgcaaaagccggaagaagcg gaaaaagctactgatcagagtgagagagatatgaagaaagatgtgccggatggggtgaaa gagctgtcagaaggttcagaagaaagcagtgatggtcagtcagattcccagagttctgag aacagcagcagcagcagtgatggtggcagcaacaaggagggaaaaaagagcaggtggaaa aggaaagtatcgtcgtcctccccgcagtcaggctgcccatcacccaccattccagcaggt aaagtcatttctccatcacagaagcacagcaagaaggcactaaagcaggcgctaaagcag caacagcagaagaagcagcagcagcaatgcaggccaagcatatccatctcctccaaccag catctctcactaaagactgtcaaagcagccagtgactctgtacctgccaaacctggacaa atgaaaagaactaaatgtgctgacattgacgttgagacaccggactccattctggttaat acaaatctgcgagcactgatcaacaagcacacattttcagtccttcctggagattgccag caacgactgcttttactactcccagaggtagatcgacaggttggtccagatggtttaatg aagttaaatggctcagcccttaacaatgaattcttcacttcagcagcccaaggctggaag gaaagactctcagaaggtgagtttacacctgagatgcaggtgagaattcgacaagagatt gagaaggagaaaaaagtggagccatggaaagaacaattctttgaaagctactatgggcag agttctggcctgagccttgaagattctaagaaattgacagcttctcccagtgatcccaaa gtaaagaaaaccccagctgaacaaccaaaatccatgcctgtgtcagaggcctctcttatc agaatagttccagtagtctcccagtcagagtgtaaagaagaagcattgcaaatgtcatca ccaggcagaaaagaagagtgtgaaagccaaggtgaagtgcagccgaacttctccacatct tcagagcccctgctttcctcagctctcaatacacatgagcttagcagcattcttcccatc aagtgcccaaaggatgaggatctcttggagcagaagccagtcacctctgctgaacaggaa tctgagaagaaccatctcaccacagcttctaattataacaaaagtgaaagccaagaatct ttagttacatcgccaagcaaacccaagagtcctggggttgaaaaaccaatagtgaagccc acagcaggagcgggtccacaggagactaatatgaaagaacctctagcaactcttgttgat cagagcccagaaagcctcaagaggaagtcttccctcacccaagaagaggcccctgtgagc tgggagaagaggccacgtgtcactgagaatcgccagcaccagcagccatttcaggtctca ccacagccctttctcaatagaggggacagaatccaggtgcgaaaagtaccacctctcaag gtctctcccagggctcgttttccagtctccatcactagtcctaacagaacaggagccaga actcttgcagacatcaaagcaaaagcccaactggtcaaagcacagagggcagcagctgcc gctgccgccgcagctgctgcagccgcctcagttggagggaccattccaggacctggccca gggggtggacaaggtccaggagagggtggtgaagggcagactgctagaggaggcagtcca ggctcagacagagtcagtgaaactggaaagggccccacactggaactggcaggaactgga agcaggggaggtacgagagagcttttaccctgtggtccagagactcagccccagtctgag accaagaccaccccaagccaggcacagcctcatagtgtctctggagcacaactacagcaa acccccccagtgcctccaacacctgccgtcagtggagcatgcacaagtgtcccatcacca gcccacatagagaaattggataatgaaaaactgaaccccaccagagcaacagccacagtg gcctctgtcagccatccacaagggcccagtagttgcagacaggagaaagcaccttctcca acaggtcctgctctaatctcaggtgcctcacctgttcattgtgcagctgatggcacagtt gagctcaaagcaggtcctagtaagaatatacctaacccttcagcctcatcaaagacagat gctagtgtgccagtggctgtaactccctcccctttaacatctttattgaccacagccact ttagaaaagcttcctgtaccccaggtcagtgcaactacagcacctgctggatcagctcca ccctcgagcactttgccagcagcttctagccttaaaaccccaggaacttctttaaacatg aatggacccactttaagaccaacctctagtatccctgctaataatcctttagtgactcag ctgcttcaaggcaaagatgttcccatggagcaaattctgcctaaacctctcaccaaagtt gaaatgaaaacggttccactgactgcaaaagaggaaagggggatgggagcgctcatagct accaacacaacagaaaatagcaccagagaggaagttaatgagagacagtcccatccagct acgcagcagcagctgggcaaaaccttgcaaagtaagcagctcccccaggttccaaggccc cttcagctcttttcagctaaggagctgagggactccagcattgacacacaccaataccac gaaggactaagtaaagcaacccaagatcagatccttcagactctcattcagagggttcgg aggcagaatcttctctcagttgtgccgccctcacagttcaacttcgctcactcaggtttc cagctggaagacatctccacaagccagaggttcatgctgggttttgctggcagaaggaca tccaaacctgcaatggcagggcactacttactgaatatttctacctacggccggggctca gagagctttaggaggacccattctgtaaaccctgaagatcgtttttgtctaagcagcccc actgaagccttgaaaatgggatatacagactgtaaaaatgcaacaggagagagtagcagc agcaaagaagatgacactgatgaggaaagtactggtgatgagcaggaatctgtcacagtg aaagaggagccccaggtttcccagagtgctggcaagggtgacacaagttcaggacctcac agcagggaaactctatctaccagtgattgcttagctagcaagaatgtgaaggctgagata ccattgaatgagcaaaccactttaagtaaggagaattacctgttcactagaggccaaaca tttgatgaaaagaccctagccagagatttaattcaggcagcacagaagcagatggctcat gcagtgagaggtaaggcaatccgtagcagccccgagcttttcagttctactgttcttcct ctgcctgcagacagccccacccaccagcccctactccttccacccctgcaaaccccgaag ttgtatggaagccccacccagatagggccaagctatagaggcatgatcaatgtctccacc tcatctgacatggaccataactctgctgtaccaggtagccaggtatctagcaatgtaggt gatgtcatgtcattttcagtgactgtcactaccatccctgctagccaagctatgaatccc agcagccatggccagaccattcctgttcaggcgttctccgaagagaacagcatagagggc acgccttcgaaatgttactgccgcttgaaagccatgatcatgtgcaaaggctgtggcgct ttctgccatgatgattgcatcggcccctccaaactgtgcgtctcctgccttgtcgttcgg taa >gi568815596r:25642032_25978222|GENSCAN_predicted_peptide_4|168_aa MRYRTLSQLPPRLVSIEDPFDQNDWATWTSFLSGVDIQIVGDDLTVTNPKRIAQSFEKKV CSCLLLKVNQIGSVTESIQVLEKYPNTPMSHKEILQVIQREGLKEIRSQPNRQILASASV MSLISLIKSSLVAKTNIIVAGKYISLTVTYCKNPYLFNPNLLSMDKKN >gi568815596r:25642032_25978222|GENSCAN_predicted_CDS_4|507_bp atgaggtataggacactgtcccagctgccacctagactcgtctccatcgaggaccccttt gaccagaatgactgggccacttggacctcgttcctctcaggggtggacatccagattgtg ggggatgacctgacagtcaccaaccccaagaggattgcccagtcctttgagaagaaggtc tgcagctgtctgctgctgaaggtcaaccagatcggctcggtgactgaatcgatccaggtc ttagaaaaataccccaatacacccatgagtcataaagaaattcttcaagttatccagaga gaaggactaaaagaaatcaggagtcagccaaacaggcaaattttagcctctgcttccgtc atgtctcttatttcattgatcaaatcaagccttgtggccaagacgaacatcattgtggca gggaaatatatttcacttactgtaacgtactgcaagaacccctacctcttcaaccccaat ctcttgagcatggacaagaaaaattag >gi568815596r:25642032_25978222|GENSCAN_predicted_peptide_5|70_aa MVAGMEATNGPNTVGSHSPLNFYLLLLLNVQSASKKDENQPPDLTPSLKYTNQPLGGVDP EGTSNELPTC >gi568815596r:25642032_25978222|GENSCAN_predicted_CDS_5|213_bp atggtggcaggaatggaggctacaaatggacccaacaccgtgggttcccactcaccactg aacttttacctactactgctactgaatgtccaatctgccagcaagaaagatgaaaaccaa ccccctgatttgacaccatccctcaagtacaccaatcagccacttggaggtgttgatcct gaaggcacctctaatgaacttcccacatgctag >gi568815596r:25642032_25978222|GENSCAN_predicted_peptide_6|446_aa MGLPILLLIALFLCPGLLLVTREYELWKQVPQMIGPHDSGLLLPKSDSTPCFEIPQAMES KLLIGGRNIMDHTNEQQKMLELKRQEIAEQKRREREMQQEMMLRDEETMELRGTYTSLQQ EVEVKTKKLKKLYAKLQAVKAEIQDQHDEYIRVRQDLEEAQNEQTRELKLKYLIIENFIP PEEKNKIMNRLFLDCEEEQWKFQPLVPAGVSSSQMKKRPTSAVGYKRPISQYARVAMAMG SHPRYRPQLYALALFLFTVHALYSLGSGLFAKSKRLSQEQSQDPGAQLASPSGSPTGAAG GAACQSRAVCLHSSALGWSMGLGAVEQEAALIGEARAAQEPTEWRGGSGMVGCRSRALPG GKAAKARASRPAALSAGSAGATPTRNSRWPASTAYSPGSRPRLSLHTSPQAEVYVYRYRY LYLYLSHAEPKLDCTAAISAHCNLPA >gi568815596r:25642032_25978222|GENSCAN_predicted_CDS_6|1341_bp atgggcctccctatcctactgctgattgcactgtttttgtgtcctggattgttgctggta accagggagtacgaattatggaagcaggttccccagatgatcgggccccatgacagtggt ctcctgctccccaagtctgacagcaccccctgctttgagatccctcaggccatggagagc aagctcctcatcgggggcaggaacatcatggatcacaccaacgaacagcagaagatgttg gaactgaagaggcaggagattgccgagcagaaacgtcgtgagcgggagatgcagcaggag atgatgctccgggacgaggagactatggagctccggggcacctacacatccctgcagcag gaggtggaggtcaaaaccaagaaactcaagaagctctacgccaagctgcaggcggtgaag gcggagatccaggaccagcatgatgagtatatccgcgtgcggcaggacctggaggaggcg cagaacgagcagacccgcgaactcaagctcaagtacctaatcatcgagaacttcatcccg ccggaggagaagaacaagatcatgaaccggcttttcctggactgtgaggaggagcagtgg aagttccagccactggtgccagccggcgtcagtagcagccagatgaagaagcggccaaca tctgcagtgggctacaagaggcctatcagccagtatgctcgggttgccatggcaatgggg tcccaccccaggtacaggccacagctgtatgcactggccctgttcctgttcacagtgcat gctctgtattccttaggctctggtctctttgcaaagtccaagcgtctcagccaggagcag agccaagacccaggagcccagctggcttcacccagtggatcccccacgggggctgcaggt ggagctgcctgccagtcccgcgccgtgtgcctgcactcctcagcccttggatggtcgatg ggactgggcgccgtggagcaggaggcggcgctcatcggggaggctcgggccgcacaggag cccacggagtggcggggaggctcaggcatggtgggctgcaggtcccgagccctgcccggc gggaaggcagctaaggcccgggccagccggccggctgctctgagtgcagggtccgccggg gccacgcccacccggaactcgcgctggcccgcaagcaccgcgtacagccccggttcccgc ccgcgcctctccctccacacctccccgcaagctgaggtctacgtctaccgctaccgctac ctctacctctacctctcccatgccgagccgaagctggactgtactgctgccatctcggct cactgcaacctccctgcctga