GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:32:36

Sequence gi568815592f:50718892_50943389 : 224498 bp : 38.20% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    199    259   61  0  1   64  107    62 0.362   3.09
 1.02 Intr +   3190   3244   55  2  1   36   59    81 0.599  -2.88
 1.03 Intr +   4455   4717  263  0  2   76   36   277 0.891  17.51
 1.04 Intr +   9965  10130  166  0  1  103  116    57 0.967   8.10
 1.05 Intr +  10303  10421  119  1  2   69   86    56 0.913   2.69
 1.06 Intr +  26216  26357  142  0  1   64   82    97 0.877   5.19
 1.07 Term +  53754  53973  220  0  1   81   37   190 0.927   8.73
 1.08 PlyA +  54483  54488    6                               1.05

 2.00 Prom +  56332  56371   40                              -4.45
 2.01 Init +  90753  90870  118  2  1   76   74    51 0.310   2.92
 2.02 Term +  97306  97466  161  2  2   67   42   182 0.753   8.62
 2.03 PlyA +  97477  97482    6                              -0.45

 3.00 Prom +  97525  97564   40                              -4.15
 3.01 Init + 100034 100081   48  1  0   90  105    78 0.905  10.70
 3.02 Intr + 104516 104974  459  1  0  129   91   384 0.871  35.25
 3.03 Intr + 106980 107432  453  2  0   33   15   207 0.009   0.43
 3.04 Intr + 109728 109788   61  2  1   42   86    72 0.014  -0.21
 3.05 Intr + 117170 117389  220  0  1   97   86   150 0.195  12.14
 3.06 Intr + 119084 119202  119  2  2   49  121    75 0.982   6.09
 3.07 Intr + 121265 121406  142  0  1  106   89    85 0.998   8.89
 3.08 Term + 124201 124501  301  1  1   97   48   304 0.980  21.01
 3.09 PlyA + 124782 124787    6                               1.05

 4.00 Prom + 125441 125480   40                             -10.65
 4.01 Init + 127026 127237  212  2  2   51   44   266 0.890  14.81
 4.02 Intr + 131526 131698  173  0  2  101    0   102 0.413   1.36
 4.03 Term + 132579 132865  287  1  2   31   41   231 0.668   7.28
 4.04 PlyA + 133008 133013    6                               1.05

 5.02 PlyA - 133733 133728    6                               1.05
 5.01 Sngl - 138771 138364  408  0  0  102   41   364 0.956  29.14
 5.00 Prom - 152132 152093   40                              -4.85

 6.07 PlyA - 152860 152855    6                               1.05
 6.06 Term - 156967 156853  115  0  1   93   53    84 0.220   2.46
 6.05 Intr - 183895 183703  193  2  1   51  101    73 0.439   2.53
 6.04 Intr - 184339 184230  110  0  2   79   53    80 0.494   2.61
 6.03 Intr - 185483 185387   97  0  1   83   44    52 0.489  -1.55
 6.02 Intr - 189437 189273  165  0  0   47   82   129 0.625   7.21
 6.01 Init - 194365 194137  229  1  1   90   34   340 0.547  27.38
 6.00 Prom - 197434 197395   40                              -3.65

 7.03 PlyA - 198189 198184    6                               1.05
 7.02 Term - 199233 198589  645  0  0  -19   48   250 0.706   3.43
 7.01 Init - 200155 199523  633  1  0   30   41   254 0.691  10.29
 7.00 Prom - 200650 200611   40                              -6.15

 8.02 PlyA - 200819 200814    6                               1.05
 8.01 Sngl - 202063 201635  429  1  0   88   47   421 0.919  34.03
 8.00 Prom - 218908 218869   40                              -2.75

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  28008  28129  122  0  2   32   75    89 0.949   1.22
S.002 Term +  28679  28818  140  0  2  109   38    89 0.929   3.14

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:50718892_50943389|GENSCAN_predicted_peptide_1|341_aa GSVEAQCGLVLNGQGGVIRRACNEGPIAKECVKEGAAARKAQSLGIRDGESRGFFQQLLK EYNWRSVGSLALLPLSVLFGFQMDYLESLLSLAKTHLYGSSIMGEDRARDTPQASPASLV PSADSAGGTCVVNPTDLFCSVPGRLSLLSSTSKYKVTIAEVKRRLSPPECLNASLLGGIL RRAKSKNGGRCLREKLDRLGLNLPAGRRKAANVTLLTSLVEGEALHLARDFGYTCETEFP AKAVGEHLARQHMEQKEQTARKKMILATNLITHGFGTPAICAALSTFQTVLSEMLNYLEK HTTHKNGGAADSGQGHANSEKAPLRKTSEAAVKEGKTEKTD >gi568815592f:50718892_50943389|GENSCAN_predicted_CDS_1|1026_bp ggctctgtggaggcccagtgtgggcttgttctcaatggccaaggtggagtgataagaaga gcctgtaatgaaggcccgatagccaaagaatgcgtgaaagagggggcagcagctaggaag gcccagagccttggaatccgggatggagaaagccgtggattctttcagcagctcctgaag gagtacaattggaggagcgtgggaagccttgctcttcttcctttgtcagtactcttcgga ttccagatggattacctggaatcactactctcgctagccaaaactcatctctatggttca tccattatgggtgaagatagagccagagatacacctcaagcctcgcctgcctccttagtc cccagtgcagattctgcaggtggcacctgtgtggtcaaccccacagacttattttgctct gtccctggccgtttgtcccttcttagttctacttccaaatacaaggtgaccattgctgag gtaaagaggcgcctctccccacctgagtgcctcaatgcttcactcttgggaggcattttg agaagagcaaaatcaaagaatgggggccgctgcctgagagagaaattggataggcttggc ttaaacttaccagcaggaagacggaaagcagctaatgtcaccctccttacttccttggtt
gaaggggaggctttgcacttggctcgggattttggctacacttgtgaaacagagtttcca gccaaagcagtaggagaacatcttgccagacaacatatggaacagaaagaacagacagca agaaaaaagatgatcctggcgaccaatttgatcactcatggctttgggactccggcaata tgtgcagctctaagcactttccaaacagttctcagtgaaatgctgaactacttggaaaaa cacactactcacaagaacggcggagcggcggattctggccaaggacatgccaactcggag aaagctcccctgcggaaaacttcagaggctgccgtgaaagagggcaaaacagaaaagaca gactag >gi568815592f:50718892_50943389|GENSCAN_predicted_peptide_2|92_aa MTMKHAAFKHLFNKAHLAPPLIHLTLSGHSRCFREHGVGGFQERKETFAEKEIKETCRVC VYKIPTEGAVARSMTGNGPWERFTLQCLGLFE >gi568815592f:50718892_50943389|GENSCAN_predicted_CDS_2|279_bp atgacgatgaagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccgccc ttaatccatttaaccctgagtggacacagcagatgtttcagagagcatggggttgggggt tttcaagaacggaaagaaacctttgccgaaaaagaaatcaaggagacttgcagggtttgc gtctataaaattccgacagagggcgctgttgcccgttcaatgacagggaacggtccgtgg gaacggtttaccttgcaatgtctaggacttttcgaataa >gi568815592f:50718892_50943389|GENSCAN_predicted_peptide_3|600_aa MLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLGSVSQGPYSSAPPLSHTPSSDFQPPYFP PPYQPLPYHQSQDPYSHVNDPYSLNPLHQPQQHPWGQRQRQEVGSEAGSLLPQPRAALPQ LSGLDPRRDYHSVRRPDVLLHSAHHGLDAGMGDSLSLHGLGHPGMEDVQCKGHTLLLSLE GNSKDRDGLCLRGDDRPPKARVKLTDAGQVTKTRGYVSRTTASNPKASNHLLNTEGSTSE LGSWGAGAKVTQAGWDFNALPTAIEWVACTGPVFKRACTGPVFKRTRECLGLDLKVTCSD LKDLVCVPDPLEPGSWQTLRSVEDANNSGMNLLDQSVIKKVPVPPKSVTSLMMNKDGFLG GMSVNTGEVFCSVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNG GRSLRERLEKIGLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVSEYL NRQHTDPSDLHSRKNMLLATKQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTHFSL ITHGFGAPAICAALTALQNYLTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEKHRK >gi568815592f:50718892_50943389|GENSCAN_predicted_CDS_3|1803_bp atgctctggaagcttgtggagaatgtcaagtacgaagatatctatgaggaccggcacgat ggtgtcccgagccacagctcgcggctctcccagctgggctcggtgtcccaaggaccctac tcgagcgccccgccgctgtcccacaccccgtcgtcggacttccagccgccctacttccca cccccctaccagccgctcccctaccaccagagccaggacccctactcccacgtcaacgac ccctactccctgaacccactgcaccagccccagcaacatccctgggggcaacggcagcgg caagaagtgggttcggaagccggctctctcctgccccagcctcgggccgccttgccccag 
ctctcgggccttgacccccggagggactaccactcggtccgccggccggacgtgctgctg cattcggcgcaccacggcctggacgcgggcatgggtgacagcctctcgctgcacggcctc ggccatcccggaatggaagacgtccagtgcaagggacacaccctgctcctgtccctagag ggaaacagcaaagacagggacggtctctgcctccgcggtgatgacagacctcccaaggcc agggtgaagttgactgatgcagggcaggtgacaaagaccaggggttatgtttccaggacc acagcctcgaatccgaaggcttccaatcatctgctcaacacagaaggatccacttccgag ttaggaagctggggtgcgggggctaaagttacacaagcaggttgggatttcaacgctctc cctacagctatcgaatgggtggcctgcactggtccagtctttaaaagggcctgcactggt ccagtctttaaaaggaccagagagtgcctaggcctggaccttaaagtcacttgttccgat ctcaaggacctcgtgtgtgttccagatccccttgagccagggagctggcagacactgagg tcagttgaagatgccaataacagcggcatgaatctattggaccagtctgtcattaaaaaa gttccagttcctcccaaatcggtgacttctctaatgatgaataaagacggcttcctggga ggcatgtctgtcaacaccggcgaggtgttttgctccgtcccaggccgtttgtctctgctc agttcaacttcgaagtacaaagtaactgtgggagaagttcagagacggctgtcgccccct gaatgcctcaatgcatctctcctcggcggagtcctcagaagagccaaatcgaaaaatggg gggagatctttgcgagaaaggctagaaaaaatcggtttgaatttacccgcgggcaggcgc aaagcagcaaatgtcacgttactcacctccctggtggaaggagaagctgttcacttagct agggattttgggtacatttgcgaaacggagtttcccgccaaagccgtctctgagtatttg aaccggcagcacacagacccgagtgacctgcactcccgaaagaatatgctgttggccacc aagcaactttgtaaagaatttacggatctactggcgcaggaccggacaccgatagggaac agccgacccagccccatcctggagccggggatccagagctgcctcacgcacttcagcctc atcacgcacggcttcggcgccccggccatttgcgccgcgctcacggccctgcagaactat ctcaccgaggcgctcaaaggcatggacaagatgttcttgaacaacaccaccactaacagg cacacgtctggggaaggcccaggtagtaaaactggcgacaaggaggagaaacacaggaaa tga >gi568815592f:50718892_50943389|GENSCAN_predicted_peptide_4|223_aa MGRRPVGCRWGGLPAPPAATRGAGSWRESGLEISKESEASPPPPLPVRKHPTVTILKRGD GMGIVLLGKAGFPELNHRHQLGDWVITHAPGTGTAGQRTARAPRQPASSRRTRLREPASR HALQVAKQVWKLPKPDRSTAQNRVEDRVQSLSPSTCHVNHKALNTGNSASGLSIRCPRAQ KHLIARKAGGGYLGKESATRNSSFSDLGNPIFRQFLHSVGFYF >gi568815592f:50718892_50943389|GENSCAN_predicted_CDS_4|672_bp atggggcggcgcccggtgggctgcaggtggggagggctcccagcgccgccagcggccacc cggggcgccggctcctggagggagagtggcctagaaatatccaaggaatccgaagcttcc 
cctcctccacctctgccagtacggaaacaccccaccgtcacaatcctaaagcggggagat gggatgggaattgtcttacttggcaaggcagggtttccagagctcaatcaccgacaccag ttgggagactgggtaataacacacgctccgggcacagggaccgcgggccaacgaaccgcg cgtgcgccgcgccagcctgcgtcgagccgtcgcacacggctccgggagcccgcgtctagg cacgctctccaggttgccaagcaggtgtggaaactccccaaaccagacaggtctactgcg cagaatagggtggaggatcgagtccagtcactgagccctagcacttgccatgtgaaccat aaggccctgaacactggaaattcagcgtccggactctcgatcagatgccctagagctcag aagcacctgattgcgcggaaggctggtggtggttacttgggcaaagagagtgcgactcga aactcgagtttctcggatttaggaaatccgatctttagacagtttctacacagcgttgga ttttacttctaa >gi568815592f:50718892_50943389|GENSCAN_predicted_peptide_5|135_aa MGHVCTKTMKKAAWVIIEKYYMHLGNDFHTNKHMCKEIAIIPSKKLHNKTAGYVTHLMKQ IQRGPVRGISIKLQEEVRERRDNYVPEISALDQEIIEVDPDTKEMLKLLDFGGLSNLQVT QPTVGMNFKMPRGPV >gi568815592f:50718892_50943389|GENSCAN_predicted_CDS_5|408_bp atgggccacgtttgcaccaaaaccatgaagaaggctgcctgggtcatcatagaaaagtac tacatgcacctgggcaacgacttccacacaaacaagcacatgtgcaaggagattgccatt atccccagcaagaagctccacaacaagacagcaggttatgtcacccatctgatgaagcag attcagagaggcccagtaagaggtatctccatcaagctgcaggaggaggtgagagaaagg agagacaattatgttcctgagatctcagccctggatcaggagatcattgaagtagatcct gacactaaagaaatgctgaagcttttggactttggtggtctgtctaacctgcaggtcact cagcctacagttgggatgaatttcaaaatgcctcggggacctgtttga >gi568815592f:50718892_50943389|GENSCAN_predicted_peptide_6|302_aa MTASNSQVRQNYHQDSEAAINRQINLELYASYFNLSMSYYFDRDDAALKNFAKYFLHQSH EEREHAEKLMKLQNQQGGSTGMFNSDNANTSLQRRENFPLGFEGSADFQWAEVEEQSEQR EHHKQRSECEKGFRSHQSTGFTLTVEDLVEAVFKSVELVKLPIQSSCLLKQWLPALENLL SPSSKAPGVQLYQRTYHTDSGLQKLFLLKTAYKILSLFKAQYNTGKAMEKKVLSIFSPLK VVLQCLNTTGGALGLSKVSAISKRMCFQGIQKELFPTAPNSICFPHPVPTRGAAEAQSQP DA >gi568815592f:50718892_50943389|GENSCAN_predicted_CDS_6|909_bp atgactgcatccaactcgcaggtgcgccagaactaccaccaggactcagaggccgccatc aaccgccagatcaacctggagctctacgcctcctactttaacctctccatgtcttactac tttgaccgtgatgatgcggctttgaagaactttgccaaatactttcttcaccaatctcat gaggagagggagcatgctgagaaactgatgaaactgcagaaccaacaaggagggagcact ggaatgtttaattcggacaatgccaatacaagtttacagagaagagagaactttccattg 
ggttttgaaggttctgctgattttcaatgggcagaggtggaggaacagtctgagcagaga gaacatcataaacaaaggagtgaatgtgagaaaggcttcaggtcccatcagagtactggc ttcactttgactgtagaagatttagttgaagcagtattcaaaagtgtagaacttgtcaag ctacctatccagtcctcttgtcttctgaaacagtggttacctgctctggaaaaccttctc tctccatcctccaaagctcctggggtacagctttatcagcgcacctatcacactgattct ggtctccaaaaactcttcttgctgaaaacagcatacaaaatactttccctcttcaaagcc caatacaacactggaaaagcaatggagaagaaagtactcagtatcttttccccactaaaa gttgtactgcaatgcttgaacacaactggaggagccctaggtttatcaaaagtatctgcc atttctaaaagaatgtgtttccaaggcatccagaaagagctctttccaactgcaccaaat tccatttgctttccacaccctgtcccgacaaggggtgctgcagaagcacaaagccagcca gatgcatga >gi568815592f:50718892_50943389|GENSCAN_predicted_peptide_7|425_aa MRTKTQHTRISGTFKTVCRGKFIALNAHKRKQERSKIDTLTSQLKQLERQEETHSKASRR QEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRLLARLIKKKREKNQIDPIKNDK GDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPVTGSE IVAIINSLPTKKSPGPDRFTATFYQRYKEELHINRTKDKNHMIISIDAEKAFDNIQQHFM LKTLNNLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGYTLSPLLFHIVLE VLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKI NVQKSKAFLYTNNKQTESQIMSELPFTIASKRIKYLGIQLTRHVKDLFKENYKPLLNEIK EDTKK >gi568815592f:50718892_50943389|GENSCAN_predicted_CDS_7|1278_bp atgagaacaaagacacaacataccagaatctctgggacattcaaaacagtgtgtagaggg aaatttatagcactaaatgcccacaagagaaagcaggaaagatccaaaattgacacccta acatcacaattaaaacaactagaaaggcaagaggaaacacattcaaaagctagtagaagg caagaaatcactaagatcagagcagaactgaaggagatagagacacaaaaaacccttcaa aaaatcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagactgcta gcaagactaataaagaagaaaagagagaagaatcaaatagacccaataaaaaatgataaa ggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacac ctctatgcaaataaactagaaaatctagaagaaatggataaatttctcgacacatacacc ctcccaagactaaaccaggaagaagttgaatctctgaatagaccagtaacaggctctgaa attgtggcaataatcaatagcttaccaaccaaaaaaagtccaggaccagatagattcaca gccacattctaccagaggtacaaggaggaactgcatataaacagaaccaaagacaaaaac cacatgattatctcaatagatgcagaaaaggcctttgacaacattcaacaacacttcatg ctaaaaactctcaataacttaggtattgatgggatgtatctcaaaataatacgagctatc 
tatgacaaacccacagccaatatcatactgaatgggcaaaaactggaggcattccctttg aaaactggcacaagacagggatacactctctcaccactcctattccacatagtgttggaa gttctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagag gaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtc tcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatc aatgtacaaaaatcaaaagcattcttatacaccaataacaaacaaacagagagccaaatc atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaaggcatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaagaaatga >gi568815592f:50718892_50943389|GENSCAN_predicted_peptide_8|142_aa MGKKQRRKTENSKNQRASPPPKECSSSPATEQSWMEKDFDELREGGFRRSNYTKLKEDVQ NHHKEAKNLQKRLDEWLTRITNVEKSINDLTELKTMARELRDECTSLSSQLNQLEERVSV TEDQMNGLKQEEKFREKKNKKK >gi568815592f:50718892_50943389|GENSCAN_predicted_CDS_8|429_bp atggggaaaaaacagagaagaaaaactgaaaattctaaaaatcagagggcctctcctcct ccaaaggaatgcagctcctcaccagccacggaacaaagctggatggagaaagactttgac gagttgagagaaggaggcttcagacgatcaaactacaccaagctaaaggaggacgttcaa aaccatcacaaagaagctaaaaaccttcaaaaaagattagatgaatggttaactagaata accaatgtagagaagtccataaatgacctgacggagctgaaaaccatggcacgagaacta cgtgatgaatgcacaagcctcagtagccaactcaatcaactagaagaaagggtatcagtg actgaagatcaaatgaatggattgaagcaagaagagaagtttagagaaaaaaagaataaa aagaaatga