GENSCAN 1.0	Date run: 6-Nov-116	Time: 08:53:16

Sequence gi568815588f:102544618_102757351 : 212734 bp : 48.57% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5353   5489  137  1  2  112   91    74 0.843  10.31
 1.02 Intr +  47965  48107  143  2  2   92   94   108 0.970  11.87
 1.03 Intr +  49019  49104   86  1  2   40   73   139 0.997   6.22
 1.04 Intr +  49376  49448   73  2  1  118   55    35 0.907   2.61
 1.05 Intr +  52523  52676  154  1  1   44   81   278 0.855  22.35
 1.06 Intr +  54816  54927  112  1  1   46  105   138 0.936  10.74
 1.07 Intr +  70651  70785  135  1  0   85  105   189 0.995  19.98
 1.08 Intr +  72673  72811  139  1  1   97   52    98 0.940   7.57
 1.09 Intr +  74170  74257   88  0  1   64   94     7 0.395  -1.56
 1.10 Intr +  81359  81460  102  0  0   97   80    18 0.696   2.05
 1.11 Intr +  82558  82626   69  2  0  112   74    15 0.701   1.65
 1.12 Intr +  82865  83040  176  0  2   93   56    93 0.907   6.26
 1.13 Term +  85449  85538   90  2  0   92   38   120 0.886   5.12
 1.14 PlyA +  85628  85633    6                               1.05

 2.00 Prom +  88188  88227   40                              -0.06
 2.01 Init +  89117  89216  100  1  1   81   94    39 0.210   2.26
 2.02 Intr +  99342  99526  185  1  2   79   64    -9 0.140  -4.69
 2.03 Intr +  99779 100574  796  1  1   48   19  1295 0.714 109.64
 2.04 Intr + 104550 104661  112  1  1  -15   58   122 0.042  -1.66
 2.05 Intr + 106248 106323   76  0  1  112   68   -25 0.066  -2.58
 2.06 Intr + 110036 110188  153  1  0   80   92   234 0.227  23.27
 2.07 Intr + 110463 110696  234  2  0  106   86   555 0.999  54.99
 2.08 Intr + 111489 111520   32  2  2  136  115    48 0.999   9.23
 2.09 Intr + 111653 111768  116  2  2   77  100   195 0.974  19.69
 2.10 Term + 112130 112737  608  0  2   79   42   533 0.999  42.48
 2.11 PlyA + 113462 113467    6                               1.05

 3.14 PlyA - 113638 113633    6                               1.05
 3.13 Term - 120890 120822   69  2  0  135   50    12 0.245   0.04
 3.12 Intr - 124217 124192   26  0  2   72   96    30 0.217  -0.06
 3.11 Intr - 125427 125355   73  0  1  111   34    44 0.106   0.28
 3.10 Intr - 131579 131524   56  0  2   75   73    39 0.150  -0.20
 3.09 Intr - 132981 132824  158  2  2   75   70    61 0.797   2.65
 3.08 Intr - 133609 133463  147  0  0   87   66    72 0.887   4.25
 3.07 Intr - 134989 134916   74  1  2   88  107    34 0.816   3.50
 3.06 Intr - 141384 141199  186  0  0   73   94   162 0.954  15.19
 3.05 Intr - 145326 145276   51  0  0  130   99    13 0.979   5.70
 3.04 Intr - 154872 154756  117  0  0  114   89    54 0.977   8.76
 3.03 Intr - 160872 160729  144  0  0   45   99   129 0.942  10.18
 3.02 Intr - 169885 169711  175  0  1   46   57    85 0.230   1.14
 3.01 Init - 177429 177380   50  0  2   61   75    71 0.290   3.52
 3.00 Prom - 179418 179379   40                              -8.56

 4.00 Prom + 180427 180466   40                              -5.66
 4.01 Init + 182020 182180  161  0  2   93   94   154 0.196  15.81
 4.02 Intr + 182370 182540  171  0  0   75   75   285 0.991  24.96
 4.03 Intr + 183814 183912   99  1  0  100   91   123 0.992  13.03
 4.04 Intr + 184702 184777   76  1  1  107   86    89 0.999  10.12
 4.05 Intr + 185106 185191   86  2  2  108   73   100 0.980   9.12
 4.06 Intr + 187106 187166   61  2  1   61   88    50 0.955   1.04
 4.07 Intr + 187535 187601   67  1  1  101   43    45 0.829  -0.22
 4.08 Intr + 188242 188291   50  2  2  113   91    33 0.884   4.50
 4.09 Term + 191245 191430  186  0  0  118   43    67 0.808   2.69
 4.10 PlyA + 192538 192543    6                               1.05

 5.00 Prom + 192720 192759   40                              -4.36
 5.01 Init + 199464 199526   63  2  0   86  110   123 0.578  13.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  47150  47165   16  1  1   77   85     8 0.961  -0.39
S.002 Intr + 181995 182180  186  0  0  120   94   150 0.803  17.60

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:102544618_102757351|GENSCAN_predicted_peptide_1|501_aa
XFTGTDGPSGFGFELTFRLKRETGESAPPTWPAELMQGLARYVFQSENTFCSGDHVSWHS
PLDNSESRIQHMLLTEDPQMQPVQTPFGVVTFLQIVGVCTEELHSAQQWNGQGILELLRT
VPIAGGPWLITDMRRGETIFEIDPHLQERVDKGIETDGSNLSGVSAKCAWDDLSRPPEDD
EDSRSICIGTQPRRLSGKDTEQIRETLRRGLEINSKPVLPPINPQRQNGLAHDRAPSRKD
SLESDSSTAIIPHELIRTRQLESVHLKFNQESGALIPLCLRGRLLHGRHFTYKSITGDMA
ITFVSTGVEGAFATEEHPYAAHGPWLQPCPGLLRLPLWRTGREQACLLGGAAEEAGGRVS
PQRSPSPGSSPHYLSSFVPYLGAQGVREKTDSVDRRVCRENVGGFRRFDFSRGTLVRTPS
KSSELPAPESLIEEEARAFRLVRVLPSPCCSLADSEQNAHVCCQRTFRWGVLFKLPKEYS
WPEKKLKVSILPDVVFDSPLH

>gi568815588f:102544618_102757351|GENSCAN_predicted_CDS_1|1506_bp
nngtttacaggaacagatggacctagtggttttggctttgagttgacctttcgtctgaag
agagaaactggggagtctgccccaccaacatggcccgcagagttaatgcagggcttggca
cgatacgtgttccagtcagagaacaccttctgcagtggggaccatgtgtcctggcacagc
cctttggataacagtgagtcaagaattcagcacatgctgctgacagaggacccacagatg
cagcccgtgcagacaccctttggggtagttaccttcctccagatcgttggtgtctgcact
gaagagctacactcagcccagcagtggaacgggcagggcatcctggagctgctgcggaca
gtgcctattgctggcggcccctggctgataactgacatgcggaggggagagaccatattt
gagatcgatccacacctgcaagagagagttgacaaaggcatcgagacagatggctccaac
ctgagtggtgtcagtgccaagtgtgcctgggatgacctgagccggccccccgaggatgac
gaggacagccggagcatctgcatcggcacacagccccggcgactctctggcaaagacaca
gagcagatccgggagaccctgaggagaggactcgagatcaacagcaaacctgtccttcca
ccaatcaaccctcagcggcagaatggcctcgcccacgaccgggccccgagccgcaaagac
agcctggaaagtgacagctccacggccatcattccccatgagctgattcgcacgcggcag
cttgagagcgtacatctgaaattcaaccaggagtccggagccctcattcctctctgccta
aggggcaggctcctgcatggacggcactttacatataaaagtatcacaggtgacatggcc
atcacgtttgtctccacgggagtggaaggcgcctttgccactgaggagcatccttacgcg
gctcatggaccctggttacaaccatgtcccgggctgctgcgtctgcccctgtggagaaca
gggagggagcaggcctgtctcctgggcggagctgctgaggaggctggagggagggtttcc
cctcagaggagcccctcacctggatcttcaccccattatctttcttcctttgtgccctac
ctgggtgctcaaggtgtacgggagaagacagattctgttgaccgaagagtttgtagagaa
aatgttggaggatttagaagatttgacttctccagaggaacccttgtaaggacacccagc
aaatcgtctgaattgccagctcctgagagcctcattgaagaggaagccagggccttcagg
ctagtacgagttctccccagcccctgctgttccctggcagattctgaacagaacgcccat
gtctgttgccagagaacattccgctggggagttctgttcaaacttcccaaagagtacagc
tggcctgaaaagaagctgaaggtctccatcctgcctgacgtggtgttcgacagtccgcta
cactag

>gi568815588f:102544618_102757351|GENSCAN_predicted_peptide_2|803_aa
MRRWEKTLSLPLHPSSILLSPKTPFALGRGAQTAPRSRPARAPQPAPGQPAGGRLHRGPG
TAGLPLPPRPDLAALGAAALASWPRRPRAPAPAREALGRRRRRSPLRARREDRGSLLRGA
GTRKALLTERPSGPAGAGARGSVGLQRLRTDPRGWGVGEACPGPLPAAAMAENWKNCFEE
ELICPICLHVFVEPVQLPCKHNFCRGCIGEAWAKDSGLVRCPECNQAYNQKPGLEKNLKL
TNIVEKFNALHVEKPPAALHCVFCRRGPPLPAQKVCLRCEAPCCQSHVQTHLQQPSTARG
HLLVEADDVRAWSCPQHNAYRLYHCEAEQVAVCQYCCYYSGAHQGHSVCDVEIRRNEIRA
IAQPCLPAGLAERLRVEVECAFLTVGSTTVLLCECPLSIFQCLGNLICSFGELGWGREGD
GRQKMLMKQQDRLEEREQDIEDQLYKLESDKRLVEVAKPKAMLRGGGGLSAGLWEKVNQL
KEEVRLQYEKLHQLLDEDLRQTVEVLDKAQAKFCSENAAQALHLGERMQEAKKLLGSLQL
LFDKTEDVSFMKNTKSVKILMDRTQTCTSSSLSPTKIGHLNSKLFLNEVAKKEKQLRKML
EGPFSTPVPFLQSVPLYPCGVSSSGAEKRKHSTAFPEASFLETSSGPVGGQYGAAGTASG
EGQSGQPLGPCSSTQHLVALPGGAQPVHSSPVFPPSQYPNGSAAQQPMLPQYGGRKILVC
SVDNCYCSSVANHGGHQPYPRSGHFPWTVPSQEYSHPLPPTPSVPQSLPSLAVRDWLDAS
QQPGHQDFYRVYGQPSTKHYVTS

>gi568815588f:102544618_102757351|GENSCAN_predicted_CDS_2|2412_bp
atgaggcggtgggaaaagaccctctccctgccactacatccctcttctatcctcctctca
ccaaaaaccccttttgccttaggaagaggggcccagacagcgccccggagtcggcccgcc
cgcgccccgcagcccgcgcccggccagccagccgggggacgcctgcaccgtggcccgggg
accgccggcctgcccctcccgccccgtccggatctagcagccctcggcgcggccgccctc
gcctcctggccccgcagaccccgggctccggcccctgcgagggaggcgctcgggaggagg
aggagacgcagcccgctgcgcgcgcggcgtgaggaccgcggctccctcctccggggggcg
ggcacgcggaaggcgctgctgactgagcgaccgtcggggccggctggggccggagctcgg
ggctcggtgggcctacagcggctccggacggacccccggggctggggagtcggggaggcc
tgccccggccccctgcccgcggccgccatggcggagaattggaagaactgcttcgaggag
gagctcatctgccctatctgcctgcacgttttcgtggagccagtgcagctgccgtgcaaa
cacaacttctgccggggctgcatcggcgaggcgtgggccaaggacagcggcctcgtacgc
tgcccagagtgcaaccaggcctacaaccagaagccgggcctggagaagaacctgaagctc
accaacatcgtggagaagttcaatgccctgcacgtggagaagccgccggcggcgctgcac
tgcgtgttctgccgccgcggccccccgctgcccgcgcagaaggtctgcctgcgctgcgag
gcgccctgctgccagtcccacgtgcagacgcacctgcagcagccctccaccgcccgcggg
cacctcctggtggaggcggacgacgtgcgggcctggagctgcccgcagcacaacgcctac
cgcctctaccactgcgaggccgagcaggtggccgtgtgccagtactgctgctactacagc
ggcgcgcatcagggacactcggtgtgcgacgtggagatccgaaggaatgaaatccgggca
attgcgcagccgtgtctgcctgccgggcttgctgagcgcctgcgagttgaagttgagtgt
gccttcctgacagttggcagcactactgtgctgctgtgtgagtgcccgctcagcattttc
cagtgtcttggcaacctgatttgctcattcggtgaactggggtggggtcgtgaaggggac
ggcaggcagaagatgctcatgaagcagcaggaccggctggaggagcgagagcaggacatt
gaggaccagctgtacaaactcgagtcagacaagcgcctggtggaggtagctaagcccaag
gccatgctgcggggtgggggtggcttgagcgcagggctttgggagaaagtgaaccaactg
aaggaggaagttcggctgcagtacgagaagctgcaccagctgctggacgaggacctgcgg
cagacagtggaggtcctagacaaggcccaggccaagttctgcagcgagaacgcagcgcag
gcgctgcacctcggggagcgcatgcaggaggccaagaagctgctgggctccctgcagctg
ctctttgataagacggaggatgtcagcttcatgaagaacaccaagtctgtgaaaatcctg
atggacaggacccagacctgcacgagcagcagcctttcccccactaagatcggccacctg
aactccaagctcttcctgaacgaagtggccaagaaggagaagcagctgcggaaaatgcta
gaaggccccttcagcacgccggtgcccttcctgcagagtgtccccctgtacccttgcggc
gtgagcagctctggggcggaaaagcgcaagcactcaacggccttcccagaggccagtttc
ctagagacgtcgtcgggccctgtgggcggccagtacggggcggcgggcacagccagcggt
gagggccagtctgggcagcccctggggccctgcagctccacgcagcacttggtggccctg
ccgggcggcgcccaaccagtgcactcaagccccgtgttccccccatcgcagtatcccaat
ggctccgccgcccagcagcccatgctcccccagtatggcggccgcaagattctcgtctgt
tctgtggacaactgttactgttcttccgtggccaaccatggcggccaccagccctacccc
cgctccggccactttccctggacagtgccctcgcaggagtactcacacccgctcccgccc
acaccctccgtcccccagtcccttcccagcctggcggtcagagactggcttgacgcctcc
cagcagcccggccaccaggatttctacagggtgtatgggcagccgtccaccaaacactac
gtgacgagctaa

>gi568815588f:102544618_102757351|GENSCAN_predicted_peptide_3|441_aa
MQGDQLDENPSNPGKKRRAGGGLCSSRSPPTAWAGALAHLRTLGKAELHPAPYCCSAPQL
CVRGTSGAAPQQLPLGLLSILRKLKSAPDQEVRILLLGLDNAGKTTLLKQLASEDISHIT
PTQGFNIKSVQSQGFKLNVWDIGGQRKIRPYWKNYFENTDILIYVIDSADRKRFEETGQE
LAELLEEEKLSCVPVLIFANKQDLLTAAPASEIAEGLNLHTIRDRVWQIQSCSALTGEGV
QGASQLLFKLGAEPFTRQVPTSVENWCCPPSGFRRHRPHLGKGQMSALGQPSDYLVLLPK
CTDSNWQSCNHCKFSLLREGAFKVIVPLREPSPQSLWMLVLKAIRSGPDPFKPIWSAAGR
EVNAHRPASHTEKQILMTFDCNDIWNDPEDPALGSQSQLLLWMPPVEGAGAAAIPYARRV
DQKFIPPLHPVLGTPLPAFYF

>gi568815588f:102544618_102757351|GENSCAN_predicted_CDS_3|1326_bp
atgcagggagaccagctggacgagaaccccagtaatccaggcaagaagaggcgagccggc
ggcggcctctgcagctcccgtagcccgcccactgcgtgggcgggcgccttagcgcacttg
cgcactctggggaaagcggagctgcaccccgccccgtattgctgctcagctcctcagctg
tgcgtgcgagggacgtcgggggcggcgccgcagcagttgcccctgggcttgctctcaatt
ttgcgcaagttgaaaagtgcaccagaccaggaggtgagaatacttctcctgggcttggat
aatgctggcaagaccactcttctgaagcagcttgcatctgaagacatcagccacatcaca
cctacacagggtttcaacatcaaaagtgtacaatcacaaggttttaaactgaatgtatgg
gacattggtggacagaggaaaatcagaccatactggaagaattattttgaaaataccgat
attcttatatatgtaatcgacagtgcagacagaaaaagatttgaagagacgggtcaggaa
ctagcggaattactggaggaagaaaaactaagttgtgtgccagtgctcatctttgctaat
aagcaggatttgctcacagcagcccctgcctctgaaattgcagaaggactgaacctgcat
accatccgcgaccgagtctggcagatccagtcttgctcagctctcacaggagagggcgtt
cagggtgcttcccagctcctctttaagcttggggcagaacccttcacccggcaggtaccc
acatccgtggagaactggtgctgtccaccttcgggctttcggcgccaccggcctcatctt
ggcaaaggtcagatgtcggctctgggacagccgtcagattatctagttctgctgcccaag
tgcactgacagcaactggcagagttgtaatcactgtaaattcagccttctaagagagggc
gctttcaaggtcattgtgcctttaagggaaccttctccccagtccctctggatgctggtg
cttaaagctataagaagcgggccagacccttttaaacctatctggagtgctgcaggaaga
gaagtaaatgcacatcgccctgccagccacacagagaagcaaattcttatgacttttgac
tgtaatgacatctggaatgatccagaagaccctgccctggggtcccagagtcagctgctg
ctctggatgccaccagtagagggggcaggagctgctgccatcccctatgccaggagggtg
gaccagaagttcataccacctttacacccagttcttggaacacccctccctgcattctac
ttctga

>gi568815588f:102544618_102757351|GENSCAN_predicted_peptide_4|318_aa
MEADLSGFNIDAPRWDQRTFLGRVKHFLNITDPRTVFVSERELDWAKVMVEKSRMGVVPP
GTQVEQLLYAKKLYDSAFHPDTGEKMNVIGRMSFQLPGGMIITGFMLQFYRTMPAVIFWQ
WVNQSFNALVNYTNRNAASPTSVRQMALSYFTATTTAVATAVGMNMLTKKAPPLVGRWVP
FAAVAAANCVNIPMMRQQELIKGICVKDRNENEIGHSRRAAAIGITQVVISRITMSAPGM
ILLPVIMERLEKLHFMQPHLHGASGVWAFPTEMVSAVHSLLGLIDHAASALPDLPAVQKE
TGKGQEGQHLSEDKCPSA

>gi568815588f:102544618_102757351|GENSCAN_predicted_CDS_4|957_bp
atggaggctgacctgtctggctttaacatcgatgccccccgttgggaccagcgcaccttc
ctggggagagtgaagcacttcctaaacatcacggacccccgcactgtctttgtatctgag
cgggagctggactgggccaaggtgatggtggagaagagcaggatgggggttgtgccccca
ggcacccaagtggagcagctgctgtatgccaagaagctgtatgactcggccttccacccc
gacactggggagaagatgaatgtcatcgggcgcatgtctttccagcttcctggcggcatg
atcatcacgggcttcatgctccagttctacaggacgatgccggcggtgatcttctggcag
tgggtgaaccagtccttcaatgccttagtcaactacaccaacaggaatgcggcttccccc
acatcagtcaggcagatggccctttcctacttcacagccacaaccactgctgtggccacg
gctgtgggcatgaacatgttgacaaagaaagcgccgcccttggtgggccgctgggtgccc
tttgccgctgtggctgcggctaactgtgtcaatatccccatgatgcgacagcaggagctc
ataaagggaatctgcgtgaaggacaggaatgaaaatgagattggtcattcccggagagct
gcggccataggcatcacccaagtagttatttctcggatcaccatgtcagctcctgggatg
atcttgctgccagtcatcatggaaaggcttgagaaattgcacttcatgcagcctcatctt
catggtgccagtggcgtgtgggcttttcccacagaaatggtatctgctgttcattcctta
cttggtttaattgaccatgctgccagcgcgctccctgatctgcctgctgtccagaaggag
acagggaaggggcaggaagggcagcacctgtctgaggacaagtgtccatcagcctaa

>gi568815588f:102544618_102757351|GENSCAN_predicted_peptide_5|21_aa
MALLLLQALPSPLSARAEPPQ

>gi568815588f:102544618_102757351|GENSCAN_predicted_CDS_5|63_bp
atggcgctcctgctcctccaggcgctgcccagccccttgtcagccagggctgaacccccg
cag