GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:17:42 Sequence gi568815577f:25539822_25812383 : 272562 bp : 39.33% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 6334 6708 375 0 0 77 42 191 0.986 9.29 1.02 PlyA + 7287 7292 6 1.05 2.00 Prom + 9683 9722 40 -7.25 2.01 Init + 16838 16913 76 1 1 87 89 44 0.409 5.70 2.02 Intr + 20759 20868 110 0 2 44 87 69 0.310 1.48 2.03 Intr + 25597 25718 122 0 2 6 59 149 0.118 2.17 2.04 Intr + 28537 28685 149 1 2 93 44 65 0.206 1.56 2.05 Intr + 36558 36637 80 1 2 82 98 44 0.233 3.15 2.06 Term + 46368 46538 171 2 0 74 38 119 0.325 2.34 2.07 PlyA + 47171 47176 6 1.05 3.12 PlyA - 49842 49837 6 1.05 3.11 Term - 50719 50600 120 1 0 95 41 39 0.357 -2.61 3.10 Intr - 53144 52991 154 1 1 23 89 97 0.727 2.45 3.09 Intr - 54137 54072 66 1 0 115 111 42 0.944 6.30 3.08 Intr - 57593 57481 113 2 2 46 108 61 0.859 2.16 3.07 Intr - 60045 59978 68 1 2 41 106 81 0.749 2.91 3.06 Intr - 66834 66628 207 1 0 93 85 118 0.947 10.13 3.05 Intr - 67931 67582 350 1 2 92 64 220 0.447 14.08 3.04 Intr - 83215 83088 128 1 2 82 97 4 0.134 -0.74 3.03 Intr - 83990 83866 125 0 2 25 100 75 0.173 1.78 3.02 Intr - 87023 86986 38 1 2 111 75 24 0.207 0.39 3.01 Init - 90478 90279 200 1 2 54 75 101 0.441 3.82 3.00 Prom - 91812 91773 40 -4.05 4.00 Prom + 93461 93500 40 -8.25 4.01 Init + 100001 100067 67 1 1 101 87 209 0.987 21.39 4.02 Intr + 144062 144127 66 0 0 100 87 49 0.090 3.96 4.03 Intr + 150045 150152 108 1 0 49 121 17 0.341 0.44 4.04 Intr + 153935 154102 168 0 0 81 20 172 0.449 8.70 4.05 Intr + 158856 159058 203 1 2 37 95 151 0.635 8.78 4.06 Intr + 162349 162448 100 0 1 8 87 102 0.400 0.86 4.07 Intr + 166158 166265 108 1 0 43 108 48 0.302 1.64 4.08 Intr + 175356 175491 136 1 1 59 70 74 0.024 1.41 4.09 Term + 179218 179323 106 1 1 83 33 67 0.019 -2.40 4.10 PlyA + 180031 180036 6 1.05 5.05 PlyA - 180538 180533 6 1.05 5.04 Term - 184856 184819 38 0 2 118 49 62 0.982 1.82 5.03 Intr - 185529 185405 125 2 2 101 102 145 0.999 16.51 5.02 Intr - 189980 189810 171 1 0 100 98 202 0.836 20.54 5.01 Init - 190139 190105 35 2 2 61 80 34 0.807 -0.71 5.00 Prom - 190467 190428 40 -9.35 6.00 Prom + 191435 191474 40 -10.25 6.01 Init + 192541 192549 9 0 0 55 61 0 0.228 -5.36 6.02 Intr + 193958 194092 135 1 0 90 103 40 0.543 5.54 6.03 Intr + 194372 194438 67 1 1 67 61 69 0.713 -0.34 6.04 Intr + 194820 194948 129 1 0 93 100 82 0.777 9.65 6.05 Intr + 195043 195249 207 2 0 13 110 174 0.549 10.23 6.06 Intr + 195620 195757 138 0 0 8 92 92 0.256 1.11 6.07 Intr + 201776 201854 79 0 1 13 81 117 0.026 1.19 6.08 Intr + 205389 205533 145 0 1 36 91 134 0.999 7.86 6.09 Intr + 209215 209299 85 0 1 45 115 57 0.997 2.57 6.10 Intr + 212168 212413 246 0 0 47 45 278 0.453 15.91 6.11 Intr + 218189 218383 195 0 0 71 11 122 0.533 1.26 6.12 Intr + 222491 222544 54 0 0 88 92 79 0.982 6.33 6.13 Intr + 224389 224529 141 2 0 81 68 72 0.923 3.90 6.14 Intr + 224774 224966 193 0 1 72 121 237 0.998 23.03 6.15 Term + 229183 229411 229 1 1 65 52 274 0.999 16.62 6.16 PlyA + 229742 229747 6 1.05 7.00 Prom + 230391 230430 40 -6.85 7.01 Init + 239567 239785 219 1 0 71 73 296 0.972 25.08 7.02 Term + 239838 239894 57 2 0 95 39 59 0.823 -1.59 7.03 PlyA + 240050 240055 6 1.05 8.05 PlyA - 240845 240840 6 1.05 8.04 Term - 251798 251653 146 0 2 86 44 63 0.047 -1.21 8.03 Intr - 258858 258773 86 2 2 66 45 66 0.158 -1.36 8.02 Intr - 261801 261693 109 1 1 129 89 83 0.768 10.92 8.01 Init - 266860 266563 298 1 1 45 42 175 0.666 6.03 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 143027 142668 360 2 0 46 99 181 0.835 11.23 S.002 Init + 201778 201854 77 0 2 91 81 115 0.969 11.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:25539822_25812383|GENSCAN_predicted_peptide_1|124_aa MRFGGDKHLNYINLSKNGYLQEEEAGICKALRKQITTQLLEAINTVQYCQDLDSVSYCYC KTNYHKFYGLEQDKFIFLQSESQNSKMGQQGCAPSGGSSGESVSCLFQLLEAARLPWLVA PFHP >gi568815577f:25539822_25812383|GENSCAN_predicted_CDS_1|375_bp atgagatttggaggcgacaagcatctaaactatatcaacttgtccaagaatggctacctg caggaggaagaagctggcatctgcaaggctctacgaaaacagattacaactcagttactt gaggcaatcaatactgtccaatactgtcaagatttggactcagtatcctattgctactgt aaaacaaattaccacaaattttatggcttagaacaagacaaatttattttcttacagtct gaaagtcagaattctaaaatgggtcaacagggctgtgctccttctggaggctctagtgga gaatctgtctcttgccttttccagcttctagaggctgcccgccttccttggcttgtggcc cccttccatccttaa >gi568815577f:25539822_25812383|GENSCAN_predicted_peptide_2|235_aa MDAAGGHYLKQINAETENHLLHVLTSPQRHMDLESRETGLRSQPSRVGTEFESQLDYLLG SKAHHMSEQFSILTHKSKALSNVEEDVIPKITTLTHSDTLTRNFIHICSITFQHSSDLPL SQCEVRVLRKRVPAVRLQLAVVLLLLIPQNESEGFLLTSHYKNSWSMSKTESVFGLEEGT HQTDMAKSEFNTPKPVLPASSILVKGSTQIPKDSIQEFTLFFLPAYSINSPTNSA >gi568815577f:25539822_25812383|GENSCAN_predicted_CDS_2|708_bp atggatgcagctggaggccattatcttaagcaaattaatgcagaaacagaaaatcaccta ctacatgttctcacttctcctcaacgtcacatggatctggaaagcagggagactggacta cggagccagccctccagggttggaactgagtttgaatctcagcttgactacttactagga tctaaggcacatcatatgtctgagcagttcagcatcctcactcacaagtccaaggctctc agcaatgtggaagaagatgtgattcctaaaattaccacccttacacacagtgacacactc accagaaactttattcacatttgttctataacatttcagcacagttcagaccttccgctg agccagtgtgaggtcagggtgttgaggaagcgagtccctgcagtcagactccagctggct gtcgtcctgcttctgcttattccacagaatgagtctgagggattcctcctgaccagtcat tacaaaaattcttggagcatgagcaagactgagtctgtgtttggattggaggaagggact catcaaacagacatggctaaatctgaattcaacaccccaaaacctgtcctacctgcatca tctatcttggtaaagggctccacccagatacctaaggacagcattcaagaattcactctc ttcttcctacctgcatattccatcaacagtcctactaattcagcctaa >gi568815577f:25539822_25812383|GENSCAN_predicted_peptide_3|522_aa MLVAGTQLHAITDSVSSCEDVGGRERHKSGKAITEGAIKKERAIREVREDFTGESEGWSL KEVGEGRIGTQHPGALKIKVTLNVFQLSRGLNSQPVLLRLQHASESTESLVKMQIWIQQD KGLLVSKYFRLQRKGMRIKSMVIRDDFYQQGNSPKLHKQQSDCKAYVVGGRPTALGALHL WVGDRAQVSSTGNAPRLRVSGSQPLNPQRGLTPPGGGEQSECWVRNRPVLRNSECAFGFR GRRAEKDLRATVLTAAMEALAMGSRALRLWLVAPGGGIKWRFIATSSASQLSPTELTEMR NDLFNKEKARQLSLTPRTEKIEVKHVGKTDPGTVFVMNKNISTPYSCAMLISGAFCYDVV LDSKLDEWMPTKENLRSFTKDAHALIYKDLPFETLEVEAKVALEIFQHSKYKVDFIEEKA SQNPERIVKLHRIGDFIDVSEGPLIPRTSICFQYEVSAVHNLQPTQPSLIRRFQGVSLPV HLRPSQFQKMGSSFNSSLSNSQPMGCMQSRTALNETQICKLS >gi568815577f:25539822_25812383|GENSCAN_predicted_CDS_3|1569_bp atgctagtggctgggacacaactgcacgcaataacagacagtgtctcatcttgtgaagat gtaggtgggagagagagacataaatcaggcaaagcgatcactgagggtgctattaaaaaa gaacgtgctatcagggaggtcagggaagacttcacaggggaatctgaaggctggagtctg aaggaggtaggagaaggcaggatagggactcaacatcctggagcactaaaaatcaaagtt acacttaatgttttccaactgagcaggggactgaactctcaaccagtgcttctcaggctt caacatgcatcggaatcaacagagagtcttgttaaaatgcagatttggattcagcaggac aagggccttctggtgtcaaaatattttagattgcagaggaaagggatgagaataaaatcc atggttatcagagatgatttctaccaacagggaaattcacccaaattgcacaaacagcaa tcagattgtaaagcctacgtggtgggcggcaggccgaccgccctgggggccctgcacctc tgggtaggtgaccgcgcccaggtatcgagcacaggtaacgcgccccggctccgagtctcg gggtctcagccactgaatccccagcgaggcctgacgccccccggtggaggcgagcagtct gagtgctgggtccggaacagacctgtcctgcggaactccgagtgcgcgttcggtttccgg gggaggagggcggagaaggacttgcgcgcgacggttctcaccgctgctatggaggcgctg gccatgggttcccgggcgctgcggctctggctggtcgcacccggtggcgggatcaaatgg agatttatagcaacatcgtcagcttctcagctgtcaccgacagaattgacagaaatgcgg aatgatctctttaataaagagaaagccaggcagttatcattaactccccgaactgagaag atagaagttaagcatgttgggaaaactgaccccggtactgtcttcgtgatgaataaaaac atttcaactccctacagttgtgccatgctaatttctggtgccttctgttatgacgtagtt ttggatagcaaacttgatgagtggatgccaacaaaagagaacttacgttccttcacaaaa gatgctcatgctttaatttataaagatcttccatttgaaactctggaagttgaagcaaaa gtggcattggaaatatttcaacacagcaagtacaaagtagatttcatagaagagaaggca tctcagaaccctgagagaatagtcaagctacacagaataggtgacttcattgatgtgagt gagggccctcttattccaagaacaagtatttgtttccagtatgaagtatcagcagttcac aatcttcaacccacccagccaagtctcatacgaagattccagggcgtgtctttacctgtt cacttaagaccttcccaatttcagaaaatgggatcatcctttaactcaagcttgtccaac tcacagcccatgggctgcatgcagtccaggacggctttgaatgagacacaaatttgtaaa ctttcttaa >gi568815577f:25539822_25812383|GENSCAN_predicted_peptide_4|353_aa MARRSRHRLLLLLLRYLVVALGYHKAYGFSAPKDQQVVTAVEYQEAILACKTPKKTVSSR LEWKKLGRSVSFVYYQQTLQGDFKNRAEMIDFNIRIKNVTRSDAGKYRCEVSAPSEQGQN LEEDTVTLEVLGDVHVLAPAVPSCEVPSSALSGTVVELRCQDKEGNPAPEYTWFKDGIRL LENPRLGSQSTNSSYTMNTKTGTLQFNTVSKLDTGEYSCEARNSVGYRRCPGKRMQVDDL NISGIIAAVVVVALVISVCGLGVCYAQRKGYFSKRRKRDGSYAQGFALCVYFHSWELKAT SKEMPNGWSPQVCTTGFAWLSIASQRLSPGTSTLQYPLSHNVNLSSSKGSFSL >gi568815577f:25539822_25812383|GENSCAN_predicted_CDS_4|1062_bp atggcgaggaggagccgccaccgcctcctcctgctgctgctgcgctacctggtggtcgcc ctgggctatcataaggcctatgggttttctgccccaaaagaccaacaagtagtcacagca gtagagtaccaagaggctattttagcctgcaaaaccccaaagaagactgtttcctccaga ttagagtggaagaaactgggtcggagtgtctcctttgtctactatcaacagactcttcaa ggtgattttaaaaatcgagctgagatgatagatttcaatatccggatcaaaaatgtgaca agaagtgatgcggggaaatatcgttgtgaagttagtgccccatctgagcaaggccaaaac ctggaagaggatacagtcactctggaagtattaggtgatgtgcatgtattggctccagca gttccatcatgtgaagtaccctcttctgctctgagtggaactgtggtagagctacgatgt caagacaaagaagggaatccagctcctgaatacacatggtttaaggatggcatccgtttg ctagaaaatcccagacttggctcccaaagcaccaacagctcatacacaatgaatacaaaa actggaactctgcaatttaatactgtttccaaactggacactggagaatattcctgtgaa gcccgcaattctgttggatatcgcaggtgtcctgggaaacgaatgcaagtagatgatctc aacataagtggcatcatagcagccgtagtagttgtggccttagtgatttccgtttgtggc cttggtgtatgctatgctcagaggaaaggctacttttcaaaaaggaggaagcgggatggg tcttatgcccaaggatttgcactttgtgtttacttccattcctgggaactaaaagcaaca tcaaaagaaatgcctaatggctggagcccacaggtgtgcaccacaggatttgcctggctt tcaatagcatctcaacgtttatctccgggaacttccaccctccagtaccctctcagtcac aacgtcaatctctccagttccaaaggctccttttccctctaa >gi568815577f:25539822_25812383|GENSCAN_predicted_peptide_5|122_aa MGNKEEKGTVVRISMILQRLFRFSSVIRSAVSVHLRRNIGVTAVAFNKELDPIQKLFVDK IREYKSKRQTSGGPVDASSEYQQELERELFKLKQMFGNADMNTFPTFKFEDPKFEVIEKP QA >gi568815577f:25539822_25812383|GENSCAN_predicted_CDS_5|369_bp atggggaataaggaggagaagggaactgtggtcagaatcagcatgattcttcagaggctc ttcaggttctcctctgtcattcggtcagccgtctcagtccatttgcggaggaacattggt gttacagcagtggcatttaataaggaacttgatcctatacagaaactctttgtggacaag attagagaatacaaatctaagcgacagacatctggaggacctgttgatgctagttcagag tatcagcaagagctggagagggagctttttaagctcaagcaaatgtttggtaatgcagac atgaatacatttcccaccttcaaatttgaagatcccaaatttgaagtcatcgaaaaaccc caggcctga >gi568815577f:25539822_25812383|GENSCAN_predicted_peptide_6|683_aa MSQMIKLYLRKIKLHVFSQLICSKSDCIDVTPSATRVKSTFIVSIIHHYHRVPREHMIAS DMIRKTGEGNKRVSILFLGLRQTPRSTNVEVSCDPGPGREGPQDTELLSPPSKVPSCQSL RRHHLRSTSGPGSAPTRLPAIAMHYGPPFQSVDAHRTGSVSETVCDRTGLGETEAKQEEE VEGPWDLTLLVAGAAGLTRRDAARGALAAAVLSRLWSAGGGDRADSGVAMTKREAEELIE IEIDGTEKAECTEESIVEQTYAPAECVSQAIDINEPIGNLKKLLEPRLQCSLDAHEICLQ DIQLDPERSLFDQGVKTDGTVQLSVQVISYQGIEPKLNILEIVKPADTVEVVIDPDAHHA ESEAHLVEEAQVITLDGTKHITTISDETSEQVTRWAAALEGYRKEQERLGIPYDPIQWST DQVLHWVVWVMKEFSMTDIDLTTLNISGRELCSLNQEDFFQRVPRGEILWSHLELLRKYV LASQEQQMNEIVTIDQPVQIIPASVQSATPTTIKVINSSAKAAKVQRAPRISGEDRSSPG NRTGNNGQIQLWQFLLELLTDKDARDCISWVGDEGEFKLNQPELVAQKWGQRKNKPTMNY EKLSRALRYYYDGDMICKVQGKRFVYKFVCDLKTLIGYSAAELNRLVTECEQKKLAKMQL HGIAQPVTAVALATASLQTEKDN >gi568815577f:25539822_25812383|GENSCAN_predicted_CDS_6|2052_bp atgagccagatgataaaactgtatctaagaaagattaagctgcacgtattctcacagtta atttgctctaagtctgactgcatagatgttactccttctgcaacacgggtaaaatccact ttcattgtttcaattatccaccattaccatcgggtgccacgagagcatatgatagcgtct gacatgatcagaaagactggggaaggcaacaagagggtatcgatcttatttctgggtcta cggcaaactccaaggtctacaaacgtagaggtcagctgtgaccccgggccaggccgtgaa ggtccccaggacacagagctgctctctcctcctagtaaagtcccgagctgccaaagcctc cgccgccaccacctccgctctacttccggccctggctccgcccccacacgcctacccgcc atcgcaatgcattatgggccgccgtttcagtcggtcgacgctcaccggacaggaagcgtc tcggagacagtctgcgaccggacgggtctaggtgagacagaagccaaacaggaggaggaa gtggaggggccctgggacctcacacttctagtcgcgggagctgcaggtcttacccggaga gacgctgcacgtggagccctcgccgctgccgttctcagccggctctggagtgcgggcggg ggcgacagggccgattccggagtggccatgactaaaagagaagcagaggagctgatagaa attgagattgatggaacagagaaagcagagtgcacagaagaaagcattgtagaacaaacc tacgcgccagctgaatgtgtaagccaggccatagacatcaatgaaccaataggcaattta aagaaactgctagaaccaagactacagtgttctttggatgctcatgaaatttgtctgcaa gatatccagctggatccagaacgaagtttatttgaccaaggagtaaaaacagatggaact gtacagcttagtgtacaggtaatttcttaccaaggaattgaaccaaagttaaacatcctt gaaattgttaaacctgcggacactgttgaggttgttattgatccagatgcccaccatgct gaatcagaagcacatcttgttgaagaagctcaagtgataactcttgatggcacaaaacac atcacaaccatttcagatgaaacttcagaacaagtgacaagatgggctgctgcactggaa ggctataggaaagaacaagaacgccttgggataccctatgatcccatacagtggtccaca gaccaagtcctgcattgggtggtttgggtaatgaaggaattcagcatgaccgatatagac ctcaccacactcaacatttcggggagagaattatgtagtctcaaccaagaagattttttt cagcgggttcctcggggagaaattctctggagtcatctggaacttctccgaaaatatgta ttggcaagtcaagaacaacagatgaatgaaatagttacaattgatcaacctgtgcaaatt attccagcatcagtgcaatctgctacacctactaccattaaagttataaatagtagtgcg aaagcagccaaagtacaaagagcgccgaggatttcaggagaagatagaagctcacctggg aacagaacaggaaacaatggccaaatccaactatggcagtttttgctagaacttcttact gataaggacgctcgagactgcatttcttgggttggtgatgaaggtgaatttaagctaaat cagcctgaactggttgcacagaaatggggacagcgtaaaaataagcctacgatgaactat gagaaactcagtcgtgcattaagatattattacgatggggacatgatttgtaaagttcaa ggcaagagatttgtgtacaagtttgtctgtgacttgaagactcttattggatacagtgca gcggagttgaaccgtttggtcacagaatgtgaacagaagaaacttgcaaagatgcagctc catggaattgcccagccagtcacagcagtagctctggctactgcttctctgcaaacggaa aaggataattga >gi568815577f:25539822_25812383|GENSCAN_predicted_peptide_7|91_aa MNTPEGRNSEHIRTSEGTNSGHAAFKNCNTARVHGFMLEVGKTKNPPIPDTFWRPRRDFC LSLSGETIAYRQANSGAKYWAPVSQLKVTGK >gi568815577f:25539822_25812383|GENSCAN_predicted_CDS_7|276_bp atgaacacaccggaaggaagaaactcggaacacatccgaacatcagaaggaacaaactcc ggacacgccgcctttaagaactgtaacaccgccagggtccacggcttcatgcttgaagtt ggtaagaccaagaacccaccaattccggacacgttttggcgaccacgaagggacttttgc ctgtcgctgagcggtgagaccatcgcctatcgccaagcaaattcgggggctaaatactgg gcacctgtcagccagttaaaagtgaccggcaagtga >gi568815577f:25539822_25812383|GENSCAN_predicted_peptide_8|212_aa MSTHIDEDRSSLLSLWIRMLISSKNTLTDPPRNVSAYPLTRSITPKMQHQVKGRRVFTTE SCISGEFRKQKKHFAKRPNVCRSAVCRPEGHSEKYGCGRGPQKPNCRRNLSGKVSAAGVL NNISGAEKPHWPFLQWWGNAGRKKFAGQVPWPVPVILALWEIKADPSHFPMKLCLILLSE HSEPYQHGPLASRAVGGAEGLNTPAFSQAEFQ >gi568815577f:25539822_25812383|GENSCAN_predicted_CDS_8|639_bp atgtccacccatattgatgaggacagatcctctttactcagtctgtggattcgaatgcta atctcttccaaaaacactctcacagacccacccagaaatgtttctgcgtatcccttaacc cggtcaataacacctaaaatgcagcaccaagtcaaagggagaagagtgttcacaacggag agctgcatttctggtgaattcaggaagcaaaagaaacatttcgcaaagagaccgaatgta tgcagatctgctgtgtgcagacctgaaggccacagtgaaaaatatgggtgtgggcgtggt ccacagaaacccaactgtcgcagaaacctgagtggtaaagtgtcagcagcaggagttctg aataacatttcaggggcagaaaaaccgcactggcctttccttcaatggtggggaaatgca ggacgaaagaaatttgcaggccaggtgccgtggcccgtgcccgtaatcctagcactttgg gagatcaaggcagacccatctcatttccccatgaaactctgcctcatacttctgtctgag cactctgagccctaccagcatggtcctttggcatccagagctgttggtggcgctgaagga ctgaacactcctgctttttctcaggcagaattccaatga