GENSCAN 1.0 Date run: 3-Nov-116 Time: 05:12:05 Sequence gi568815585f:79381486_79651117 : 269632 bp : 37.45% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 11947 12036 90 2 0 38 92 91 0.339 3.55 1.02 Intr + 15861 15941 81 1 0 95 71 34 0.144 1.09 1.03 Intr + 24136 24332 197 2 2 97 24 96 0.720 2.31 1.04 Term + 24568 24756 189 0 0 100 35 271 0.871 19.47 1.05 PlyA + 25467 25472 6 1.05 2.03 PlyA - 26362 26357 6 1.05 2.02 Term - 40851 40474 378 0 0 30 41 208 0.941 4.10 2.01 Init - 47696 47631 66 2 0 66 89 55 0.816 4.52 2.00 Prom - 48258 48219 40 -2.85 3.00 Prom + 49351 49390 40 -7.45 3.01 Init + 50388 50542 155 2 2 60 42 166 0.702 8.70 3.02 Intr + 51271 51528 258 1 0 85 38 102 0.307 0.56 3.03 Intr + 52613 52665 53 2 2 56 52 100 0.246 0.83 3.04 Intr + 71330 71471 142 0 1 66 77 66 0.053 1.79 3.05 Intr + 78532 78682 151 1 1 104 65 76 0.020 6.14 3.06 Term + 94245 94349 105 2 0 97 44 81 0.166 2.03 3.07 PlyA + 94818 94823 6 1.05 4.00 Prom + 95857 95896 40 -7.15 4.01 Init + 99719 100039 321 1 0 52 89 312 0.750 23.08 4.02 Intr + 105548 105730 183 1 0 78 62 52 0.199 0.66 4.03 Intr + 115934 116041 108 1 0 59 68 66 0.012 1.26 4.04 Intr + 139325 139490 166 1 1 90 80 166 0.985 14.61 4.05 Intr + 151838 151971 134 0 2 76 52 195 0.254 14.14 4.06 Intr + 158197 158290 94 0 1 46 89 88 0.181 3.32 4.07 Intr + 162073 162197 125 2 2 54 96 40 0.055 0.78 4.08 Intr + 178527 178665 139 2 1 78 50 106 0.095 4.92 4.09 Term + 183425 183552 128 0 2 86 49 78 0.395 1.16 4.10 PlyA + 183927 183932 6 1.05 5.00 Prom + 184687 184726 40 -5.75 5.01 Init + 187945 188033 89 0 2 72 75 103 0.895 7.56 5.02 Intr + 191004 191041 38 0 2 99 97 30 0.891 1.99 5.03 Term + 191251 191492 242 2 2 33 43 199 0.950 5.10 5.04 PlyA + 191999 192004 6 1.05 6.02 PlyA - 192902 192897 6 1.05 6.01 Sngl - 199217 199041 177 2 0 77 41 179 0.553 6.80 6.00 Prom - 213847 213808 40 -3.45 7.00 Prom + 215448 215487 40 -3.55 7.01 Init + 217880 217982 103 1 1 60 68 91 0.779 4.77 7.02 Intr + 218009 218133 125 0 2 43 113 77 0.642 5.08 7.03 Intr + 218578 218863 286 0 1 -34 68 317 0.346 13.49 7.04 Intr + 220335 220437 103 1 1 20 81 67 0.068 -2.49 7.05 Intr + 220587 220887 301 0 1 38 36 152 0.027 0.81 7.06 Intr + 220977 221083 107 2 2 41 74 110 0.067 2.99 7.07 Intr + 225445 225574 130 1 1 89 52 75 0.083 3.78 7.08 Intr + 230268 230429 162 2 0 66 78 43 0.130 0.35 7.09 Intr + 231152 231288 137 1 2 93 40 80 0.170 2.15 7.10 Intr + 237473 237554 82 2 1 70 103 50 0.171 3.42 7.11 Term + 247135 247206 72 0 0 97 48 52 0.050 -0.97 7.12 PlyA + 247681 247686 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 86975 86913 63 2 0 84 121 38 0.802 7.90 S.002 Init - 119218 119142 77 1 2 72 91 86 0.801 7.91 S.003 Term - 220800 220576 225 0 0 36 49 261 0.879 12.80 S.004 Init - 221965 221951 15 1 0 85 92 1 0.856 0.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:79381486_79651117|GENSCAN_predicted_peptide_1|185_aa XSPSALCLLGRILSSETRSEVAADPYGFAADWKQDMDVHSYYFNIVLLLLASMIRNEGTW GNCTDLWVRLSPTAIPAKRYPPDTHMGSRVLLSQDLSASKFSMIILETIHSGPKARHTPP PAQQIYPDRKRDTRLDPSLLSASGPGACWLRRKGKSKESEGEGQLNGNDSGGLPGSQHPQ EDGCR >gi568815585f:79381486_79651117|GENSCAN_predicted_CDS_1|558_bp ntttctccttctgccttatgtctcctcggtcgaattctttcttctgagacaagaagtgag gttgctgcagatccatatggatttgcagcagactggaaacaagacatggacgtccactct tactacttcaacattgtactattgcttctagccagcatgataagaaatgaaggcacttgg ggaaactgcacagatctctgggtccgcctatctcccactgccatccctgcaaagagatat cccccggatactcacatgggctcgagagtcttgctgagccaggacttgagtgcctcgaag ttttcaatgatcattttagaaaccatccacagcggccctaaggcccgtcacactcctccg cccgcccagcaaatctacccagaccggaagcgcgacacgcgcttagacccgagccttctc tcagcctcgggaccaggtgcttgttggctacgtcgaaaaggcaaaagtaaagaaagcgaa ggcgaagggcagctcaatgggaacgattcagggggtcttcccggctcccagcacccccaa gaagatggctgccggtaa >gi568815585f:79381486_79651117|GENSCAN_predicted_peptide_2|147_aa MEGWYFIEGKVDIGYIAFVSQRTGPDFPPHECELYLTTNECGGSNRVRLLRLIHKRQCGL CLTLFWTAYFGESQWPCHEDTRTSMKRSHMVWIRTPATAILDLDSPAPDRASNDWGPSRH LDCNFTGDSGARTSQLSSSCNHNTQKL >gi568815585f:79381486_79651117|GENSCAN_predicted_CDS_2|444_bp atggaaggatggtattttatcgagggtaaagtggatattggatacattgcatttgtgtca cagagaacaggccctgattttcctccccatgaatgtgagctgtacctaacgactaatgaa tgtggtggaagtaaccgtgttagacttctaaggctaattcataaaagacagtgtgggctc tgtcttaccctcttttggactgcttactttggggaaagccagtggccatgccatgaggac actcgaacatctatgaagaggtcccacatggtgtggataaggactcctgcaacggccatt ttggacctagattctcctgccccagacagggcttcaaatgactggggtcccagcagacat cttgactgcaacttcacaggagactctggagccagaacctctcagctaagcagctcctgc aatcataacacacagaaactgtga >gi568815585f:79381486_79651117|GENSCAN_predicted_peptide_3|287_aa MATGIATDDSGGPSGGATAMMLAAAGEAQPGLHAPQSGQELATGRSPSELAEICILNSSW RWDKNSGPANGGIKRAITQTELKHAPALAMLRAMRKEERRRKELWPFGDPRPRGSPSQGY DPTPSLGFCGSWRLQASRTCAALIGRICFTDPISLAQPKNLQGWRGAIFLSPAKLSELFL NLRNPAARKLEDKLKSNWIVWYWFYLYNVKAASTWKTFSPPLNPSPPYPLRHRNYSYACL PLLPTVSSLRTDKILSDLYPEGGNQATFLYSDLIVVSEYVANGEAVY >gi568815585f:79381486_79651117|GENSCAN_predicted_CDS_3|864_bp atggccactggaattgcaactgatgacagtggtggcccatctggaggggccactgccatg atgctggccgcagcaggggaggctcagccagggctgcatgctccacagagcggacaagag ctggcaacaggcagaagcccttctgagttggcagaaatctgcatacttaattcttcctgg aggtgggacaaaaactctggacccgctaatggcgggattaaaagagctataacacaaaca gagctgaaacatgcccctgcactcgccatgttacgggcaatgaggaaagaagagagaaga agaaaagagctgtggccctttggggatcccagacctaggggctccccgagccagggctat gaccccaccccctctttgggtttctgtggttcctggcgtctccaagcttccaggacatgt gctgctttgattggtcggatctgcttcacggaccccatctccctggcccagccaaagaac ctacaaggatggaggggagccatttttctctcccctgcaaagctgtctgaattatttctg aacttaaggaatcctgcagccagaaaattggaagacaaactgaaatctaactggatcgtt tggtattggttttacttgtacaacgtcaaagctgcttctacttggaaaacctttagtcca cctcttaaccccagtcctccttatcctctaagacacaggaattactcatatgcctgtctg cctttactaccaacagtgagctccttgagaacagacaagatattgagtgacctttatcct gagggcggaaatcaggcaactttcttatatagtgaccttattgttgtatcagaatatgtt gctaatggtgaagcagtgtactga >gi568815585f:79381486_79651117|GENSCAN_predicted_peptide_4|465_aa MARRRSQRVCASGPSMLNSARGAPELLRGTATNAEVSAAAAGATGSEELPPGDRGCRNGG GRGPAATTSSTGVAVGAEHGEDSLSRKPDPEPGRMDHHQPGTGRYQVALTSNNITLAIRV STYTFWRDTNIQFVPLPFTDKFNDLIQNITITLESSLIILTRSIHTPQNKAAKRKGNLPG AGYFYKFQLPATSLPVFFNIQNAQLLNEEDNSESSAIEQPPTSNPAPQIVQAASSAPALE TDSSPPPYSSITVEVPTTSDTEVYGEFYPVPPPYSVATSLPTYDEAEKAKAAAMAAAAAE TSQRIQEEECPPRDDFSDADQLRVGNDGIFMLAFFMAFIFNWLGFCLSFCITNTIAGRYG AICGFGLSLIKWILIVRLATQYFPVASREWLSENVSSHDFNNYLKGHSELSTLTTNKTLP REAERKYLGSQLTREVKDLYKVNYKPLLKEIRDDTNKWKNIPCHG >gi568815585f:79381486_79651117|GENSCAN_predicted_CDS_4|1398_bp atggcacgccggcggagccagcgagtctgcgcgagcggtccgagcatgctcaatagcgcg cgcggcgccccggagcttctccgcggaaccgcgaccaacgcggaggtctcggcggccgct gcgggagccacaggaagtgaagagcttccgccgggagaccgcggctgcaggaacggaggc ggaaggggccctgcggcgacgacgtcgtcgacgggggtggccgtgggagctgagcacgga gaagactccctctctcggaagccggatcccgagccgggcaggatggatcaccaccagccg gggactgggcgctaccaggtggctctaacttcaaataacatcactctggcgattagggtt tcaacatatacattttggagagacacaaacattcagtttgtaccactgccctttacagat aaatttaatgacctgatacagaatattaccatcaccctagaaagttcccttattatcctt actagatcaattcataccccccagaataaagctgcaaaaagaaaaggaaacttacctggt gcaggttacttttacaagtttcagcttcctgctacatccctgcctgtcttttttaatatt cagaatgcccagcttcttaatgaagaggataactcagaatcatcggctatagagcagcca cctacttcaaacccagcaccgcagattgtgcaggctgcgtcttcagcaccagcacttgaa actgactcttcccctccaccatatagtagtattactgtggaagtacctacaacttcagat acagaagtttacggtgagttttatcccgtgccacctccctatagcgttgctacctctctt cctacatacgatgaagctgagaaggctaaagctgctgcaatggcagctgcagcagcagaa acatctcaaagaattcaggaggaagagtgtccaccaagagatgacttcagtgatgcagac cagctcagagtggggaatgatggcattttcatgctggcatttttcatggcatttattttc aactggcttggattttgtttatccttctgtatcaccaataccatagctggaaggtatggt gctatctgcggatttggcctttccttgatcaaatggatccttattgtcaggttggcaacc cagtacttcccagtcgcttctagagaatggcttagtgaaaatgtcagcagccatgatttc aacaactatctcaaaggtcattccgagctgtccacattgaccaccaacaaaactttgcca agggaagcagaaagaaaatacctaggaagtcagttaacaagggaagtgaaggacctctac aaggtgaactacaaaccactgctcaaagaaatcagagatgacacaaacaaatggaaaaac attccatgccatggatag >gi568815585f:79381486_79651117|GENSCAN_predicted_peptide_5|122_aa MNTGRHQKVERRQYIGWEPYDMRNNTGEVPPLTTRTVIVIYSVVNRGLYDTGWNMMSVLS TVARRMKILIMALGQLKGSDSQAEFHGRFDSSFIAASCVRSLNRLNYRTRGGPGIAQPFV DI >gi568815585f:79381486_79651117|GENSCAN_predicted_CDS_5|369_bp atgaacactggaagacatcaaaaggtggagagaaggcagtacattggctgggagccttat gatatgagaaacaacacaggtgaagttccgcccttgaccacacgcacagttattgttatt tattcagtggtaaatcgtggcctgtacgacactggatggaatatgatgtcagtgctttca acagtagccagaagaatgaagattctaatcatggctcttggtcagctcaagggctctgat agtcaagctgaatttcacgggaggtttgattcatctttcattgctgcctcctgtgttagg tctcttaacagattgaattatagaactagaggaggcccaggaatagcccaaccatttgtg gacatttag >gi568815585f:79381486_79651117|GENSCAN_predicted_peptide_6|58_aa MGTTTKILYEKINPKTHNHQILQGRNQEKMLRAVREKGQVTYKGKPIRLTVDLSAETL >gi568815585f:79381486_79651117|GENSCAN_predicted_CDS_6|177_bp atggggaccaccactaagatactctatgagaagatcaaccccaagacacataatcatcag attctccaaggtcgaaatcaggaaaaaatgttaagggcagtcagagagaaaggccaggtc acctacaaagggaagcccatcagactaacagtggacctttcagcagaaaccctataa >gi568815585f:79381486_79651117|GENSCAN_predicted_peptide_7|535_aa MKPQTLAVSVTVLKGGVSEFAPSDVQMCLEFLPSGVKLQTFAVSVTALKVAQLELFVPPG GLVISLASGVKLQTFTAPPDSGAQLASPSRSRTGAAGGAACQSRAMRLHSSAVGLSMGLG AVEQGMALVEEARAAQEPTERVGGSGMAGCSSRALPRGKAAKAQREIERSAVIQLVKGGR DCYGLNVVSPPKFMLKPNHQGDGIGRGSSIRVLCWTPANFSRDGTRFKRLKKRPRASKQD IESYWDGPVVAGPIHHTGTVQWWQDRSTIQERSSGGRLDRRPTTTCKKHAVDTAPSLRTL PSNLHVVHLPYTRHHLESGDLALTKQLNLLAPRSWNYPSSELSRRCKRWAPTTLGSSAPV ALQGTAPSWLLAQAGVECLAFQVHGVFFEVLVFGSFSSLARQGKTSQSVQVVIKFNWRFS SPCGLSPVPLEAILKDACEAQHHMEAAKAWGLYPLKSWPEWYLVPFSMDGAAGTQGTNSL GCTQCEGYLFLQASRSHPTNAFGLSPWILANVCIFLIGFGFCQSVSHGTSGFIPT >gi568815585f:79381486_79651117|GENSCAN_predicted_CDS_7|1608_bp atgaagccgcagaccctggcagtgagtgttacagttcttaaaggcggcgtgtcggagttt gctccttctgatgttcagatgtgtttggagtttcttccttctggagtgaagctgcagacc ttcgcggtgagtgttacagctcttaaggtggcgcaactggagttgttcgttcctcccggt gggctcgtgatctcgctggcttcaggagtgaagctgcagaccttcacggccccaccagac tcaggagcccagctggcttcacccagtagatcccgcactggggctgcaggtggagctgcc tgccagtcccgtgccatgcgcctgcactcctcagccgttgggttgtcgatgggactgggc gcagtggagcaggggatggcgctcgtcgaggaggctcgggccgcacaggagcccacggag agggtgggaggctcaggcatggcgggctgcagttcccgagccctgccccgtgggaaagca gctaaggcccagcgagaaattgagcgcagcgccgtcatacaactggtaaagggtggcaga gactgctatggtttgaatgtggtgtcccccccaaaattcatgttgaaacctaaccatcaa ggtgatggtattggaagaggaagcagcatcagggtgttgtgttggacccctgcaaacttc agtagggatggcaccaggttcaagaggctgaagaagagacccagagccagcaaacaagac atagagtcttactgggacggtccagtggtggcaggaccgatccaccatacaggaacggtc cagtggtggcaggaccgatccaccatacaggaacggtccagtggtggcaggctagacagg agacccacaaccacttgcaaaaagcatgcagttgatacagcaccttcactcagaaccctc cccagcaacctccatgtggtgcatctaccatacacaaggcaccatcttgagagtggagac ttggccctcaccaaacaattgaacctgctggcacctagatcttggaactacccgtcttca gaactgtcacgccgatgtaagaggtgggctcccacaaccttgggcagctctgcccctgtg gctttgcagggtacagccccctcgtggctgcttgcacaggctggtgttgagtgtctggct ttccaggtgcatggtgtgttctttgaggttcttgtatttggaagttttagttctctagca aggcaggggaagacttctcaatcagttcaagttgttataaagttcaattggaggttttct tctccctgtggcctttccccagtgcctctggaagccatccttaaggacgcctgtgaggct caacaccacatggaagctgccaaggcttggggcttgtaccctctgaagtcatggcctgag tggtaccttgtcccttttagcatggatggagcagctggaacacagggcaccaactcccta ggctgtacacagtgtgagggctatctttttctccaagcttcccggagccaccctaccaat gcgtttggattgtctccctggatccttgccaacgtgtgtattttcctcatcggttttggt ttctgtcagtcagtgtcacatggcacttccggcttcatccctacttga