GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:04:50 Sequence gi568815591r:22022409_22293661 : 271253 bp : 40.63% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 2439 2434 6 1.05 1.03 Term - 7394 7207 188 0 2 90 34 209 0.907 12.37 1.02 Intr - 9261 9058 204 1 0 -24 47 189 0.152 1.85 1.01 Init - 10073 9896 178 2 1 100 7 139 0.141 5.99 1.00 Prom - 18516 18477 40 -6.15 2.00 Prom + 26318 26357 40 -3.25 2.01 Init + 27809 27977 169 1 1 35 49 175 0.741 8.04 2.02 Intr + 41186 41229 44 0 2 104 93 53 0.156 4.44 2.03 Intr + 41848 42077 230 0 2 59 92 77 0.590 0.94 2.04 Intr + 58542 58707 166 0 1 52 -9 191 0.333 4.84 2.05 Term + 60317 60817 501 1 0 15 49 213 0.166 3.59 2.06 PlyA + 61712 61717 6 1.05 3.04 PlyA - 62638 62633 6 -1.75 3.03 Term - 63828 63297 532 2 1 52 53 191 0.781 4.73 3.02 Intr - 64175 63938 238 0 1 -3 92 183 0.539 5.35 3.01 Init - 69236 69083 154 2 1 60 117 108 0.768 11.10 3.00 Prom - 70325 70286 40 -6.25 4.05 PlyA - 72921 72916 6 1.05 4.04 Term - 74928 74702 227 1 2 47 41 140 0.532 1.16 4.03 Intr - 77034 76889 146 2 2 83 46 154 0.806 9.71 4.02 Intr - 77817 77768 50 0 2 84 76 46 0.143 -0.44 4.01 Init - 79713 79492 222 0 0 81 97 152 0.227 14.00 4.00 Prom - 82417 82378 40 -7.25 5.03 PlyA - 82937 82932 6 1.05 5.02 Term - 83858 83409 450 2 0 -12 39 347 0.993 14.00 5.01 Init - 84364 84203 162 1 0 56 40 154 0.443 7.18 5.00 Prom - 84611 84572 40 -5.35 6.20 PlyA - 85044 85039 6 1.05 6.19 Term - 86773 86503 271 0 1 28 54 191 0.126 3.67 6.18 Intr - 88821 88532 290 0 2 43 26 202 0.315 4.62 6.17 Intr - 98312 98083 230 0 2 42 42 199 0.223 7.37 6.16 Intr - 99965 99773 193 2 1 100 -22 119 0.256 0.24 6.15 Intr - 100113 100004 110 1 2 76 45 114 0.876 4.98 6.14 Intr - 103250 103196 55 2 1 96 97 47 0.820 4.03 6.13 Intr - 114575 114525 51 2 0 108 98 28 0.690 3.99 6.12 Intr - 117707 117617 91 1 1 90 83 74 0.788 6.18 6.11 Intr - 122814 122636 179 0 2 55 89 123 0.903 6.90 6.10 Intr - 124611 124489 123 0 0 57 84 42 0.539 0.56 6.09 Intr - 128096 127999 98 0 2 41 87 156 0.998 9.61 6.08 Intr - 132172 132047 126 2 0 54 98 209 0.699 18.23 6.07 Intr - 134480 134402 79 2 1 89 87 31 0.973 1.31 6.06 Intr - 135477 135447 31 2 1 111 121 1 0.986 2.91 6.05 Intr - 138207 138110 98 0 2 115 89 69 0.966 7.59 6.04 Intr - 140120 139989 132 2 0 32 84 103 0.804 4.22 6.03 Intr - 159848 159809 40 1 1 104 50 52 0.066 0.31 6.02 Intr - 167255 167126 130 0 1 68 116 56 0.628 5.23 6.01 Init - 171253 170959 295 1 1 75 110 525 0.927 50.79 6.00 Prom - 172340 172301 40 -6.35 7.09 PlyA - 173940 173935 6 1.05 7.08 Term - 176760 176650 111 0 0 94 47 47 0.206 -1.22 7.07 Intr - 178962 178845 118 2 1 68 40 96 0.277 2.35 7.06 Intr - 180929 180879 51 1 0 89 105 29 0.221 1.80 7.05 Intr - 181252 181152 101 1 2 76 60 58 0.212 -0.21 7.04 Intr - 197583 197458 126 0 0 43 87 162 0.076 11.56 7.03 Intr - 208511 208438 74 0 2 53 80 146 0.923 8.51 7.02 Intr - 244604 244556 49 2 1 82 119 39 0.424 3.73 7.01 Intr - 268833 268767 67 2 1 117 95 90 0.895 10.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:22022409_22293661|GENSCAN_predicted_peptide_1|189_aa MPEPPRCSVGSCAARASPTSAAPCSKAPSPINHPRAEECGHMARDWQAAPPAAPVRDPLA SLLSQRDHEPHQKEETPNTSEHQKEKTPDTPPLRTVTLTVRVHGFILEVSETKNPPIPDT KLRGGGCKGSVSLALGVKLQTFAVSVTAHKGSVDPKSEQQQDLLQRAKDQSFRSVEGDPS GLPLLAQAA >gi568815591r:22022409_22293661|GENSCAN_predicted_CDS_1|570_bp atgcctgagcctccccgctgctccgtgggctcctgtgcggcccgagcctcccctacgagc gctgccccctgctccaaggcgcccagtcctatcaaccacccaagggctgaggagtgcggg cacatggcgcgggactggcaggcagctccacctgcggccccagtgcgggatccactggct tcactcctgagccagcgagaccacgaaccccaccagaaggaagaaactccgaacacatcc gaacatcagaaggaaaaaactccggacacgccgcctttaagaactgtaacactcaccgtg agggtccacggcttcattcttgaagtcagtgagaccaagaacccaccaattccagacaca aaactgagaggcggaggttgcaagggttcggtctcgctggctttaggagtgaagctgcag accttcgccgtgagtgttacagctcataaaggcagtgtggacccaaagagtgagcagcag caagatttattgcaaagagcgaaagatcaaagcttccgcagtgtggaaggggacccgagc gggttgccactgctggctcaggcagcctga >gi568815591r:22022409_22293661|GENSCAN_predicted_peptide_2|369_aa MKETLEPDSSIGNEYGENENIMHQIGRTEDGVTISGHKHRVCLSVFSGQLVTAEYPPCKM CLASSSSSAMTVVLHSSSQLSYEIVDLKHYIELGAASDLKKALIVHNFICTFAGDGERGS IFIEFLNLRIGKYFKVISSFNVHNNPMRYLPNANTEDLAIFTAKSLPSWSLYGSFGKGSK TVENTSQVGRSRGKRRGKKAHLLFLIRNSRVETQDLHPWDHTAFREPSTCQLQVGRGSSG TWDPTGPGRVKSRFLPASPKGPLPPPQQEQSLSGNCGCLRLGLLFSTPVPQHRVRRRPAS RGLADAHSPDHSDTCRGPAEPRCTATQRGGGSGLGPFSGPHPPPVTTVRCQRGGCGNGQR GLAGGLVVF >gi568815591r:22022409_22293661|GENSCAN_predicted_CDS_2|1110_bp atgaaggagacgctggaacctgacagcagtattggaaatgaatatggagaaaatgaaaat attatgcaccaaattggcagaaccgaagacggagtgactattagtggacacaaacataga gtgtgcctgagtgttttctcaggccagctggtgactgcagaatatccgccatgtaagatg tgccttgcttcctcttcgtcttctgccatgactgtagtactacattcatcctcacaactg tcctatgaaatagttgatcttaagcactacattgagttaggtgctgcaagtgatctcaaa aaggctctcatcgtccataacttcatatgtacatttgctggggatggggagagagggagt atttttatagagttcttaaatctcaggattggaaagtactttaaagttattagctcattt aatgttcacaataatcccatgagatatttgccaaatgccaacactgaagatttagcaatt ttcacggcaaagtccctgccctcatggtctttatatggaagttttggcaaaggctccaag acagtagaaaatacatcccaagtgggaaggagtcgtggaaaaaggagaggaaagaaagcc cacttgcttttcctaatcagaaactccagggtggagacgcaggatctgcatccctgggat cacactgcctttcgagaaccatccacctgccagctccaagtaggtcgagggagcagcggc acctgggatcccaccggaccgggaagagtcaaatccaggtttctccctgcttcaccaaaa ggccccctgcctccaccgcaacaggaacagtctctgtctgggaactgcggttgcctaagg ctcggtttgctcttcagcactccggtcccccaacaccgcgtcaggcggcgcccagcctcc cggggactcgcagacgctcactccccagaccacagcgacacctgccgaggccccgcggaa ccgcgctgcacagcaacccagcgcggcggggggagtggcctggggccattctccggcccc caccccccgcccgtgaccaccgtgcgctgccagaggggaggctgcgggaatggccagcga ggtctcgcagggggactggttgtcttttag >gi568815591r:22022409_22293661|GENSCAN_predicted_peptide_3|307_aa MKPRTLAVSVTVIKDGVSGVCSFRCSDVSGVYSFWWIRGLADFRSEAADLPDLWNFKLER GDLGYLVEEISKQQSIQEEAEHRSLESLQPKHGIEEKNPFSGEKFKPAAEICISNEEPNF NLQNNGERSPRDFVPYIPAASAVAKRGQSTAQAVASEGASPKPWHLPCGVEPAGTQTSRI EVWEPLPRFQRMYGNARMFRQKFAAVVEHSWRTSARAVRKGNVGSEPPHRVLTKALSSGA VRRQPPSSRLQNGRSTNSLHHVPGKATDTQHQSVKTARKRGQYPAKPHRWSCPRPWEPTS CISVTWM >gi568815591r:22022409_22293661|GENSCAN_predicted_CDS_3|924_bp atgaagccgcggaccctcgcagtgagtgttacagttattaaagatggtgtgtccggagtt tgttccttcagatgttcagatgtgtctggagtttattccttctggtggattcgtggtctc gctgacttcaggagtgaagctgcagaccttcccgatctgtggaacttcaaacttgagaga ggtgatttagggtatctggtggaagaaatttctaagcagcaaagcattcaagaggaagca gagcatagaagtttggaaagtttgcagcctaaacatggaatagaagagaaaaacccattt tctggggagaaattcaagcctgctgctgaaatttgcataagtaatgaggagccaaatttt aatctccaaaacaatggggaaaggtctccacgggactttgtgccttacatcccagccgct tcagctgtggctaaaaggggccagagtacagctcaggctgtggcttcagagggtgcaagc cccaagccttggcaccttccatgtggtgttgagcctgcaggtacacagacgtcaagaatt gaggtttgggaacctctgcctagatttcagaggatgtatggaaatgccaggatgttcagg cagaagtttgctgcagtggtggagcactcatggagaacctctgctagggcagtgaggaag ggaaatgtggggtcagagcctccacacagagtccttactaaggcactgtctagtggagct gtgagaagacagccaccatcatccagactccagaatggtagatccaccaacagcttgcac catgtacctggaaaagccacagatactcagcaccagtctgtgaaaacagccaggaagagg gggcaataccctgcaaaaccacacaggtggagctgcccaaggccgtgggaacccacctct tgcatcagtgtgacctggatgtga >gi568815591r:22022409_22293661|GENSCAN_predicted_peptide_4|214_aa MYNQTKSKMVKGSHPSEKVLILYPVARPEQVFKVRSHQLTERSSFQDEDPVTTSQAHVTT IPQPSSKRFAPFTQLAEEENVQAVFTNGPGWASDSTIIRRLCSKGGGHIIRRSMCHSANH IIPQLPADKAMEWLKFQLRAHLVGRRDRLQFRGSPRPPSTSVDSLEGFTELRRVAILAVT ICYNGGTQTKISRGNDTALMSGGTPGFFLSSQIG >gi568815591r:22022409_22293661|GENSCAN_predicted_CDS_4|645_bp atgtacaatcaaacaaaatcgaagatggttaagggcagtcatcccagtgaaaaggtcctc atcctttacccagttgctagacccgaacaagttttcaaagtcagatcccaccaactgaca gagaggtcgagtttccaggatgaagaccctgtgacaacaagtcaagcgcacgtgacaaca attccacagccttcctcaaagagatttgctccatttactcagctggcagaagaggaaaac gtccaagctgtttttacaaatgggccaggttgggcctcagacagcactattatcagacga ctttgcagcaaaggaggtgggcacattatccgaagatccatgtgtcatagcgcaaaccac attattcctcagctgcccgctgacaaggcaatggaatggctgaagttccagctcagagct catctggtgggaagaagagacaggctgcagttcagggggtccccaagaccaccgtcaact tccgttgattctctcgaaggattcacagaactcagaagagttgctatacttgcagttaca atttgttacaatggaggaacacagactaaaatcagcagaggaaatgatacggctctgatg agtggaggaacaccagggttcttcctctcaagtcaaattggataa >gi568815591r:22022409_22293661|GENSCAN_predicted_peptide_5|203_aa MMLDIKQIQVIFLFEFKMGRKIAETTRNIDNAFGPGTANECTVQWQFKETRSVVVLVTVW WSTAGLIHCSFLNPSGTIISEKYAQQTDEMHRKLQCLQPAFVNRKGPILLHDNAQPKFQK LNKLGYKVLPHLPHSPDFSPTDYHFFKHLDNFLQGKCFPNKQDAENAFQEFVESRSMDFY TAGINKLINHWQKCVDCNGSCFD >gi568815591r:22022409_22293661|GENSCAN_predicted_CDS_5|612_bp atgatgttagacataaagcaaattcaagtgattttcttattcgagttcaaaatgggtcgt aaaatagcagagacaactcgtaacatcgacaatgcatttggcccaggaactgctaacgaa tgtacagtgcagtggcagttcaaggagacaaggagcgtagtggtcctggtcactgtttgg tggtctacggccggtctgatccactgcagctttctgaatcccagcggaaccattatatct gaaaagtatgctcagcaaactgatgagatgcaccgaaaactgcaatgcctgcagccggca ttcgtcaacagaaagggcccaattcttcttcatgacaatgcacaaccaaagtttcaaaag ttgaacaaattgggctacaaagttttgcctcatctgccacattcaccggacttctcgcca actgactaccacttcttcaagcatctcgataactttttgcagggaaaatgcttccccaac aagcaggatgcagaaaatgcattccaagagttcgtcgaatcccgaagcatggatttttac actgcaggaataaacaaacttattaatcattggcaaaaatgtgttgattgcaatggttcc tgttttgattaa >gi568815591r:22022409_22293661|GENSCAN_predicted_peptide_6|873_aa MGSSRLRVFDPHLERKDSAAALSDRELPLPTFDVPYFKYIDEEDEDDEWSSRSQSSTEDD SVDSLLSDRYVVVSGTPEKILEHLLNDLHLEEVQDKETGSGFDLKYFEMLEENQIVSGAL SFPRNWANITGHEVYMRELRGRDLVSSPQTVVGFKKYQGKEENSDVPRRKRKVLHLVSQW IALYKDWLPEDEHSKMFLKTIYRNVLDDVYEYPILEKELKEFQKILGMHRRHTVDEYSPQ KKNKALFHQFSLKENWLQHRGTVTETEEKHSYVSVKAKVSSIAQEILKVVAEKIQYAEED LALVAITFSGEKHELQPNDLVISKSLEASGRIYVYRKDLADTLNPFAENEESQQRSMRIL GMNTWDLALELMNFDWSLFNSIHEQELIYFTFSRQGSGEHTANLSLLLQRCNEVQLWVAT EILLCSQLGKRVQLVKKFIKIAAHCKAQRNLNSFFAIVMGLNTASVSRLSQTWEKIPGKF KKLFSELESLTHMIADTVRTLRHCRTNQFGDLSPKEHQELKSYVNHLYVIDSQQALFELS HRIEPRHFELRECLCQARCFPVRKEVAEFYQYNPRHSQESQPKRVQEVMSATREGERFLH ATLGTRRQKERFYRRASSRAGPFREMLHAAVMECGPQDTRAVRTGEQTSLTAGLSKPSKA LLTQLLTIAWAYSTSTCSHLFPMPWTKGTFYIIHIWDFTSQSHSMSWGVVLDEYLLNNDA SNLQVQAPSIVSTAMWSNFDAQNLFKSNQIDTTHLGNKPSQRATTSWGSRHSSPSNHTST LTSRTGSESTTPEVCASCTWSRRPGTTVSPFFLEAPEQRGTLAELCSEATLSLIRAAESP IVCSPVNQSSLFTAILTILSGLITLAAHRTHED >gi568815591r:22022409_22293661|GENSCAN_predicted_CDS_6|2622_bp atgggcagctcccggctgagggtctttgaccctcatttggagaggaaagattccgccgcg gcgctctcagaccgagagctgcccttgcctaccttcgatgtgccttatttcaaatacatc gacgaggaggatgaggacgatgaatggagcagccgctcgcagtcttccaccgaggatgac tcagtggactctctgctctctgacagatatgtggtggtgtccgggaccccggagaagatt ttggagcaccttttgaatgacttgcacctggaagaagtccaggacaaagaaacaggttca ggttttgacttgaaatattttgaaatgttagaggaaaatcaaattgtttctggagcttta tcttttcccagaaattgggcaaacatcactgggcatgaggtttacatgagagaattacga ggaagagatcttgtgtcatctccccagacggttgttggatttaaaaagtatcaaggcaaa gaggaaaactcagacgttccgcgtaggaaacgtaaagtcttgcatcttgtttcccagtgg attgctctgtacaaagactggttacctgaagatgaacattcaaaaatgtttttaaagacc atatataggaatgtactggatgatgtttatgaatatccaatacttgaaaaagaattgaaa gaatttcaaaagatacttggaatgcaccgtcgtcacactgtagatgaatattcaccacaa aaaaagaataaagcccttttccaccaattcagtcttaaggagaactggctccagcataga ggaactgtgactgaaacggaggaaaagcactcctatgtcagtgtgaaggcaaaagtttcc agtatagcccaagagatcctaaaagtcgtggcagaaaagatccagtatgcagaagaggat ctggctctggtggccatcacattctctggggaaaagcatgaacttcagccaaatgactta gtcatctccaaatccctcgaggcatctggtcgaatatatgtctaccggaaagacctggcg gacactttgaacccatttgcagaaaatgaggaatcacagcaaaggtcgatgaggattttg ggaatgaacacttgggatcttgctctggaattaatgaattttgattggagtctattcaat tcaattcacgagcaagagctgatctacttcacgttcagcagacagggaagtggggaacac actgcaaatctcagccttctgctccagagatgcaatgaggtccagctttgggtggccacg gagattctgctctgcagccagctgggcaagcgagtgcagctggtgaaaaaattcatcaaa attgcggctcactgcaaagcccagagaaacctgaattctttctttgccattgtgatgggt ctcaacactgcttctgtcagtcgactgtcgcagacctgggagaaaatccctgggaagttt aagaaacttttctctgaacttgaaagtttaacacatatgatcgcagacactgtccgaacc ctgagacactgcaggactaaccagtttggtgacctgtctccaaaagagcatcaagagtta aagtcctatgttaatcacctgtatgtcattgacagccagcaggctctgtttgagctctca cacaggatcgagcctcggcactttgagctacgggaatgtctatgccaagcacgttgcttt cctgtgagaaaagaagttgctgagttttatcagtataacccaagacattcacaggaaagc cagccaaagcgtgttcaggaagtgatgtcagccaccagagagggggagaggtttctccat gctactctcgggacaagaaggcagaaggagaggttttatagaagggcatcctccagggct ggtccattcagagaaatgctgcatgctgccgtcatggaatgtggcccacaggacaccaga gccgtgagaaccggagagcagacttccctcacggctgggctgagcaaaccctccaaagcc ctcctcacgcagttactaacaatagcatgggcttacagcacaagcacgtgttctcacctt tttcctatgccctggactaaggggactttctacatcatccacatctgggatttcaccagt caaagccattctatgagctggggtgttgtgctggatgaatacttgctcaataatgatgct tcaaacctccaggtccaagctccaagcattgtcagcacagccatgtggagcaacttcgat gcacagaacctcttcaagtccaaccaaatagacaccacccacttgggcaacaagccatct cagagggcaaccacctcatggggctctcgacactcttcccctagcaaccatacttcaact ttgacttcacgaacgggttccgagtcaaccacgccagaggtctgcgccagctgcacttgg tctaggaggcctgggaccaccgtcagcccattcttccttgaggctccggaacagcgtggg acattagctgagctgtgctcagaagcaactctctcgctgattagggctgccgagagtccc atcgtctgctcacctgtcaatcaatcaagcctcttcactgccatcctgaccatcctgtca gggctgatcaccttggctgctcacagaacccatgaggactga >gi568815591r:22022409_22293661|GENSCAN_predicted_peptide_7|232_aa XSHQKIEDSEESSDEILVRLTSAVQRELAAVIALKARKSAIEQDEENNDKHVAVTEAESV PDSQAGVMCKLQERDEIGRIELVQKLAKENYQFLQTDKKEQEKSEHGEWMYLRVSAEGNN HHIWNKEQQKAVAFQLAHFSFWSLEVYTSWRVRFAGSESSPHTVECITLDLLSSCSEWDA ASGKHRQEIRGWKERKNESPCMAFPNQTGHPELKTGYARTFDSHFGLDLPVA >gi568815591r:22022409_22293661|GENSCAN_predicted_CDS_7|699_bp nngtctcatcagaaaattgaagactccgaagaaagcagtgatgaaattcttgtgcgtcta acatctgcggtgcagagagagctagcagctgttattgctttgaaagcaaggaagtctgca attgaacaagatgaagaaaacaacgacaaacatgtagctgtaacagaagccgaaagtgtt ccagattctcaggcaggggtgatgtgcaagctccaggaaagagatgaaatcggacgaatt gaactagtccagaagctggcaaaagaaaactatcagtttttgcagacggacaaaaaagaa caggagaagtctgaacacggcgaatggatgtacctccgagtctctgcagaaggaaataac caccatatctggaacaaggagcagcagaaagcagtggcattccagctggctcacttcagt ttttggtctctagaggtgtacacttcttggcgagtgagatttgcggggagcgagtcctca ccacacactgtggaatgcatcaccctggatttattgtcctcctgctctgaatgggatgca gccagtgggaagcatcggcaagagatcagagggtggaaggagagaaagaatgaaagccct tgtatggctttcccaaaccagacaggccatccagaactaaaaaccggctatgctagaact tttgactctcactttggcctagacctccctgttgcctga