GENSCAN 1.0 Date run: 3-Nov-116 Time: 05:44:35 Sequence gi568815588f:68888833_69116413 : 227581 bp : 43.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12508 12639 132 0 0 112 91 76 0.094 11.14 1.02 Intr + 17879 18175 297 1 0 103 64 212 0.994 17.37 1.03 Intr + 21475 21550 76 0 1 62 91 40 0.693 0.79 1.04 Intr + 22269 22414 146 1 2 41 84 45 0.326 -0.60 1.05 Intr + 24330 24447 118 2 1 55 87 55 0.878 2.14 1.06 Intr + 24559 24744 186 2 0 67 36 222 0.503 14.56 1.07 Intr + 25227 25372 146 1 2 47 100 10 0.449 -1.90 1.08 Intr + 31000 31149 150 0 0 65 111 48 0.490 5.16 1.09 Intr + 45367 45528 162 0 0 101 91 99 0.993 11.67 1.10 Intr + 47174 47247 74 1 2 110 85 43 0.966 4.40 1.11 Intr + 57520 57822 303 1 0 78 39 187 0.091 8.51 1.12 Intr + 67162 67480 319 1 1 90 52 126 0.728 5.16 1.13 Intr + 70974 71417 444 2 0 50 88 441 0.863 33.90 1.14 Intr + 76545 76662 118 2 1 85 82 58 0.887 4.94 1.15 Intr + 78186 78371 186 1 0 56 84 253 0.982 21.36 1.16 Intr + 80144 80289 146 0 2 91 84 40 0.999 3.90 1.17 Intr + 81369 81518 150 2 0 13 77 122 0.774 3.96 1.18 Intr + 83059 83220 162 0 0 61 95 36 0.766 1.77 1.19 Intr + 85838 85911 74 1 2 81 91 33 0.993 1.10 1.20 Intr + 88697 88856 160 2 1 23 79 159 0.770 8.49 1.21 Intr + 90010 90144 135 0 0 69 95 70 0.930 6.56 1.22 Intr + 93757 93921 165 0 0 76 26 230 0.263 15.76 1.23 Intr + 99721 99919 199 0 1 78 97 98 0.270 8.62 1.24 Intr + 100023 100426 404 1 2 34 94 692 0.268 58.55 1.25 Intr + 111592 111690 99 0 0 114 87 -4 0.315 2.41 1.26 Intr + 116214 116293 80 2 2 70 113 64 0.998 5.45 1.27 Intr + 116900 117083 184 2 1 97 121 102 0.998 14.19 1.28 Intr + 126709 127527 819 0 0 137 49 581 0.072 50.72 1.29 Intr + 134138 134394 257 1 2 -1 41 458 0.010 28.44 1.30 Term + 134597 134915 319 2 1 -20 48 414 0.956 20.95 1.31 PlyA + 136058 136063 6 1.05 2.05 PlyA - 137582 137577 6 1.05 2.04 Term - 173159 172886 274 1 1 32 40 217 0.805 6.24 2.03 Intr - 173343 173310 34 1 1 86 88 46 0.499 1.68 2.02 Intr - 180853 180700 154 1 1 58 55 87 0.325 2.05 2.01 Init - 182872 182831 42 1 0 57 64 52 0.358 0.12 2.00 Prom - 185548 185509 40 -3.16 3.00 Prom + 194791 194830 40 -7.46 3.01 Init + 199329 199404 76 2 1 80 116 130 0.801 14.25 3.02 Intr + 208252 208399 148 2 1 53 94 60 0.302 2.39 3.03 Intr + 215039 215212 174 2 0 72 -12 162 0.024 3.65 3.04 Intr + 221895 222039 145 0 1 -3 74 114 0.013 1.18 3.05 Intr + 223609 223716 108 0 0 75 32 90 0.414 2.58 3.06 Intr + 225832 225961 130 0 1 74 39 84 0.590 2.37 3.07 Intr + 226000 226137 138 2 0 -18 18 220 0.413 4.84 3.08 Term + 226522 226727 206 2 2 31 39 169 0.484 3.83 3.09 PlyA + 226850 226855 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 126709 127584 876 0 0 137 36 563 0.928 48.58 S.002 Init + 134144 134394 251 1 2 72 41 453 0.873 35.94 S.003 Init + 167268 167309 42 2 0 77 98 47 0.855 5.02 S.004 Term + 215039 215288 250 2 1 72 42 170 0.959 5.98 S.005 Init + 221974 222039 66 0 0 63 74 67 0.810 3.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:68888833_69116413|GENSCAN_predicted_peptide_1|2069_aa VVVATVPGGRRRWPVMPGKLLWGDIMELEAPLEESESQKKERQKSDRRKSRHHYDSDEKS ETRENGVTDDLDAPKAKKSKMKEKLNGDTEEGFNRLSDEFSKSHKSRRKDLPNGDIDEYE KKSKRVSSLDTSTHKSSDNKLEETLTREQKEGAFSNFPISEETIKLLKVKTFGPVYEGKD LIAQARTGTGKTFSFAIPLIERLQRNQETIKKSRSPKVLVLAPTRELANQVAKDFKDITR KLSVACFYGGTSYQSQINHIRNGIDILVGTPGRIKDHLQSGRLDLSKLRHVVLDEVDQML DLGFAEQVEDIIHESYKTDSEDNPQTLLFSATCPQWVYKVAKKYMKSRYEQVDLVGKMTQ KAATTVEHLAIQCHWSQRPAVIGDVLQVYSGSEGRAIIFCETKKNVTEMAMNPHIKQNAQ CLHGDIAQSQREITLKGFREGSFKVLVATNVAARGLDIPEVDLVIQSSPPQGITFKRVGV PSTMDLVKSKSMDAIRQSGMIPTGYSQCQPNYLKLKNIMMETHLLIPDRGVAGQVVDQAG QAGQVVDLAAGQVDRVDKEVAQEVDKMVEDEVGIEIDQEVGATNGVLTEYLIVNLPVCLL DQGPQVVNSLNPTTVTGRCFEFTTRKAAFPVLPASPGSALLREVTRAGWTQGGELPLPLH AVEKTGRPGQPALKMPGKLRSDAGLESDTAMKKGETLRKQTEEKEKKEKPKSDKTEEIAE EEETVFPKAKQVKKKAEPSEVDMNSPKSKKAKKKEEPSQNDISPKTKSLRKKKEPIEKKV VSSKTKKVTKNEEPSEEEIDAPKPKKMKKEKEMNGETREKSPKLKNGFPHPEPDCNPSEA ASEESNSEIEQVLVLAPTRELANQVSKDFSDITKKLSVACFYGGTPYGGQFERMRNGIDI LVGTPGRIKDHIQNGKLDLTKLKHVVLDEVDQMLDMGFADQVEEILSVAYKKDSEDNPQT LLFSATCPHWVFNVAKKYMKSTYEQVDLIGKKTQKTAITVEHLAIKCHWTQRAAVIGDVI RVYSGHQGRTIIFCETKKEAQELSQNSAIKQDAQSLHGDIPQKQREITLKGFRNGSFGVL VATNVAARGLDIPEVDLVIQSSPPKGIKFKRIGVPSATEIIKASSKDAIRLLDSVPPTAI SHFKQSAEKLIEEKGAVEALAAALAHISGATSVDQRSLINSNVGFVTMILQCSIEMPNIS YAWKELKEQLGEEIDSKVKGMVFLKGKLSNQNWKDHGKDMEASGDSGKAVEASGDSGTET EDSEDSGKAVEAREDSDQEVATKGQSGDKLTAAGGGEGGGNTLIDNFISMRMVAVVIAAS SHWSLPSVLRFSGRVILVPIPQRPQRVESQVCEKFQAALALSRVELHKNPEKEPYKSKYS ARALLEEVKALLGPAPEDEDERPEAEDGPGAGDHALGLPAEVVEPEGPVAQRAVRLAVIE FHLGVNHIDTEELSAGEEHLVKCLRLLRRYRLSHDCISLCIQAQNNLGILWSEREEIETA QAYLESSEALYNQYMKEVGSPPLDPTERFLPEEEKLTEQERSKRFEKVYTHNLYYLAQVY QHLEMFEKAAHYCHSTLKRQLEHNAYHPIEWAINAATLSQFYINKDNIGELDLDKQSELR ALRKKELDEEESIRKKAVQFGTGELCDAISAVEEKVSYLRPLDFEEARELFLLGQHYVFE AKEFFQIDGYVTDHIEVVQDHSALFKVLAFFETDMERRCKMHKRRIAMLEPLTVDLNPQY YLLVNRQIQFEIAHAYYDMMDLKVAIADRLRDPDSHIVKKINNLNKSALKYYQLFLDSLR DPNKVFPEHIGEDVLRPAMLAKFRVARLYGKIITADPKKELENLATSLEHYKFIVDYCEK HPEAAQEIEVELELSKEMHGMVIYWDDMEKIWHHTFYNELHVAFEEYPVLLTKGPLNPKA NHEKMAQIMLETFNTPPMHVAIQVVLYLYSSGHTTGIVMDSGDRRWPHGSSWEKSYELSD GQVITSSNKRFHCPEALFQPSFLGMESCGIYKTTFKSIVKCDVDIHKDLYANTVLSGSTT MYPGIALQDAEGDHCPGSQHEEDQDHCSF >gi568815588f:68888833_69116413|GENSCAN_predicted_CDS_1|6210_bp gtggttgtggccactgtgcccggagggaggcggcggtggccagtaatgcctgggaaactc ctctggggggacattatggagctggaagcacccttggaggagtccgagagccagaagaag gagaggcaaaagagtgacagaaggaagtcaaggcaccattatgactcggatgagaaatca gaaacaagagaaaatggtgttacagatgacctggatgctcccaaggccaaaaaatctaaa atgaaagagaagctaaatggagacactgaagaaggatttaatagactttcagatgaattc tccaaatctcataagtcaagaagaaaagatctaccaaatggagatatagatgaatatgaa aaaaaatcaaagcgagtatcatctttagatacttctactcataaatcaagtgataataaa ctagaggagaccttaacacgtgaacagaaagaaggagccttctccaattttcctatttct gaagagactataaagcttctgaaagttaagacctttggtcctgtatatgaaggaaaagat ttaatagctcaagcacggacaggaacaggaaagacattctcttttgccatccccttaatt gaaagactccaaagaaatcaagaaacaattaaaaaaagccgctcaccaaaggtacttgtt ttggctccaacaagggaactggcaaaccaagtagccaaagacttcaaagatataactagg aaactcagcgtggcgtgtttttatggtggaacatcatatcaaagccaaattaatcatatt cgaaatggtattgacatcttggttggaacacctggtcgtatcaaagaccatctgcagagt ggccgattggatctttctaaactgcgacatgttgtgcttgatgaagtggatcagatgtta gatttaggtttcgctgaacaagttgaagatattattcatgaatcctacaaaactgattct gaagacaatcctcagactttacttttttctgcaacttgcccacagtgggtatacaaagtt gcaaaaaaatacatgaaatccagatatgaacaggttgaccttgttggaaaaatgactcaa aaggctgcaactactgtggaacatttggccatccagtgtcattggtctcagaggccagca gttattggagatgtccttcaagtctacagtgggtctgaagggagggctattattttctgt gagaccaagaagaatgtaactgaaatggccatgaatccacacataaaacagaatgcccag tgtttacatggggacattgcacagtcacaaagagaaattacactaaaaggcttcagagaa ggtagttttaaagttttggtggcaaccaatgtggctgcccgtggtttggacattcctgaa gttgacctggtgattcaaagttctcctcctcagggaattacttttaaacgtgtaggtgtt ccttctacaatggatttagttaaatctaaaagcatggatgccatcaggcagagtggcatg attccgactggatactctcagtgccagccaaattacctgaaattgaagaatattatgatg gaaacacatcttctaattccagacagaggagtggctggtcaagtggtcgatcaggccggt caggccggtcaggtggtcgatctggcggccggtcaggtagacagagtcgacaaggaagtc gctcaggaagtcgacaagatggtagaagacgaagtgggaatagaaatcgatcaagaagtg ggggccacaaacggagttttgactgagtatttgatagttaatctaccagtgtgccttcta gaccaagggccgcaagtggtcaattcactaaatccaaccacagtcacaggccgctgcttt gaatttaccacgcggaaagcagcttttcctgtgcttcctgcttcccccggaagtgccttg ctacgggaagtgacgagagccgggtggacccagggtggggaactacctcttcctctccac gcggttgagaagaccggtcggcctgggcaacctgcgctgaagatgccgggaaaactccgt agtgacgctggtttggaatcagacaccgcaatgaaaaaaggggagacactgcgaaagcaa accgaggagaaagagaaaaaagagaagccaaaatctgataagactgaagagatagcagaa gaggaagaaactgttttccccaaagctaaacaagttaaaaagaaagcagagccttctgaa gttgacatgaattctcctaaatccaaaaaggcaaaaaagaaagaggagccatctcaaaat gacatttctcctaaaaccaaaagtttgagaaagaaaaaggagcccattgaaaagaaagtg gtttcttctaaaaccaaaaaagtgacaaaaaatgaggagccttctgaggaagaaatagat gctcctaagcccaagaagatgaagaaagaaaaggaaatgaatggagaaactagagagaaa agccccaaactgaagaatggatttcctcatcctgaaccggactgtaaccccagtgaagct gccagtgaagaaagtaacagtgagatagagcaggtactggttcttgcacctacaagagag ttggcaaatcaagtaagcaaagacttcagtgacatcacaaaaaagctgtcagtggcttgt ttttatggtggaactccctatggaggtcaatttgaacgcatgaggaatgggattgatatc ctggttggaacaccaggtcgtatcaaagaccacatacagaatggcaaactagatctcacc aaacttaagcatgttgtcctggatgaagtggaccagatgttggatatgggatttgctgat caagtggaagagattttaagtgtggcatacaagaaagattctgaagacaatccccaaaca ttgcttttttctgcaacttgccctcattgggtatttaatgttgccaagaaatacatgaaa tctacatatgaacaggtggacctgattggtaaaaagactcagaaaacggcaataactgtg gagcatctggctattaagtgccactggactcagagggcagcagttattggggatgtcatc cgagtatatagtggtcatcaaggacgcactatcatcttttgtgaaaccaagaaagaagcc caggagctgtcccagaattcagctataaagcaggatgctcagtccttgcatggagacatt ccacagaagcaaagggaaatcaccctgaaaggttttagaaatggtagttttggagttttg gtggcaaccaatgttgctgcacgtgggttagacatccctgaggttgatttggttatacaa agctctccaccaaagggaattaagttcaaacgaataggtgttccttctgcaacagaaata ataaaagcttccagcaaagatgccatcaggcttttggattccgtgcctcccactgccatt agtcacttcaaacaatcagctgagaagctgatagaggagaagggagctgtggaagctctg gcagcagcactggcccatatttcaggtgccacgtccgtagaccagcgctccttgatcaac tcaaatgtgggttttgtgaccatgatcttgcagtgctcaattgaaatgccaaatattagt tatgcttggaaagaacttaaagagcagctgggcgaggagattgattccaaagtgaaggga atggtttttctcaaaggaaagctgagcaaccagaactggaaggaccacgggaaggatatg gaggcttcaggggacagcgggaaggcagtcgaggcttcaggggacagcgggacggaaaca gaagattcagaggacagcgggaaggcagtagaggcccgagaggacagcgatcaggaggtg gcaacaaaaggtcaatctggagataaactcaccgctgcagggggcggggaaggaggtggg aacacgctcattgacaacttcatttccatgagaatggttgctgtggtgattgctgcgtct tcccattggtcattgccgagcgtattgcggttctccgggagggttatattggttcccatt ccacagcggccgcaacgtgtcgagagccaggtctgcgagaaattccaggcggcgctcgct ctgtcgcgggtggaactgcataaaaatccggagaaggaaccatacaagtccaaatacagc gcccgggcgctactggaagaggtcaaggcgctgctcggccctgcgcctgaggacgaggat gagcggcctgaggccgaggacggcccgggtgccggtgaccacgccctggggctgccggct gaggtggtggagcccgaggggcccgtcgcccagcgagcggtgaggctggcagtcatcgag ttccacctcggggtgaaccacatcgacacggaggagctgtcggcgggggaggagcacctg gtgaaatgcctgcggctgctgcgcaggtaccggctctcgcacgactgcatctctctctgc atccaggcgcagaataacctgggtatcttgtggtctgaaagagaagaaattgaaactgca caggcttacctagagtcatcagaagcactatataatcagtatatgaaagaggttgggagt cctcctcttgatcctactgagcgttttcttcctgaagaagagaaacttactgaacaagag agatcaaaaagatttgaaaaggtttatactcataacctatattacctagctcaagtctac cagcatctggaaatgtttgagaaggctgctcactattgccatagtacactaaaacgccag cttgagcacaatgcctaccatcctatagagtgggctatcaatgctgctaccttgtcacag ttttacatcaataaggacaacataggagagcttgatcttgataaacagtctgaacttaga gctttaaggaaaaaagaactagatgaggaggaaagcattcggaaaaaagctgtgcagttt ggaaccggtgaactgtgtgatgccatctctgcagtagaagagaaagtgagctacttgaga cctttagattttgaagaagccagagaacttttcttattgggtcagcactatgtctttgag gcaaaagagttctttcagattgatggttatgtcactgaccatattgaagttgtccaagac cacagtgctctgtttaaggtgcttgcattctttgaaactgacatggagagacggtgcaag atgcataaacgcagaatagccatgctagagcccctaactgtagacctgaatccacagtat tatctgttggtcaacagacagatccagtttgaaattgcacatgcttactatgatatgatg gatttgaaggttgccattgctgacaggctaagggatcctgattcacacattgtaaaaaaa ataaataatcttaataagtcagcactgaagtactaccagctcttcttagactccctgaga gacccaaataaagtattccctgagcatataggggaagatgttcttcgccctgccatgtta gctaagtttcgagttgcccgtctctatggcaaaatcattactgcagatcccaagaaagag ctggaaaatttggcaacatcattggaacattacaaatttattgttgattactgtgaaaag catcctgaggccgcccaggaaatagaagttgagctagaacttagtaaagagatgcacggc atggtcatctactgggatgacatggagaagatctggcaccacaccttctacaatgagctg catgtggctttcgaggagtaccctgtgctgctgaccaaaggccccctgaatcccaaggcc aaccatgagaagatggcccagattatgttggagaccttcaacaccccacccatgcacgtg gccatccaggtcgtgctgtacctgtactcctctggccataccactggcatcgtgatggac tctggcgacaggagatggccccacgggtcctcctgggagaagagctatgagctgtcagat ggccaggtcatcaccagcagcaacaagcggttccactgccccgaggcgctcttccagcct tctttcctgggcatggaatcctgtggcatctacaaaactaccttcaaatccatcgtgaag tgtgacgtggacatccacaaagacctgtacgccaacacagtgctatctggcagcaccacc atgtaccctggcatcgctctgcaggatgcagagggggatcactgccctggctcccagcat gaggaggatcaagatcattgctccttctga >gi568815588f:68888833_69116413|GENSCAN_predicted_peptide_2|167_aa MRENVLISTFPSPQAGETGIKATHCLGPMSPGCPNPFATLGHLRPVLVRGCQHVTSSRPG LSDLQVLKVFGKKNKWRRSSRPSDLVGISSDLQKLLVWWAPDTWCFCCLGAAEDPEPCLS STLTEAAKYRKYTRSKGKHALQRRPLITIVYVGSNGTEFAARVFFLS >gi568815588f:68888833_69116413|GENSCAN_predicted_CDS_2|504_bp atgcgagagaacgtgctgatcagtacttttccatctccacaggcaggggagacaggaatc aaggccactcactgcctgggcccaatgtcccctggatgtcccaacccttttgctaccctg ggccatctacgaccagttctggtgcgaggatgccagcatgtaacctccagtaggccaggg ctgtcggatctccaggtgctaaaagtttttggcaagaaaaacaagtggaggaggagcagc cgcccgtcagacctggtggggatcagcagtgacctgcagaagttgctggtgtggtgggca ccggatacctggtgtttctgctgcctcggtgctgctgaggacccagagccctgcctgagc agcacgctcacagaggcagcgaagtacaggaagtacacaaggtcaaaaggaaaacacgct ctccagagacgtcctctgatcactatcgtgtacgttgggtcgaatggaacagaatttgct gctagagtgtttttcctctcctaa >gi568815588f:68888833_69116413|GENSCAN_predicted_peptide_3|374_aa MQKLLKCSRLVLALALILVLESSVQGYPTRRARYQWVRCNPDSNSANCLEEKGPMFELLP GESNKIPRLRTDLFPKTRIQDLNRIFPLSEDYSGSGFGSGSGSGSGSGSGFLTEMEQDYQ LVDESDAFHDNLRKIGRLGTKQLRVKACGWTNERRQSVKMYVSNVNAHQKTSITEEALNN QERLADGCKFLDAESQFIQGHAPFSWTAHSQRLIYVGTTEIMLRQKQIQAIFLFEFKTGL KAAETTRNINNPFGAGIANKHESLEDEKRSGRPWEFDNDQLRAIIEADPLTTTQEVAEKL NIDHSTAGIGPQKGPVLLHKNAQVQVVEPMLQKLNELGYEVLPHPPYLPDFLPTDYHSFK HLGNFLQGKRFHNQ >gi568815588f:68888833_69116413|GENSCAN_predicted_CDS_3|1125_bp atgcagaagctactcaaatgcagtcggcttgtcctggctcttgccctcatcctggttctg gaatcctcagttcaaggttatcctacgcggagagccaggtaccaatgggtgcgctgcaat ccagacagtaattctgcaaactgccttgaagaaaaaggaccaatgttcgaactacttcca ggtgaatccaacaagatcccccgtctgaggactgacctttttccaaagacgagaatccag gacttgaatcgtatcttcccactttctgaggactactctggatcaggcttcggctccggc tccggctctggatcaggatctgggagtggcttcctaacggaaatggaacaggattaccaa ctagtagacgaaagtgatgctttccatgacaaccttagaaagattggaagattgggcacg aagcagctcagggtaaaggcatgtggatggaccaatgagagaagacaaagtgtgaagatg tacgtatcaaatgtgaatgcccaccagaaaacatccatcacagaagaggcactaaacaac caggaaaggttggctgacggctgcaagttcttagatgcagagagccagttcatccaaggt catgctcccttctcatggacagctcacagtcaaagactcatctatgtggggactacggaa ataatgttaagacaaaagcaaattcaagctattttcttattcgagttcaaaacaggtctt aaagcagcagagacaactcgcaacatcaacaacccatttggcgcaggaattgctaacaaa catgagagccttgaagatgagaagcgtagtggccggccatgggaatttgacaatgaccaa ttgagagcaatcattgaagctgatcctcttacaactacacaagaagttgctgaaaaactc aacatcgaccattctacagccggcataggtccacagaaaggcccagttctgctccataag aacgcccaagtgcaggtcgtagaaccaatgcttcaaaagttgaatgaattgggctacgaa gttttgcctcatccgccatatttacctgacttcttgccaactgactaccactccttcaag catctcggcaactttttgcagggaaaacgcttccacaaccagtag