GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:16:11 Sequence gi568815595f:119707252_119917209 : 209958 bp : 38.76% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 177 310 134 2 2 57 113 59 0.851 4.96 1.02 Intr + 1340 1423 84 2 0 87 70 93 0.972 5.42 1.03 Intr + 2588 2644 57 2 0 104 108 26 0.890 3.28 1.04 Intr + 8311 8492 182 1 2 107 84 168 0.572 16.89 1.05 Intr + 18920 19097 178 0 1 118 56 121 0.877 9.96 1.06 Intr + 22969 23126 158 1 2 22 34 217 0.228 8.43 1.07 Intr + 25043 25225 183 0 0 18 121 122 0.257 7.34 1.08 Intr + 26113 26255 143 2 2 95 70 73 0.975 5.35 1.09 Intr + 32004 32075 72 2 0 95 66 72 0.949 4.38 1.10 Intr + 33298 33444 147 0 0 36 0 233 0.900 8.81 1.11 Intr + 36724 36963 240 0 0 61 57 329 0.967 23.92 1.12 Intr + 39864 40012 149 2 2 84 82 127 0.951 9.81 1.13 Intr + 40560 40651 92 0 2 50 121 42 0.953 2.42 1.14 Term + 43686 43846 161 1 2 36 45 178 0.991 5.42 1.15 PlyA + 45641 45646 6 1.05 2.00 Prom + 49808 49847 40 -5.85 2.01 Init + 55969 56065 97 0 1 61 106 40 0.176 3.62 2.02 Intr + 60604 60714 111 2 0 65 69 71 0.028 2.33 2.03 Intr + 61561 61728 168 2 0 47 17 152 0.005 3.00 2.04 Intr + 73302 73376 75 1 0 71 44 108 0.047 3.27 2.05 Intr + 73453 73622 170 2 2 94 47 79 0.031 3.14 2.06 Intr + 84755 85015 261 1 0 27 99 313 0.242 23.06 2.07 Intr + 85096 85298 203 0 2 73 53 173 0.042 9.46 2.08 Intr + 99978 100196 219 0 0 90 80 234 0.989 19.30 2.09 Intr + 102391 102591 201 1 0 47 13 191 0.687 4.98 2.10 Intr + 102810 102943 134 0 2 101 44 146 0.999 10.87 2.11 Intr + 104288 104475 188 0 2 74 57 239 0.999 17.89 2.12 Intr + 105435 105709 275 2 2 78 91 212 0.811 15.91 2.13 Intr + 107728 107870 143 1 2 73 49 182 0.158 11.88 2.14 Intr + 108072 108188 117 1 0 105 94 151 0.987 16.92 2.15 Intr + 108475 108580 106 2 1 111 71 107 0.996 9.65 2.16 Term + 108743 108896 154 2 1 45 48 146 0.115 2.71 2.17 PlyA + 108926 108931 6 -3.94 3.09 PlyA - 109009 109004 6 1.05 3.08 Term - 109982 109731 252 2 0 6 33 301 0.032 11.05 3.07 Intr - 110506 110363 144 1 0 84 42 130 0.055 7.56 3.06 Intr - 117739 117646 94 0 1 118 21 76 0.078 2.95 3.05 Intr - 119604 119471 134 0 2 100 7 169 0.399 8.52 3.04 Intr - 122994 122885 110 1 2 26 63 21 0.007 -7.52 3.03 Intr - 136102 136004 99 2 0 83 110 148 0.717 15.66 3.02 Intr - 146419 146254 166 1 1 -41 86 161 0.071 1.61 3.01 Init - 147915 147247 669 0 0 69 41 320 0.074 20.64 3.00 Prom - 148381 148342 40 -8.55 4.08 PlyA - 148782 148777 6 1.05 4.07 Term - 149964 149891 74 1 2 69 47 108 0.745 1.89 4.06 Intr - 156354 156168 187 0 1 116 88 204 0.989 21.54 4.05 Intr - 166425 166321 105 0 0 57 98 30 0.486 0.39 4.04 Intr - 169257 169162 96 0 0 59 113 43 0.530 3.19 4.03 Intr - 198601 198504 98 2 2 115 100 50 0.616 7.71 4.02 Intr - 205559 205453 107 1 2 72 97 77 0.969 5.94 4.01 Intr - 208923 208793 131 0 2 43 121 119 0.654 9.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 85096 85302 207 0 0 73 55 183 0.918 9.86 S.002 Intr + 109817 109953 137 2 2 58 70 201 0.849 14.59 S.003 Term + 110405 110625 221 0 2 44 38 134 0.835 0.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:119707252_119917209|GENSCAN_predicted_peptide_1|659_aa MFSNLIHYPRYSLYWSKSDPVPPFISREWKGHKEKHREALRQLTTTDASFQMPKEVYEDP EVTGKNRYKYFERPFLPFFQQMPFNVVYAVSKAEPYTFPPTSTKHLSIPSKSTVGTQTDY RDADVQTDPYSAEYVVCQDSIPELLTLATLTWGRGLPAGQAEVEMIERAREKRAWEASLP ALSDTSQFEKRRKMMNEMERKEWAFREQEIEKLQEIRLEVLKELLRKREENQNEVNMKHL NARWSKLQEGKEAKMAKIQRTHVSTIRKLVGKRKNIEGKLERRNIIKDYSDYASQVYGPL SRLGCFPDNNSEDFVVKNYYLNTYEGLVELESCLPDFVTQPQIRAPKPKVITTKAGFLKR AARLDYELAEVHKEEEEMEMAVIYLQKLLRGRVVQNMMFEGKEKRLELIQELRTCHALQE DEKLVKKAEKQVTLALQRQRNLHEHKVSLVENHLAGLEGRALADMFDFLSKELVRLQEER RIHAFVMLAERQRRVREAEESGRRQVEKQRLREEDEIFKEASRGSWVVKVHHSTISSYLE DIILNTEANTAEEQARAEIEKMAEKINDIAYEMESRRTYLQSEEIVAELVYSFLIPEVQK YFVKEKVRNAQRKHILAAHQIIHSYTESMVQKKLTEGEQDEASNAAMLLEKETQNENNS >gi568815595f:119707252_119917209|GENSCAN_predicted_CDS_1|1980_bp atgttcagtaacctgatccattatccaagatattctctatattggagcaagtcagatcct gtcccaccatttatcagtcgggaatggaagggacataaggagaaacacagagaagccctc cggcagctcaccacaactgacgcttcttttcagatgcctaaagaagtttatgaagatcct gaagttactggaaagaatcgctataaatactttgaaaggccttttctcccattttttcag cagatgccattcaatgttgtttatgccgtatccaaggcagaaccatacacttttcctcct acttctactaagcacctatccatcccttcaaagtctactgtgggcactcagactgattat cgggatgctgacgttcaaacagatccatactctgcagaatatgtagtatgtcaggactca atccctgagctcttgaccctggctacgcttacttggggtcggggtctcccagcaggacaa gctgaggtggagatgatagaaagagcccgcgagaagcgtgcttgggaagcctctctcccc gctctgagtgacacctcccagtttgagaagaggaggaaaatgatgaatgaaatggagagg aaggagtgggccttcagagagcaggagattgaaaaactgcaggagattcgcctggaagtt ctaaaagagctgttgaggaagcgtgaagagaatcagaatgaagtgaatatgaagcacttg aatgcccggtggtctaaactgcaggagggaaaagaggcaaaaatggcaaaaattcagcgc acgcatgtatcaacaatcagaaaacttgtaggaaagagaaagaatatagaagggaagttg gagagaagaaatatcatcaaggattattctgattatgcatcacaggtctatggacctctg tctcgtcttgggtgtttcccagacaacaactcagaggactttgtagtaaaaaactactat ctcaacacctatgaaggattagtggaacttgagtcatgtctcccagattttgtgacacaa ccccaaatcagagctccaaaacctaaagtcattaccaccaaagctggttttctgaagagg gcagcaaggttggactatgagttggcagaggttcataaggaagaagaagaaatggaaatg gctgtgatctaccttcaaaagttactccggggcagagtcgttcagaacatgatgtttgaa gggaaagaaaagcgactggagttgatccaggagttgcgcacctgccacgcactacaagaa gatgaaaagctggtgaaaaaagccgagaagcaagtgaccctggccttacagcggcagagg aacttgcatgagcacaaggtgtcactggttgaaaaccatttggccggactggaaggaagg gcactagcagacatgtttgacttcctgtccaaagagctggtgagactgcaggaggagagg aggatccatgcctttgtcatgctggctgagcgccagcggcgggtacgagaggctgaagag agtggtcggcgccaggtggaaaaacagcgcctgcgggaggaggacgagatatttaaggag gcaagtaggggcagctgggtggttaaagttcaccatagtactataagctcctacctagaa gacataatactgaataccgaagcgaatactgcagaagaacaagccagggcagaaatagag aagatggctgagaaaatcaatgacattgcttatgaaatggaaagccgccgaacctatctt cagtcagaggagattgttgctgagttggtttatagttttctgatcccagaggtgcaaaaa tactttgtcaaagaaaaagtgaggaacgcacagcggaaacatattcttgcagcccatcag atcatccacagttacacggaaagcatggttcaaaagaaattaactgagggagagcaagat gaggcctcaaatgctgccatgttacttgagaaagaaactcaaaatgagaacaacagctaa >gi568815595f:119707252_119917209|GENSCAN_predicted_peptide_2|873_aa MPNITNRSRNANKSHNVISPHTVRMAIIKKTKSRSSGRVFSSQQKGGNGVGGSSLQAGHQ DEYSALSREAPTSSMEGAALAAPPCCSQCDGSGRPSDVATAISSMEVKRAGSGPRLSGLK SQSKFRDLQAPDIAPCCPPTPEHTVQATSTGPPCLRPKSTFCPVGFVIQYLLALAESHVD NQKCDWKKDRALWLWVPVPTAGSKSPAKPIIFDHRSQPHNVPVITGSKDLQNVNIIPCIL FGPVTSQLPRIFTRIGEDYDERVLPSITTEILKSVVARFDAGELITQRELVSRQEGVHRS SGSQTGGSAGCREGQNSLATAGDGLMELCKLEAAEDITYQLSRSWNITNLPAGQSVLLQL PHPRGPEANLEVRPKESWNHADFVHCEDTESVPGKPSVNADEEVGGPQICRVCGDKATGY HFNVMTCEGCKGFFRRGNRGRGPSPWKLSGNKAVKEKGTGVSGRREGAEDGPGEEVSLKV VLGLPAPAKSTCPISCRNGQKRRAMKRNARLRCPFRKGACEITRKTRRQCQACRLRKCLE SGMKKEMIMSDEAVEERRALIKRKKSERTGTQPLGVQGLTEEQRMMIRELMDAQMKTFDT TFSHFKNFRLPGVLSSGCELPESLQAPSREEAAKWSQVRKDLCSLKVSLQLRGEDGSVWN YKPPADSGGKEIFSLLPHMADMSTYMFKGIISFAKVISYFRDLPIEDQISLLKGAAFELC QLRFNTVFNAETGTWECGRLSYCLEDTAGGFQQLLLEPMLKFHYMLKKLQLHEEEYVLMQ AISLFSPDRPGVLQHRVVDQLQEQFAITLKSYIECNRPQPAHRKNTSKQGDVAQKGASIP SLVLSSSCDLLPRLMPTDPETNLDFTEPKAMKG >gi568815595f:119707252_119917209|GENSCAN_predicted_CDS_2|2622_bp atgcccaacattactaatcgttcgagaaatgcaaacaaaagccacaatgtgatatcacct catactgttagaatggctattatcaaaaagacaaaaagcaggtcatctggtcgagtgttc agctctcagcaaaaaggaggcaatggagtgggtggctcctctctgcaggcaggtcatcag gatgagtattcagctctcagcagagaagctcccaccagctccatggagggtgcagccttg gctgcacctccctgctgcagccagtgtgatggcagtggcagaccgtctgatgtggccact gccatcagtagcatggaagttaagagagcaggctctggacccagactgtccggattgaaa tcacagtccaagttcagggacctgcaggccccagatatagccccatgctgtcctcctacc ccagagcacactgttcaggctacttccactggtcctccctgcctaaggcccaagtcaact ttctgtccagtgggatttgtaatccaatacctcctagccctagcagaatcccatgtggat aatcagaaatgtgactggaaaaaggacagagctctatggctgtgggtcccagtccccact gctggcagtaagtccccagcaaaaccaattatctttgaccaccgttctcaaccacataat gtgccagtcatcactggtagcaaagatttacagaatgtcaatatcattccgtgcatcctc tttgggcctgtcactagccagcttcctcgcatcttcaccaggatcggagaagactatgat gagcgtgtgctgccatccatcactactgagatcctcaagtcagtggtggctcgctttgat gctggagaactaatcacccagagagaactggtctccaggcaggaaggagttcacagaagc agtggaagccaaacaggtggctcagcaggatgcagagagggccagaactcactggccact gcaggggacggcctgatggagctgtgcaagctggaagctgcagaggacatcacgtaccag ctctctcgctcttggaacatcaccaatctgccggcagggcagtccgtgctcctccagctg ccccatccaagaggcccagaagcaaacctggaggtgagacccaaagaaagctggaaccat gctgactttgtacactgtgaggacacagagtctgttcctggaaagcccagtgtcaacgca gatgaggaagtcggaggtccccaaatctgccgtgtatgtggggacaaggccactggctat cacttcaatgtcatgacatgtgaaggatgcaagggctttttcaggaggggaaatcgagga agaggcccatctccatggaaactttctggtaacaaggctgtgaaagagaaggggacaggt gtttcaggaagaagggaaggcgctgaggatgggcctggagaggaagtgtccctgaaagtg gttttggggctgcccgcgccagcgaaaagcacgtgtcccatttcctgccggaatggccag aaaaggagggccatgaaacgcaacgcccggctgaggtgccccttccggaagggcgcctgc gagatcacccggaagacccggcgacagtgccaggcctgccgcctgcgcaagtgcctggag agcggcatgaagaaggagatgatcatgtccgacgaggccgtggaggagaggcgggccttg atcaagcggaagaaaagtgaacggacagggactcagccactgggagtgcaggggctgaca gaggagcagcggatgatgatcagggagctgatggacgctcagatgaaaacctttgacact accttctcccatttcaagaatttccggctgccaggggtgcttagcagtggctgcgagttg ccagagtctctgcaggccccatcgagggaagaagctgccaagtggagccaggtccggaaa gatctgtgctctttgaaggtctctctgcagctgcggggggaggatggcagtgtctggaac tacaaacccccagccgacagtggcgggaaagagatcttctccctgctgccccacatggct gacatgtcaacctacatgttcaaaggcatcatcagctttgccaaagtcatctcctacttc agggacttgcccatcgaggaccagatctccctgctgaagggggccgctttcgagctgtgt caactgagattcaacacagtgttcaacgcggagactggaacctgggagtgtggccggctg tcctactgcttggaagacactgcaggtggcttccagcaacttctactggagcccatgctg aaattccactacatgctgaagaagctgcagctgcatgaggaggagtatgtgctgatgcag gccatctccctcttctccccagaccgcccaggtgtgctgcagcaccgcgtggtggaccag ctgcaggagcaattcgccattactctgaagtcctacattgaatgcaatcggccccagcct gctcataggaaaaacacaagcaaacagggggacgtggcccagaaaggggcttctatacct tcactggtcctcagtagctcctgtgacctactgcccagactgatgcccacagacccagag acaaacttggattttacggagcccaaggccatgaagggttaa >gi568815595f:119707252_119917209|GENSCAN_predicted_peptide_3|555_aa MKAEIKMFFETNENKDTTYQNLWDTFKAAYRGKFIALNAHKRKKERSKIDTLTSQLKELE KQEQTRSKDSRRQEITKVRAELKEIETQKTLQKVSESRSWFFEKINKIDRPPARLIKKKR EKNRIDAIKNDKGGITTDPTEIQTAIREYYEHFYANKLENLEEMDKFLETYTLPRLNQEE VESLNRPITGSEVEAIINSLPTKKSPGPDRFTAKIYQRYTEELNKIPGNPTYKGREGPRQ GELQTTAQRNKRGHKQMKEHSMLMDRRNQCRENGPTAQELSSNPPLATILIPPHARIQAA ASTPTNATAASVSSQPPRGQLLLLYLAQGPVHLYDGIVEIMKCNFTQSMLILETVDRPIM LLLHQLPTPPEQSRAASCTGKTTSYLSVTQQHWSLYGSRPPEGSEVESVGSPPAGPSTLS CSLTRTPTDTHAPADEYRPGAHKAARKRRGTSGKWETGLLEVAMKRLRKRTNVSPKGSRS ATCDAEQLLHEGRSKGVYVLDAQQPLGVLSIDAAELGEHSHDLQEQEPACQPEKRLWVQP ALTHKPQGCSEAAQA >gi568815595f:119707252_119917209|GENSCAN_predicted_CDS_3|1668_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggacacatttaaagcagcatatagaggaaaatttatagcactaaatgcccac aagagaaagaaggaaagatctaaaattgacaccctaacatcacaattaaaagaactagag aagcaagagcaaacacgttcaaaagatagcagaaggcaagaaataactaaggtcagagca gaactgaaggagatagagacgcaaaaaacccttcaaaaagtcagtgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagaccgccagcaaggctaataaagaagaaaaga gagaagaatcgaatagacgcaataaaaaatgataaagggggtatcaccaccgatcccaca gaaatacaaactgccatcagagaatactatgaacacttctacgcaaataaactagaaaat ctagaagaaatggataaattcctggaaacatacaccctcccaagactaaaccaggaagaa gttgaatccctgaatagaccaataacaggctctgaagttgaggcaataattaatagcctc ccaaccaaaaaaagtccaggaccagacagattcacagccaaaatctaccagaggtacaca gaggagctgaataaaatacctgggaatccaacttacaagggacgtgaaggacctcgtcaa ggagaactacaaaccactgctcaacgaaataaaagaggacacaaacaaatgaaagaacat tccatgctcatggataggaggaatcaatgtcgtgaaaatggccctactgcccaagaactg tcaagtaatccacctctggctaccatccttattcctcctcatgctcggattcaagcagct gcttcaacccccacaaatgccacagcagcgtcagtttcttcccagccaccaagaggtcag ctcctcctactttacctggcccaggggccagtccatctttatgatgggattgtagagatt atgaaatgtaactttactcagtcaatgctaatactggagaccgtggacagaccaataatg ctgcttctgcatcagcttccaactccacctgaacagtcccgagcagccagctgcacagga aaaaccaccagttacttgagtgtcactcagcaacactggtcactttatgggtctcgtcct ccagagggttcggaagtggagtctgttggcagccctcctgcaggccctagcaccctgtcc tgctccttaactaggactcccacagatactcatgcgcctgccgatgagtacaggcctgga gcccacaaagcagctcggaagaggaggggaacgagtgggaagtgggagacaggactatta gaggtagcaatgaaaagactcaggaagcgaacaaacgtgtcacccaagggcagccgctca gctacctgtgatgccgaacaactcctgcatgaggggcgtagcaaaggggtgtatgtcctg gatgcgcagcagccgctgggtgtgctgagcattgatgctgcggagctcggtgagcatagc catgatcttcaggaacaagaacctgcatgccagccagagaaaagattgtgggtgcagccc gccctgacccacaagcctcagggctgctctgaggctgcacaagcataa >gi568815595f:119707252_119917209|GENSCAN_predicted_peptide_4|265_aa LYMYQLFRSLAYIHSFGICHRDIKPQNLLLDPDTAVLKLCDFGSAKQLVRGEPNVSYICS RYYRAPELIFGATDYTSSIDVWSAGCVLAELLLGQPIFPGDSGVDQLVEIIKVLGTPTRE QIREMNPNYTEFKFPQIKAHPWTKVIIRIKMQNLKFLEGTQTPFPTTVIVDSNLRNALRV FRPRTPPEAIALCSRLLEYTPTARLTPLEACAHSFFDELRDPNVKLPNGRDTPALFNFTT QEIAIATLTFSNQYPGQSAAIDIKV >gi568815595f:119707252_119917209|GENSCAN_predicted_CDS_4|798_bp ttgtatatgtatcagctgttccgaagtttagcctatatccattcctttggaatctgccat cgggatattaaaccgcagaacctcttgttggatcctgatactgctgtattaaaactctgt gactttggaagtgcaaagcagctggtccgaggagaacccaatgtttcgtatatctgttct cggtactatagggcaccagagttgatctttggagccactgattatacctctagtatagat gtatggtctgctggctgtgtgttggctgagctgttactaggacaaccaatatttccaggg gatagtggtgtggatcagttggtagaaataatcaaggtcctgggaactccaacaagggag caaatcagagaaatgaacccaaactacacagaatttaaattccctcaaattaaggcacat ccttggactaaggttattataagaattaaaatgcaaaacttaaaattcctagagggaact cagactcctttccctaccactgtgatagtggacagcaacttgagaaatgctcttagggtc ttccgaccccgaactccaccggaggcaattgcactgtgtagccgtctgctggagtataca ccaactgcccgactaacaccactggaagcttgtgcacattcattttttgatgaattacgg gacccaaatgtcaaactaccaaatgggcgagacacacctgcactcttcaacttcaccact caagaaattgccatagccaccctaaccttcagcaaccaatatcctggtcaatcagcagcc atcgacataaaggtataa