GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:23:40 Sequence gi568815589f:78197211_78429083 : 231873 bp : 41.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7122 7285 164 2 2 70 83 100 0.564 6.95 1.02 Intr + 9039 9364 326 0 2 -9 61 228 0.459 4.89 1.03 Term + 21898 22064 167 2 2 94 47 251 0.991 18.70 1.04 PlyA + 22396 22401 6 1.05 2.00 Prom + 28897 28936 40 -3.15 2.01 Init + 39141 39393 253 2 1 77 72 217 0.377 16.56 2.02 Intr + 42813 42985 173 1 2 102 121 40 0.950 7.44 2.03 Intr + 43082 43154 73 1 1 75 87 43 0.784 0.96 2.04 Intr + 44486 44589 104 0 2 77 88 34 0.692 1.27 2.05 Intr + 46258 46426 169 0 1 -2 107 138 0.408 5.30 2.06 Intr + 51081 51145 65 1 2 79 61 43 0.812 -1.78 2.07 Intr + 57626 57754 129 1 0 57 87 180 0.969 14.77 2.08 Intr + 65697 65774 78 2 0 63 84 54 0.697 1.33 2.09 Intr + 66940 67106 167 0 2 82 40 126 0.979 5.04 2.10 Intr + 68162 68333 172 2 1 95 99 141 0.996 14.92 2.11 Intr + 69232 69493 262 0 1 116 68 143 0.258 11.24 2.12 Term + 84280 84416 137 2 2 55 52 178 0.942 8.10 2.13 PlyA + 85183 85188 6 1.05 3.00 Prom + 85925 85964 40 -9.05 3.01 Init + 86218 86371 154 0 1 60 73 149 0.956 10.79 3.02 Intr + 86603 86776 174 0 0 39 73 98 0.396 2.39 3.03 Intr + 93320 93451 132 0 0 74 61 54 0.575 1.00 3.04 Term + 93718 93887 170 2 2 37 47 184 0.980 6.26 3.05 PlyA + 95186 95191 6 1.05 4.00 Prom + 96454 96493 40 -5.25 4.01 Init + 99740 100060 321 1 0 77 89 294 0.593 25.78 4.02 Intr + 100191 100393 203 2 2 13 26 286 0.661 12.06 4.03 Term + 100789 101023 235 1 1 87 52 133 0.914 4.41 4.04 PlyA + 101676 101681 6 1.05 5.00 Prom + 102824 102863 40 -9.55 5.01 Init + 103240 103320 81 0 0 80 72 1 0.674 -1.28 5.02 Intr + 103392 103452 61 2 1 143 91 17 0.688 4.89 5.03 Intr + 104744 104813 70 0 1 96 87 38 0.737 1.82 5.04 Intr + 107525 107730 206 2 2 71 108 106 0.968 8.82 5.05 Intr + 109104 109276 173 1 2 45 75 225 0.886 15.64 5.06 Intr + 111204 111373 170 2 2 62 93 196 0.141 15.32 5.07 Intr + 117611 117754 144 2 0 29 61 105 0.080 0.38 5.08 Intr + 120466 120594 129 1 0 34 86 150 0.222 8.29 5.09 Intr + 126857 127005 149 2 2 75 97 106 0.292 9.16 5.10 Intr + 130849 130978 130 2 1 -16 95 100 0.003 -0.87 5.11 Term + 131771 131876 106 2 1 97 47 132 0.990 6.90 5.12 PlyA + 135921 135926 6 1.05 6.00 Prom + 136263 136302 40 -8.65 6.01 Sngl + 139966 140622 657 0 0 49 41 321 0.471 19.42 6.02 PlyA + 140800 140805 6 1.05 7.00 Prom + 143096 143135 40 -3.65 7.01 Init + 149238 149448 211 2 1 27 31 189 0.042 6.09 7.02 Intr + 160520 160639 120 0 0 59 65 99 0.019 4.25 7.03 Intr + 161070 161222 153 1 0 15 84 168 0.108 8.22 7.04 Term + 168292 168476 185 2 2 13 43 138 0.084 -1.48 7.05 PlyA + 169121 169126 6 1.05 8.09 PlyA - 170227 170222 6 1.05 8.08 Term - 172672 172548 125 2 2 77 39 89 0.614 0.47 8.07 Intr - 174074 173911 164 1 2 55 110 119 0.710 9.50 8.06 Intr - 174973 174859 115 2 1 -1 54 53 0.315 -8.41 8.05 Intr - 175533 175345 189 1 0 30 94 206 0.811 14.04 8.04 Intr - 177783 177762 22 0 1 133 99 -2 0.236 1.60 8.03 Intr - 201726 201615 112 2 1 29 58 106 0.008 1.16 8.02 Intr - 213929 213800 130 0 1 39 20 166 0.240 3.73 8.01 Init - 217569 217371 199 0 1 58 76 104 0.570 5.33 8.00 Prom - 222671 222632 40 -3.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 111204 111326 123 2 0 62 56 209 0.816 14.96 S.002 Init + 161111 161222 112 1 1 90 84 161 0.805 16.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:78197211_78429083|GENSCAN_predicted_peptide_1|218_aa MDKFLDTYTLSGLNQEAVKYLNRSITNSEIEVVINRLPTKNSPGPDGFTAKFYLRWIKDL NVRPKTIKTLEENLGGTFQDTGMGKDFMTKTPKAMATDAKIDKWNLIKLKSFCKAKETII RVNSQPTEWEKIFAIYPSDKGLISRITRNLNKFTRKRQTTPSKMCQQDGPQAIYYSDKYF DEHYKYRHVMLPRELSKQAPETHLMSEEEWRRLMSNGV >gi568815589f:78197211_78429083|GENSCAN_predicted_CDS_1|657_bp atggataaattccttgacacatacaccctctcaggacttaaccaggaagcagtcaaatac ctgaatagatcaataacaaattctgaaattgaggtagtaattaatagactgccaaccaaa aacagcccaggaccagatggattcacagccaaattctacctgagatggattaaagactta aatgtaagacctaaaaccataaaaaccctagaagaaaacctaggcggtacctttcaggac acaggcatgggcaaagacttcatgactaaaacaccaaaagcaatggcaacagacgccaaa attgacaaatggaatctaattaaactaaagagcttctgcaaagcaaaagaaactatcatc agagtgaacagtcaacctacagaatgggagaaaatctttgcaatctatccatccgacaaa gggctaatatccagaatcacaaggaacttaaacaaatttacaagaaaaagacaaacaact ccatcaaaaatgtgccagcaggatggcccacaagccatctactactcggacaagtacttc gacgagcactacaagtaccggcatgttatgttacccagagaactttccaaacaagcacct gaaactcatctgatgtctgaagaggagtggaggagacttatgtccaatggcgtctag >gi568815589f:78197211_78429083|GENSCAN_predicted_peptide_2|593_aa MIDSVKLRRDSAADFFSHYEYLCALQNSVPLPAVRACLREGVLDFNADRLRGVDWAPLLS TLKINKDLPLVSIKSFFQPWLGDTGSDMNKFCRSRVPAIRYKDVTFQLCKALKGCLSISS VLKNLELNGLILRERDLTILAKGLNKSASLVHLSLANCPIGDGGLEIICQGIKSSITLKT VNFTGCNLTWQGADHMAKILKTMRRHEETWAESLRYRRPDLDCMAGLRRITLNCNTLIGD LGACAFADSLSEDLWLRDHSMMKAVIKKVLQNGRSAKSEQPGFPVTVTVESPSSSEVEEV DDSSESVHEVPEKTSIEQEALQEKLEECLKQLKEERVIRLKVDKRVSELEHENAQLRNIN FSLSEALHAQSLTNMILDDEGVLGSIENSFQKFHAFLDLLKDAGLGQLATMAGIDQSDFQ LLGHPQMTSTVSNPPKEEKKALEDEKPEPKQNALGQMQNIQFQKITGDARIPLPLDSFPV PVSTPEGLGTSSNNLGVPATEQRQESFEGFIARMCSPSPDATSGTGSQRKEEELSRNSRS SSEKKTKTVKFISHGLRTAAQDLEWHFGEPEAKVLLQGTAAVQKESESTTIAK >gi568815589f:78197211_78429083|GENSCAN_predicted_CDS_2|1782_bp atgatcgactccgtgaagctgcgccgcgacagcgcggcggacttcttctcccactacgag tacctgtgcgcgctgcagaactcggtgccgctgcccgccgtgcgcgcctgtctccgggag ggcgtgctggatttcaacgccgaccgcctccgcggggtggactgggcgcctctgctgagc accctcaagatcaataaagacctgcccttggtctccatcaagagcttcttccagccctgg ctgggggacacaggttctgacatgaataaattttgcagaagtcgtgttcctgcgataaga tacaaagatgtgaccttccagttgtgtaaagctcttaaaggctgtttaagtatatcaagt gtgctaaagaacctggagctaaatggactaattctgagagagagggatttaactattcta gcaaagggattgaataaatcggcttctttggtgcacctgtctcttgcaaattgtccaatt ggagatggaggtttagaaattatttgtcaaggtataaagagctctatcactcttaagaca gtcaacttcacaggatgtaatctgacatggcagggagcagatcacatggccaagatctta aagaccatgagaaggcatgaagaaacctgggctgagagtcttcgctataggagacctgat cttgactgtatggctggcttaagacgtatcacactgaattgcaacacacttattggtgac ctaggtgcatgtgcttttgcagactctctcagtgaggatttatggctgagagatcattct atgatgaaagcagttatcaaaaaagtcctccagaatggaaggagtgccaaatcagagcaa ccaggttttcctgtgactgtgacagtagagagtccttcatcctctgaagttgaagaggtt gatgattcttcagagagtgttcatgaagtgcctgagaaaactagtatagaacaagaagca ttacaggaaaaactggaggagtgcctaaagcagttaaaggaagaaagagtgataaggctt aaggttgataaacgagtcagtgagctggaacatgaaaatgcccagttaagaaatataaat ttctctttgtctgaagcccttcatgcacagtcattgacaaatatgatcctggatgatgaa ggtgttttgggcagcattgagaattcttttcagaagtttcatgctttcttggatctcctt aaagatgctgggcttgggcagcttgccacaatggctgggatagatcagtcagattttcaa ttactaggtcatccccagatgacttctactgttagtaatccacctaaagaagaaaagaag gcgcttgaagatgaaaaaccagaaccgaagcagaatgccctagggcaaatgcaaaatatc cagtttcagaaaattacaggtgatgctagaattcctttgcctctcgactcctttcctgtc ccagtttctactccagagggcttaggaacttccagcaacaacctaggagtcccagctact gagcagcggcaggagtcttttgaaggattcattgctagaatgtgttctccttcaccagat gcgacttctggaactggaagtcaaagaaaagaagaggagttgtccagaaatagcagatct tcttcagagaaaaagaccaaaacagttaagttcataagtcatggactcaggacagctgcc caagatcttgagtggcattttggagagcctgaagccaaggttcttctgcaggggactgca gctgtgcagaaggagtcagagtctaccaccattgctaaatga >gi568815589f:78197211_78429083|GENSCAN_predicted_peptide_3|209_aa MKPRTLAVSVTVLKDDVSGVCPFSCSDVSGVSSFQWVRGLADFRNEAADPCRMKPQTLAV SVTALKDGVSGVCSFSCSDVSGVSSFQWVHGLADFRSLADFRSEAAELCKLSTQQQTVSI FINFLLPALYIHSRNIYQAPNMLDAGDTAVKKAVWRLDSKSVMSMQGVQAQESRRDHGSL DEAGSMEIEVDRAQKNCEAEQIEFENEER >gi568815589f:78197211_78429083|GENSCAN_predicted_CDS_3|630_bp atgaagccgcggacccttgcggtgagtgttacagttctaaaagatgatgtgtccggagtt tgtcccttcagctgttcagatgtgtctggagtttcttccttccagtgggttcgtggtctt gctgacttcaggaatgaagctgcagacccttgcagaatgaagccgcagaccctcgcagtg agtgttacagctcttaaagatggtgtgtccggagtttgttccttcagctgttcagatgtg tctggagtttcttccttccagtgggttcatggtctcgctgacttcaggagtctcgctgac ttcaggagtgaagccgcagagctttgcaaattatcaacccagcaacagactgttagcatt ttcattaatttccttcttcctgccctgtacattcactcaagaaatatttatcaagcacct aatatgcttgatgctggggatacagcagtgaaaaaggcagtttggagattggattctaag tcagtgatgtcaatgcaaggagtccaggcccaggagtctagaagagatcatgggagcttg gacgaggcaggcagcatggagatagaagtcgatagagcacagaaaaattgtgaagcagaa cagattgaatttgagaatgaggaaagatga >gi568815589f:78197211_78429083|GENSCAN_predicted_peptide_4|252_aa MGVETVNARRSNCFDSAQKRDQWGCELLRANQLAQGLRQHGPMGRRLGAGTRRGFGAGCR LSPQRPGTPAVHAFGPPWLTHRPGRRTMDAPRQVVNFGPGPAKLPHSRAQRDQQLRQAGF GKNAVGEEARRGVPVQLPLALTAGARQCTTQLAGACCPAGATHLQTPGLCHLLGRRSSIL TLVDFGLLNPIPCSFRSGVNRVIEFSPGPQSDLAGGTHSLPSQAPARLLPSPAKPTVPSP TRSEEGKVSDVP >gi568815589f:78197211_78429083|GENSCAN_predicted_CDS_4|759_bp atgggggttgaaacagtaaacgcgaggaggagcaactgcttcgactcggctcagaagcgc gaccaatggggatgtgagctccttcgcgcgaaccaattagcgcagggcctgcgacagcac gggccaatggggcgccgactcggcgcaggaacaaggcgggggttcggggccggctgcaga ctctcaccgcagcggccaggaacgccagccgttcacgcgttcggtcctccttggctgact caccgccctggccgccgcaccatggacgcccccaggcaggtggtcaactttgggcctggt cccgccaagctgccgcactcacgtgcacagcgggatcagcagctccggcaagcgggcttc gggaagaatgcagttggtgaggaagctcggcgaggcgtgcccgtgcagctgcccctggcc ctgactgctggtgcgaggcagtgcacgactcagctggccggggcctgctgtcccgccggt gccacgcacctgcagacgcccgggctgtgccatctcctgggccggcgttcgagtattcta actctggtagactttgggctgcttaacccaattccctgttcatttcgttctggagtaaac cgagtgattgaattctctcccgggccacaatcagatcttgctggtggaactcactctctg ccttcccaggccccagcccggctcctcccctcccccgccaaacccaccgtccccagcccc acccgcagtgaagaaggcaaagtctccgatgtgccttga >gi568815589f:78197211_78429083|GENSCAN_predicted_peptide_5|472_aa MGRKETGCGWGWKPGDLTVDPSLSPRQVLLEIQKELLDYKGVGISVLEMSHRSSDFAKII NNTENLVRELLAVPDNYKVIFLQGGGCGQFSAVPLNLIGLKAGRCADYVVTGAWSAKAAE EAKKFGTINIVHPKLGSYTKIPDPSTWNLNPDASYVYYCANETVHGVEFDFIPDVKGAVL VCDMSSNFLSKPVDVSKFGVIFAGAQKNVGSAGVTVVIVRDDLLGFALRECPSVLEYKVQ AGNSSLYNTPPCFRQTKSTVQQGPMSALKPWANTELQAKGKLGHLEVLLVTWALLGFNHM AGIYVMGLVLEWIKNNGGAAAMEKLSSIKSQTIYEIIDNSQGFYVTLESPGVSLSSVSPR GLSGTLIPTIGRVFQSWGNTAFLSDLLGFLIYVVVEPQNRSKMNIPFRIGNAKGDDALEK RFLDKALELNMLSLKGHRSVGGIRASLYNAVTIEDVQKLAAFMKKFLEMHQL >gi568815589f:78197211_78429083|GENSCAN_predicted_CDS_5|1419_bp atgggaaggaaggagactggctgtgggtggggatggaagcctggggacctcactgtagac ccttccttgtcccctcgtcaggtgttgttagagatacaaaaggaattattagactacaaa ggagttggcattagtgttcttgaaatgagtcacaggtcatcagattttgccaagattatt aacaatacagagaatcttgtgcgggaattgctagctgttccagacaactataaggtgatt tttctgcaaggaggtgggtgcggccagttcagtgctgtccccttaaacctcattggcttg aaagcaggaaggtgtgctgactatgtggtgacaggagcttggtcagctaaggccgcagaa gaagccaagaagtttgggactataaatatcgttcaccctaaacttgggagttatacaaaa attccagatccaagcacctggaacctcaacccagatgcctcctacgtgtattattgcgca aatgagacggtgcatggtgtggagtttgactttatacccgatgtcaagggagcagtactg gtttgtgacatgtcctcaaacttcctgtccaagccagtggatgtttccaagtttggtgtg atttttgctggtgcccagaagaatgttggctctgctggggtcaccgtggtgattgtccgt gatgacctgctggggtttgccctccgagagtgcccctcggtcctggaatacaaggtgcag gctggaaacagctccttgtacaacacgcctccatgtttcagacaaaccaaatccacagtg cagcaagggcccatgtcagctttgaaaccctgggctaacacagagcttcaggcaaaaggg aagcttggtcacctggaagtattgttggtgacctgggctctgcttggattcaatcacatg gcaggcatctacgtcatgggcttggttctggagtggattaaaaacaatggaggtgccgcg gccatggagaagcttagctccatcaaatctcaaacaatttatgagattattgataattct caaggattctacgttaccttggaaagtcccggtgtgtccctgagcagcgtgtctccgagg ggactgtctggaactctcatccccacaataggaagggtttttcagtcatgggggaacact gcgttcctgtctgacctcctggggttcctcatttacgtggtcgtggagccccaaaataga agcaagatgaatattccattccgcattggcaatgccaaaggagatgatgctttagaaaaa agatttcttgataaagctcttgaactcaatatgttgtccttgaaagggcataggtctgtg ggaggcatccgggcctctctgtataatgctgtcacaattgaagacgttcagaagctggcc gccttcatgaaaaaatttttggagatgcatcagctatga >gi568815589f:78197211_78429083|GENSCAN_predicted_peptide_6|218_aa MGDFNTPLSILDRSARQKVNKDIQDLNSALQQADLTDIYRVPLSKSTEYTFFSAPHCTYS KIDHIIGSKALISKCKRTEMTTNCPSDHSAIKLELRIKKLTKNRTTTWKLKNLLLNDYWV NNEMKAEIKVSFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSEINTLTSQLK ELEKQEQTNSKDSRRQEITKIRAELKEIETQKALQKNQ >gi568815589f:78197211_78429083|GENSCAN_predicted_CDS_6|657_bp atgggagactttaacaccccactgtcaatattagacagatcagcgagacagaaggttaac aaagatattcaggacctgaactcagctctgcaacaagcagacctaacagacatctataga gttcccctctccaaatcaacagaatatacattcttctcagcaccacattgcacttattct aaaattgaccacataattggaagtaaagcactcatcagcaaatgtaaaagaacagaaatg acaacaaactgtccctcagaccacagtgcaatcaaattagaactcaggattaagaaactc actaaaaaccgcacaactacatggaaactgaaaaacttactcctgaatgactactgggta aataacgaaatgaaggcagaaataaaggtgtcctttgaaaccaatgagaacaaagacaca acgtaccagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatctgaaatcaacaccttaacatcacaattaaaa gaactagagaagcaagagcaaacaaattcaaaagatagcagaaggcaagaaataactaaa atcagagcagaactgaaggagatagagacacaaaaagcccttcaaaaaaatcaatga >gi568815589f:78197211_78429083|GENSCAN_predicted_peptide_7|222_aa MRHSKGDISQEVIYRGLKLWRVVLEGDRFQPGESEESSRKTAKEESRGPDTEPEKHQKER EAQKKTERAWCTPMASHGPISTHFLPSEAHEKPQTQSDQKRRWANQLRREGSAVPGVFKL LGATMFPGVSWELLVVRLVRPQLRRELALVPVPGAARPAAAVRGEAQYTHQMIRRQQPLM DKMSQAKSLQQYSTVHKQLCQKEKLPIKQSYYRVLPGQQKAP >gi568815589f:78197211_78429083|GENSCAN_predicted_CDS_7|669_bp atgagacattcaaagggagatatttcacaggaagtcatatatagaggtctaaagctgtgg agagtggtcttggagggagacagatttcagccaggggagtcggaggaatcctccaggaag acagccaaggaagagagcagaggacctgacacagaacccgagaaacaccaaaaggaaaga gaagctcagaagaagaccgagagggcatggtgcacgcccatggcctcccatggaccaatc agcacacacttcctcccctctgaggcccatgaaaagccccagacgcagtcagaccagaag agacgttgggccaaccagctgcggagggaaggctctgcggttcctggagtcttcaagctt ctgggtgccaccatgttccctggtgtcagctgggaactgcttgtggtacgcctggtccgg ccgcagcttcgcagggagctggcgcttgtgccagtgcctggagctgcccgccctgctgca gccgtaagaggagaagcacagtatacacatcagatgatccgaaggcagcaaccactcatg gataagatgagccaggctaagtctttacaacaatactccacggtgcataagcagctctgt caaaaagaaaaacttcctataaagcaatcttattacagagtgcttccaggacagcagaaa gccccttag >gi568815589f:78197211_78429083|GENSCAN_predicted_peptide_8|351_aa MRKRRCVYDEHPPLAAGASMWRRPDTKASWLDVQFHHTPRCPAGKPQTAILLKAWPFRSY TPTPDVGVKLQTFVVSVTALKGSVDPKSEQQQDLLQRAKEQSFHSVEGDPARAVMLEGHL HCQKEHRGEEEPPQSHAVQLAHRGTGHNSPHPVTGFYKYPGLSDGRVGGEQLGRDSGTEE LLVIPPAMVLTPRNREKVDTREDNLSHDQPVGRSTTEVTDVSIMCSLHQQPQLPATHYST PIAYSDPAHVELAGRGQTSSAFMSNWLDFLQRWASFSQEHISIESESSESPCPIALSSLL QAFNADLQGTGVPRDFALIPANNPVLTRQPLGGAVSARLHHTLPDLGPQLG >gi568815589f:78197211_78429083|GENSCAN_predicted_CDS_8|1056_bp atgagaaagagaaggtgtgtctatgatgaacacccaccactggctgcaggagcctccatg tggagacgccctgataccaaagcatcatggttggatgtgcagttccaccatacaccacgc tgcccagctggaaagccacagacagcaatcctgctaaaggcctggccattcagatcctac actcccacgccagatgttggagtgaagctgcagaccttcgtggtgagtgttacagctctt aaaggcagtgtggacccaaagagtgagcagcagcaagatttattacaaagagcgaaagaa caaagcttccacagtgtggaaggggacccagcccgagcagtcatgttagaaggtcaccta cactgccaaaaagaacacagaggggaagaagagcccccacagagccatgcagtccagcta gcccatagaggcactgggcataatagtccccaccctgtgacagggttttacaagtacccg ggactctcggatggaagagttggtggggaacagttgggaagagactcaggtacagaggag ctgttggtgatacctcctgccatggttctgaccccaaggaacagggagaaagtggatact agggaagacaacctcagccatgatcagcctgtaggaaggagcaccacagaagtgactgat gtttccatcatgtgttcgctccaccagcagccacagttgcctgccactcactacagcacc cccatcgcatactctgacccagcccacgtagagttggccggaagagggcagactagctca gcttttatgagcaactggctcgattttctacagcgatgggcaagcttttcacaggaacac atctccattgaaagtgaaagtagtgaaagtccatgccccatagctttgtccagcctctta caagcatttaacgcagatctacagggcacaggagttcccagggattttgcactcatccct gcaaacaatcccgtcctcaccagacagccactgggtggcgctgtgtctgcacgacttcat cacacactcccagacctgggaccccagcttggctaa