GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:27:24 Sequence gi568815589f:610802_845232 : 234431 bp : 43.32% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 174 169 6 1.05 1.07 Term - 8373 8318 56 1 2 54 43 72 0.165 -2.98 1.06 Intr - 8925 8834 92 2 2 43 92 60 0.161 1.54 1.05 Intr - 11148 11020 129 2 0 81 52 76 0.215 3.11 1.04 Intr - 11743 11641 103 2 1 78 98 -1 0.135 -0.87 1.03 Intr - 26840 26726 115 2 1 79 115 35 0.180 5.32 1.02 Intr - 39123 39034 90 0 0 69 42 89 0.086 2.59 1.01 Init - 48845 48840 6 2 0 96 93 0 0.305 2.15 1.00 Prom - 50808 50769 40 -2.46 2.04 PlyA - 53668 53663 6 1.05 2.03 Term - 58140 58027 114 0 0 56 42 102 0.380 0.97 2.02 Intr - 75579 75498 82 2 1 69 62 37 0.055 -1.16 2.01 Init - 86559 86372 188 0 2 38 93 139 0.488 7.93 2.00 Prom - 93235 93196 40 -5.16 3.02 PlyA - 93268 93263 6 1.05 3.01 Sngl - 97524 96622 903 0 0 70 42 1046 0.917 94.42 3.00 Prom - 98194 98155 40 -5.86 4.00 Prom + 98395 98434 40 -10.15 4.01 Init + 99215 99302 88 1 1 65 66 -59 0.326 -9.59 4.02 Intr + 100003 102663 2661 2 0 136 96 2454 0.942 238.36 4.03 Intr + 119250 119447 198 1 0 93 48 81 0.517 3.92 4.04 Intr + 120357 120465 109 1 1 73 86 101 0.984 7.74 4.05 Intr + 121577 121816 240 2 0 54 48 311 0.982 20.36 4.06 Intr + 123947 124034 88 2 1 144 101 32 0.998 10.07 4.07 Intr + 127484 127703 220 1 1 35 96 307 0.611 24.07 4.08 Intr + 129991 130133 143 2 2 103 92 259 0.999 27.87 4.09 Intr + 131404 131604 201 0 0 87 85 380 0.999 37.08 4.10 Intr + 133690 133788 99 0 0 47 77 116 0.936 6.71 4.11 Term + 156770 157333 564 1 0 51 49 335 0.469 20.29 4.12 PlyA + 158226 158231 6 1.05 5.00 Prom + 161575 161614 40 -1.86 5.01 Init + 174352 174397 46 0 1 66 86 33 0.162 1.74 5.02 Term + 189860 190011 152 0 2 78 47 117 0.595 4.67 5.03 PlyA + 190158 190163 6 1.05 6.00 Prom + 211747 211786 40 -3.86 6.01 Init + 215730 215981 252 2 0 82 16 237 0.418 11.25 6.02 Intr + 222713 222820 108 1 0 97 62 15 0.035 0.28 6.03 Intr + 230888 231391 504 1 0 45 94 608 0.237 50.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:610802_845232|GENSCAN_predicted_peptide_1|196_aa MTGNGDDAEKLQCRAILSSCIATVIPWVTPKEDSGSQVWSPDKALASSGNLTEIQILRLH PRLGDLDIPEGGYPVFPTVFFEETVLSPIAYSWHPCQDASMQSSRGFGSTRAPQNKPKPF GGAFGFHLPSGLYPFHRRGQSSHVKSTENVLHIKSLKITVRLIYEHLHSAVREHPQVKVL AVKQELRIQNYKEAFN >gi568815589f:610802_845232|GENSCAN_predicted_CDS_1|591_bp atgactggcaacggggatgacgcagagaagctgcaatgtagagccatcctttcaagctgc atcgcaactgtcatcccatgggtcacaccaaaggaggacagtggttctcaagtgtggtcc ccagataaagcattagcatcatctgggaacttaacagaaatacaaattctcaggcttcac cccagacttggtgacttagatatccctgaaggtggatatccagttttcccaacagtattc ttcgaagagactgtcctttcccccattgcgtattcttggcacccttgtcaggatgcttct atgcaatcctccagaggcttcggcagcaccagggctccccagaacaaaccaaagcccttt ggtggagcatttggatttcatctcccatctggactgtaccccttccaccggcgagggcaa agttcacatgtgaagtctacagagaatgtattgcatattaaatccttgaaaattacagta cggctgatctacgaacatctgcattctgcagtccgtgagcacccacaagtcaaagtattg gctgtaaagcaagaactgaggattcagaactacaaggaagcttttaactga >gi568815589f:610802_845232|GENSCAN_predicted_peptide_2|127_aa MRKLTNNHLDQVIKVDIMYLHGEEQSITFALSCQEGMAWTLPETHSTECQASLFKTIMAM KKRSQEDKPVFVLTAEGEHAKRVAPKNIFVIISSIDSHKSANPIVNCKCEGSRLSVPYED LMPDDLR >gi568815589f:610802_845232|GENSCAN_predicted_CDS_2|384_bp atgagaaagctgacaaataaccacctcgaccaggtgatcaaagttgacatcatgtatctg catggggaagagcagagcatcacttttgcactgtcctgccaagaaggcatggcctggacc ctgcctgaaacacattctacagaatgtcaggcctctctttttaaaaccatcatggccatg aaaaaaaggtcacaagaagacaaaccagtctttgtcctcacagctgagggggaacatgcc aaacgggttgccccaaaaaacatattcgtgatcattagcagcatagattctcataagagt gcaaaccctattgtgaactgcaagtgcgaaggatctaggttgagcgttccttatgaggat ctaatgcctgatgatctgaggtag >gi568815589f:610802_845232|GENSCAN_predicted_peptide_3|300_aa MEKVITQNMQPDLSPVLSEIPLTGEKLQQPDLSPGLSEIPLTGEKLQQPDLSPGLSEIPL TGEKLQQPDLSPGLSETPLTGEKLQQPDLSPGLSETPLTGEKLQQPDLSPGMSETPLTGE KLQQPDLSPGLSETPLTGEKLQQPDLSPGLSETPLTGEKLQQPDLSPGLSETPLTGEKLQ QPDLSPGMSETPLTGEKLQQPDLSPGLSETPLTGEKLQQPDLSPGLSETPLTGEKLQQPD LSPGMSETPLTGEKLQQPDLSPGLSETPLTGEKLQQPDLSPGLSETPLTGEKLQLGCQAG >gi568815589f:610802_845232|GENSCAN_predicted_CDS_3|903_bp atggagaaggtgatcactcaaaatatgcagccagacctgtctccagtactgtctgaaatt ccattaacaggagagaagctgcagcagccagacctgtctccaggactgtctgaaattcca ttaacaggagagaagctgcagcagccagacctgtctccaggactgtctgaaattccatta acaggagagaagctgcagcagccagacctgtctccaggactgtctgaaactccattaaca ggagagaagctgcagcagccagacctgtctccaggactgtctgaaactccattaacagga gagaagctgcagcagccagacctgtctccaggaatgtctgaaactccattaacaggagag aagctgcagcagccagacctgtctccaggactgtctgaaactccattaacaggagagaag ctgcagcagccagacctgtctccaggactgtctgaaactccattaacaggagagaagctg cagcagccagacctgtctccaggactgtctgaaactccattaacaggagagaagctgcag cagccagacctgtctccaggaatgtctgaaactccattaacaggagagaagctgcagcag ccagacctgtctccaggactgtctgaaactccattaacaggagagaagctgcagcagcca gacctgtctccaggactgtctgaaactccattaacaggagagaagctgcagcagccagac ctgtctccaggaatgtctgaaactccattaacaggagagaagctgcagcagccagacctg tctccaggactgtctgaaactccattaacaggagagaagctgcagcagccagacctgtct ccaggactgtctgaaactccattaacaggagagaagctgcagcttggctgtcaggcaggc tga >gi568815589f:610802_845232|GENSCAN_predicted_peptide_4|1536_aa MEKDGNSRHFRLQKWGKRGVRVEKLPIEYGKAGDILSGDQDKEQKDPYFVETPYGYQLDL DFLKYVDDIQKGNTIKRLNIQKRRKPSVPCPEPRTTSGQQGIWTSTESLSSSNSDDNKQC PNFLIARSQVTSTPISKPPPPLETSLPFLTIPENRQLPPPSPQLPKHNLHVTKTLMETRR RLEQERATMQMTPGEFRRPRLASFGGMGTTSSLPSFVGSGNHNPAKHQLQNGYQGNGDYG SYAPAAPTTSSMGSSIRHSPLSSGISTPVTNVSPMHLQHIREQMAIALKRLKELEEQVRT IPVLQVKISVLQEEKRQLVSQLKNQRAASQINVCGVRKRSYSAGNASQLEQLSRARRSGG ELYIDYEEEEMETVEQSTQRIKEFRQLTADMQALEQKIQDSSCEASSELRENGECRSVAV GAEENMNDIVVYHRGSRSCKDAAVGTLVEMRNCGVSVTEAMLGVMTEADKEIELQQQTIE SLKEKIYRLEVQLRETTHDREMTKLKQELQAAGSRKKVDKATMAQPLVFSKVVEAVVQTR DQMVGSHMDLVDTCVGTSVETNSVGISCQPECKNKVVGPELPMNWWIVKERVEMHDRCAG RSVEMCDKSVSVEVSVCETGSNTEESVNDLTLLKTNLNLKEVRSIGCGDCSVDVTVCSPK ECASRGVNTEAVSQVEAAVMAVPRTADQDTSTDLEQVHQFTNTETATLIESCTNTCLSTL DKQTSTQTVETRTVAVGEGRVKDINSSTKTRSIGVGTLLSGHSGFDRPSAVKTKESGVGQ ININDNYLVGLKMRTIACGPPQLTVGLTASRRSVGVGDDPVGESLENPQPQAPLGMMTGL DHYIERIQKLLAEQQTLLAENYSELAEAFGEPHSQMGSLNSQLISTLSSINSVMKSASTE ELRNPDFQKTSLGKITGNYLGYTCKCGGLQSGSPLSSQTSQPEQEVGTSEGKPISSLDAF PTQEGTLSPVNLTDDQIAAGLYACTNNESTLKSIMKKKDGNKDSNGAKKNLQFVGINGGY ETTSSDDSSSDESSSSESDDECDVIEYPLEEEEEEEDEDTRGMAEGHHAVNIEGLKSARV EDEMQVQECEPEKVEIRERYELSEKMLSACNLLKNTINDPKALTSKDMRFCLNTLQHEWF RVSSQKSAIPAMVGDYIAAFEAISPDVLRYVINLADGNGNTALHYSVSHSNFEIVKLLLD ADVCNVDHQNKAGYTPIMLAALAAVEAEKDMRIVEELFGCGDVNAKASQAGQTALMLAVS HGRIDMVKGLLACGADVNIQDDEGSTALMCASEHGHVEIVKLLLAQPGCNGHLEDNDGST ALSIALEAGHKDIAVLLYAHVNFAKAQSPPPLVIPTQTGPGVDLQQGAADLQKTVLLEEK LTKSSNININKKDPHTKTPSKGHQPQRLRVDKSTKMRKKQRKNAETSKNQNASSPNDCNS SPTRAQNWMESEFDKLTEVGYRRWVINSSELKEHVLTKSKKAKNLDKMLQRLLTRITSLE KNINDLMELRNIAQELREVYTSINSQINQVEKRIRD >gi568815589f:610802_845232|GENSCAN_predicted_CDS_4|4611_bp atggaaaaagatggaaatagtagacattttagactacaaaagtggggaaagaggggagtg agggttgaaaaattacctattgagtacggaaaagcaggtgatattctcagtggagaccag gacaaggaacagaaagacccttactttgtggagaccccctatggttatcaactagactta gatttcctcaaatatgtggatgacatacagaagggaaataccatcaaaagactgaacatc cagaagaggcggaagccgtccgtgccatgcccagaacccaggaccacatctggtcagcaa ggtatatggacttccactgaatccctctcatcctccaacagtgatgacaacaagcagtgc cccaacttcctcatagccagaagtcaagttacatcaactccaatctcaaagccacctccc cctctggagacctcactcccttttcttaccatcccagaaaatcgacagctgccacctccc tcaccacaactcccaaagcataaccttcatgtcaccaagacactgatggagacccggaga agactggaacaggagagagccaccatgcagatgacaccgggtgagttcagaaggcccagg ctggccagttttggaggcatgggcaccacaagctccctcccttcttttgtgggttctgga aaccacaatcctgccaagcaccagcttcagaatggataccaaggtaatggggattatggt agctatgccccagctgctcccaccacttcctccatggggagctccatccgccacagcccc ctgagctcagggatctccaccccagtgaccaacgtgagccccatgcacctgcagcacatc cgcgagcagatggccattgctctgaaacgcctgaaggagctggaggagcaggtgcgaacc atccctgtgctccaggtaaagatctctgtcttgcaagaagagaaaaggcagttggtctca cagctgaaaaaccaaagggctgcatcccagatcaatgtctgtggtgtgaggaagcggtcc tatagtgcggggaacgcctcccagctggaacagctctcccgggcccgaagaagtggcggg gaattatacattgactatgaggaggaagaaatggagaccgtagaacagagcacgcagagg ataaaggagttccggcaacttacagcagacatgcaagccctggagcagaagatccaggac agcagctgtgaggcctcctcagagctcagggagaatggagagtgccggtctgtggctgtg ggtgccgaggagaacatgaacgacatcgtcgtgtaccacagaggctccaggtcctgtaag gatgcagctgtagggacacttgttgagatgagaaattgtggggtcagcgtgacagaggcc atgcttggagtgatgactgaagctgacaaagaaattgagctgcaacagcagaccatagaa tccttgaaggaaaagatctatcgcctagaagtacagcttagagaaaccacccatgaccgg gagatgactaaactgaaacaagagctgcaggctgctggatcgaggaaaaaggttgacaaa gccacgatggcccagccgcttgttttcagtaaggtggtggaggcagtggtgcagaccaga gaccaaatggtcggcagtcacatggacctggtggacacgtgtgttgggacctccgtggaa acaaacagtgtaggcatctcctgccagcctgaatgtaagaataaagtcgtagggcctgag ctgcctatgaattggtggattgttaaggagagggtggaaatgcatgaccgatgtgctggg aggtctgtggaaatgtgtgacaagagtgtgagtgtggaagtcagcgtctgcgaaacaggc agcaacacagaggagtctgtgaacgacctcacactcctcaagacaaacttgaatctcaaa gaagtgcggtctatcggttgtggagattgttctgttgacgtgaccgtctgctctccaaag gagtgcgcctcccggggcgtgaacactgaggctgttagccaggtggaagctgccgtcatg gcagtgcctcgtactgcagaccaggacactagcacagatttggaacaggtgcaccagttc accaacaccgagacggccaccctcatagagtcctgcaccaacacttgtctaagcactttg gacaagcagaccagcacccagactgtggagacgcggacagtagctgtaggagaaggccgt gtcaaggacatcaactcctccaccaagacgcggtccattggtgttggaacgttgctttct ggccattctgggtttgacaggccatcagctgtgaagaccaaagagtcaggtgtggggcag ataaatattaacgacaactatctggttggtctcaaaatgaggactatagcttgtgggcca ccacagttgactgtggggctgacagccagcagaaggagcgtgggggttggggatgaccct gtaggggaatctctggagaacccccagcctcaagctccacttggaatgatgactggcctg gatcactacattgagcgtatccagaagctgctggcagaacagcagacactgctggctgag aactacagtgaactggcagaagctttcggggaacctcactcacagatgggctccctcaac tctcagctcatcagcaccctgtcgtctatcaactctgtcatgaaatctgcaagcactgaa gagctgaggaaccctgacttccagaaaaccagtctgggtaaaatcacaggcaattatttg ggatatacctgtaagtgtgggggccttcagtcaggaagtcccttaagctcccagacatcc cagcctgagcaagaagtggggacctcagaaggaaagccaatcagcagcctggatgccttc cccactcaggaaggtacgctgtctccagtgaacctgacagacgaccagatcgccgctggc ctctatgcatgtacaaacaatgaaagtacactgaagtccatcatgaagaagaaagatggt aacaaagattcaaatggcgcaaaaaagaatcttcagtttgttggcattaatggagggtat gaaacaacttcaagtgatgattccagctcagatgaaagctcttcttccgagtcagatgac gagtgtgatgtcattgagtatcctcttgaagaagaggaggaggaggaggatgaagacact cggggaatggcagaagggcaccatgcagttaatattgaaggtttgaagtctgccagggtg gaagatgaaatgcaggttcaagaatgtgaacctgagaaggtggaaatcagagagaggtat gaattaagtgaaaagatgttgtctgcatgcaacttactgaaaaatactataaatgacccc aaagctttgaccagcaaagatatgaggttctgtctgaacaccctccagcacgagtggttc cgcgtgtccagtcagaagtcagccattccagccatggtgggggactacatagctgctttt gaggccatttccccagatgtcctccgctatgtcatcaacttggcagacggcaacggcaac acagccctccattacagcgtgtcccactccaacttcgagattgtgaagctgctgttagat gccgatgtgtgtaatgtggatcaccagaacaaggcaggctacacccccatcatgttggcg gccctcgccgctgtggaagcagagaaggacatgcggattgtggaagaactcttcggctgt ggggatgtgaatgccaaagctagtcaggcgggacagacggccctcatgctggcggtcagt cacggacggatagacatggtgaagggccttctggcctgtggggctgatgtcaacatccag gatgacgagggctccacggccctcatgtgtgccagcgagcacggacacgtggagattgtc aagctgctgctggcccagcccggctgcaacggtcacctagaggacaacgatggcagcact gcgctctcaatcgccctggaagcaggacacaaggacatcgctgttcttctgtatgcccat gtcaactttgcaaaagcccagtctccgcctccgctggtgatacccacgcaaacagggcct ggagtggacctccagcaaggtgcagcagacctgcagaagacagtcctgttagaagagaaa ctgacaaaaagcagtaacatcaacatcaacaaaaaggatccccacacaaaaaccccatcc aaaggtcaccagcctcaaagattgagggtagataaatccacaaagatgaggaaaaagcag cgcaaaaatgctgaaacttccaaaaaccagaatgcctcttctccaaatgattgcaactcc tctccaacaagggcacaaaactggatggagagtgagtttgacaaattaacagaagtaggc tacagaaggtgggtaatcaactcctctgagctaaaggagcatgttctaaccaaaagcaaa aaagctaagaaccttgataaaatgttacagagactgctaaccagaataaccagtttagaa aagaacataaatgacctgatggagctgagaaacatagcacaagaacttcgtgaagtgtac acaagtatcaatagccaaatcaatcaagtggaaaaaaggatcagagattga >gi568815589f:610802_845232|GENSCAN_predicted_peptide_5|65_aa MTERWIAMWNWLDKKARDSVNSSSTVKRHKALCLPQECTFYQETKDEIPKSGILGNTKTM EEAVE >gi568815589f:610802_845232|GENSCAN_predicted_CDS_5|198_bp atgactgaaagatggatagccatgtggaactggctggacaaaaaagcacgggattctgtg aactcaagctcaactgtaaaaaggcataaagcattatgcttgccacaagaatgtacgttt taccaggaaacaaaggatgaaattcctaagagtggcattttgggcaacaccaagacaatg gaggaagctgtggaataa >gi568815589f:610802_845232|GENSCAN_predicted_peptide_6|288_aa MLPVFFLPVFPSFPFPTPLSSRISPPPSPPPVRAISYRCIFPRGEPFDLTVWAGEQNHSL SISGERAEDYLCPARELQEDGASHPPQILCDYSSIFHLIQGPPPLNASGPSSPTCPFYLL TSPLQLRLRLQRTRLLRLLLRSVAVRRVHPSQQSPGERGGQSARTSPRGTMPNDEAFSKP STPSEAPHAPGVPPQGRAGGFGKASGALVGAASGSSAGGSSRGGGSGSGASDLGAGSKKS PRLPKCARCRNHGYASPLKGHKRFCMWRDCQCKKCNLIAERQRVMAAQ >gi568815589f:610802_845232|GENSCAN_predicted_CDS_6|864_bp atgctccccgtcttcttccttcctgtcttcccttcttttcccttccccaccccgctctcc agccgcatttccccacctccatccccgcccccggtgagagctatcagttaccgctgtata ttcccgcgtggagagccctttgacctcaccgtgtgggcgggagaacaaaaccactccctg tcaatcagtggggagcgggccgaggactacctgtgcccagcacgggagctgcaggaagat ggcgccagtcaccctcctcaaatcctgtgtgactacagttcaatcttccatctcatacaa ggtcctcctccactaaatgcttcaggtcctagcagtcccacttgtcccttctacctgctg acctcgccactccagctgcgcctccggctgcagcgcacacgtctcctgcgcctcctcctc cggagcgtcgctgtccgtcgggttcatccctcgcagcagtctccaggcgagagagggggc cagagtgctcgcacttctcctaggggcaccatgcccaacgacgaggcattcagcaagccc tctacaccgtcggaagcccctcacgcccccggggtaccgccgcagggcagagccgggggc tttggcaaagcgtctggggcgctagtgggggcggccagcggctcgagcgccgggggcagc agcagaggaggcggctccggctccggggcgtcggacctgggtgccgggagcaagaagtcc ccgcggctgcccaagtgcgcacgctgcaggaaccacggctacgcctcgccgctcaagggc cacaagcgcttctgcatgtggcgcgactgccagtgcaagaagtgcaacctgatcgccgag aggcagcgcgtgatggccgcgcag