GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:57:32 Sequence gi568815578f:50410528_50681460 : 270933 bp : 47.95% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 36995 37033 39 1 0 72 103 25 0.601 2.59 1.02 Term + 45407 45784 378 1 0 75 39 230 0.946 11.59 1.03 PlyA + 46880 46885 6 1.05 2.06 PlyA - 52415 52410 6 1.05 2.05 Term - 57111 56991 121 2 1 69 48 85 0.569 0.55 2.04 Intr - 58209 57969 241 1 1 -1 13 312 0.004 11.41 2.03 Intr - 60308 60272 37 2 1 92 100 3 0.006 -0.26 2.02 Intr - 68794 68708 87 1 0 79 80 134 0.993 11.87 2.01 Init - 72024 72004 21 0 0 93 110 16 0.988 3.50 2.00 Prom - 73188 73149 40 -2.16 3.00 Prom + 73301 73340 40 -5.96 3.01 Init + 79307 79319 13 1 1 84 97 2 0.254 1.04 3.02 Intr + 93355 93476 122 2 2 112 89 23 0.034 5.01 3.03 Intr + 97986 98115 130 2 1 41 60 87 0.005 1.47 3.04 Intr + 99945 100063 119 1 2 -14 80 158 0.006 4.98 3.05 Intr + 123146 123244 99 1 0 64 94 82 0.088 6.71 3.06 Intr + 150836 150926 91 1 1 118 95 54 0.941 8.67 3.07 Intr + 154442 154542 101 0 2 106 115 103 0.945 14.63 3.08 Intr + 157853 157951 99 1 0 115 123 30 0.081 9.51 3.09 Intr + 163990 164127 138 0 0 83 79 63 0.071 5.56 3.10 Intr + 167893 168102 210 0 0 87 95 51 0.368 4.81 3.11 Intr + 168641 168802 162 1 0 119 76 247 0.996 26.77 3.12 Intr + 169176 169399 224 2 2 71 85 309 0.855 25.83 3.13 Intr + 170738 170933 196 2 1 77 75 249 0.983 21.92 3.14 Term + 170952 171065 114 2 0 46 39 88 0.685 -1.73 3.15 PlyA + 172035 172040 6 1.05 4.24 PlyA - 172590 172585 6 1.05 4.23 Term - 176805 176705 101 1 2 95 42 74 0.799 1.79 4.22 Intr - 177365 177275 91 2 1 75 23 142 0.829 5.97 4.21 Intr - 179242 179159 84 1 0 99 105 102 0.726 13.02 4.20 Intr - 182019 181817 203 1 2 113 102 194 0.942 22.30 4.19 Intr - 182669 182508 162 0 0 69 116 150 0.981 15.85 4.18 Intr - 184977 184842 136 0 1 29 89 66 0.483 0.94 4.17 Intr - 185736 185613 124 2 1 29 39 147 0.616 4.49 4.16 Intr - 186368 186225 144 1 0 112 6 69 0.489 0.50 4.15 Intr - 187183 187053 131 1 2 88 117 213 0.833 23.69 4.14 Intr - 192117 191545 573 0 0 79 86 513 0.934 43.14 4.13 Intr - 194247 194118 130 2 1 109 105 164 0.999 20.90 4.12 Intr - 197230 197126 105 0 0 113 58 29 0.765 1.73 4.11 Intr - 198007 197862 146 1 2 84 81 281 0.980 26.08 4.10 Intr - 198211 198086 126 1 0 74 81 304 0.998 29.18 4.09 Intr - 198428 198385 44 0 2 54 80 24 0.678 -3.84 4.08 Intr - 198829 198766 64 1 1 74 87 84 0.726 5.19 4.07 Intr - 199195 199046 150 1 0 86 64 346 0.524 32.36 4.06 Intr - 200677 200654 24 1 0 128 113 34 0.577 8.12 4.05 Intr - 205553 205475 79 1 1 121 85 25 0.994 5.05 4.04 Intr - 209605 209459 147 0 0 96 80 206 0.100 19.95 4.03 Intr - 213787 213704 84 0 0 59 90 56 0.001 1.84 4.02 Intr - 220329 220211 119 0 2 158 96 131 0.049 20.16 4.01 Init - 221486 221430 57 2 0 44 38 74 0.066 -0.69 4.00 Prom - 227511 227472 40 -3.46 5.00 Prom + 236093 236132 40 -0.86 5.01 Init + 248332 248389 58 0 1 23 88 84 0.520 3.37 5.02 Intr + 249320 249473 154 0 1 86 19 64 0.262 -1.57 5.03 Intr + 251494 251618 125 1 2 79 19 132 0.445 5.63 5.04 Term + 251669 251727 59 0 2 79 42 82 0.381 0.55 5.05 PlyA + 251747 251752 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 99940 99699 242 2 2 102 38 183 0.806 10.89 S.002 Init + 210232 210459 228 0 0 110 35 357 0.857 30.97 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:50410528_50681460|GENSCAN_predicted_peptide_1|138_aa MKKKKEKHKFQRVLWEQAVWLEEEAELRWAPDGFGNRKEVGSWPHTDPLLWAFENSAKRR NMGSVFAGASPKPNRTPRASWSKVLANLKTHPVEPQGAEGHRRGPASKAVADRTGTISTQ RHRVMSSRLRPGIVTAPL >gi568815578f:50410528_50681460|GENSCAN_predicted_CDS_1|417_bp atgaagaaaaagaaggaaaagcataaattccagagggtgctgtgggagcaggcggtgtgg ctggaggaggaagctgaattgcgttgggctccagatgggtttgggaaccggaaggaagtg ggaagctggccccacacagacccacttctctgggcctttgagaacagtgccaagaggcgc aacatgggctcagtatttgccggggcctctcctaaaccaaacaggacgccacgggcaagt tggagcaaagtcctggcaaacctgaagacacatcccgtggagccccaaggagccgagggt cacaggaggggccctgcctccaaggccgtggcagacagaactggcaccatctccacacag cgtcaccgagtgatgtccagtcgtctcaggccggggattgtaactgccccgctctga >gi568815578f:50410528_50681460|GENSCAN_predicted_peptide_2|168_aa MELLILQKPPDPSRQRHKWLDAKKNALVEEDTSDGQCSSLRLPFPDKEGLGVKQDSVSKK PTNQKKKNQEEEKKEEEEEEEKEEKKEKEKEKEEEKEKKEEKEKKKEGRGGGGGGRKRRK EEGGRALFIPIFGINRLSLPPPPPEYNLIPNKETRKVNKLIRGTHTKS >gi568815578f:50410528_50681460|GENSCAN_predicted_CDS_2|507_bp atggagttgctgatccttcagaaacccccagaccctagcaggcagagacacaagtggctg gatgccaagaagaacgcactggtggaagaagacacaagcgacggccagtgttcatctcta agacttccttttccagataaagaaggcctgggtgtcaagcaagactctgtttcaaaaaaa ccaacaaaccaaaagaagaagaatcaagaagaagagaagaaggaggaggaagaggaggag gagaaggaggagaagaaggagaaggagaaggagaaggaggaggagaaggagaagaaggaa gagaaggagaagaagaaagaaggaagaggaggaggaggaggaggaagaaaaagaagaaaa gaagaaggaggaagagctttatttattcctatcttcggtattaacagactatccctgccc ccgccaccacctgaatacaacttgattccaaataaggaaacaaggaaagtgaacaaactt attcgtgggacccatactaagtcctag >gi568815578f:50410528_50681460|GENSCAN_predicted_peptide_3|605_aa MSQPGMSLAVPKSANQSKCPNSLRRAWVNLVGGRAIFLPGYVYPVDHRYENQLELLVKMQ IPGLIPRPKISESFGDGSRNLHYSTHFPERQTAQWAEKEAQQPPWPVMEMEKEFEQIDKS GSWAAIYQQKGTPLPPPRKEDLMYHFRKVQSGITVTKKIGQDIRHEASDFPCRVAKLPKN KNRNRYRDVSPFDHSRIKLHQEDNDYINASLIKMEEAQRSYILTQGPLPNTCGHFWEMVW EQKSRGVVMLNRVMEKGSLKCAQYWPQKEEKEMIFEDTNLKLTLISEDIKSYYTVRQLEL ENLTTQETREILHFHYTTWPDFGVPESPASFLNFLFKVRESGSLSPEHGPVVVHCSAGIG RSGTFCLADTCLLLMDKRKDPSSVDIKKVLLEMRKFRMGLIQTADQLRFSYLAVIEGAKF IMGDSSVQDQWKELSHEDLEPPPEHIPPPPRPPKRILEPHNGKCREFFPNHQWVKEETQE DKDCPIKEEKGSPLNAAPYGIESMSQDTEVRSRVVGGSLRGAQAASPAKGEPSLPEKDED HALSYWKPFLVNMCVATVLTAGAYLCYRTRWRDARVQRALAASPMVGFSSVVHLSQSQKK QIKGF >gi568815578f:50410528_50681460|GENSCAN_predicted_CDS_3|1818_bp atgagccagccagggatgtctctggctgtccctaagtcagccaaccagtcaaagtgccca aactctcttaggcgagcatgggtgaatcttgtggggggaagagccatcttccttcctggc tatgtgtacccagtggatcataggtacgagaatcagctggagttgttggtgaaaatgcag attccgggtctcatccctagaccaaagatatctgaatcatttggcgatgggtccaggaac ctgcattattcaacacacttcccagagcggcagacggcgcagtgggccgagaaggaggcg cagcagccgccctggcccgtcatggagatggaaaaggagttcgagcagatcgacaagtcc gggagctgggcggccatttaccagcagaaagggacacccttgcccccccccagaaaggaa gatttgatgtaccacttccgaaaggttcagtcgggcatcactgtaaccaagaagataggt caggatatccgacatgaagccagtgacttcccatgtagagtggccaagcttcctaagaac aaaaaccgaaataggtacagagacgtcagtccctttgaccatagtcggattaaactacat caagaagataatgactatatcaacgctagtttgataaaaatggaagaagcccaaaggagt tacattcttacccagggccctttgcctaacacatgcggtcacttttgggagatggtgtgg gagcagaaaagcaggggtgtcgtcatgctcaacagagtgatggagaaaggttcgttaaaa tgcgcacaatactggccacaaaaagaagaaaaagagatgatctttgaagacacaaatttg aaattaacattgatctctgaagatatcaagtcatattatacagtgcgacagctagaattg gaaaaccttacaacccaagaaactcgagagatcttacatttccactataccacatggcct gactttggagtccctgaatcaccagcctcattcttgaactttcttttcaaagtccgagag tcagggtcactcagcccggagcacgggcccgttgtggtgcactgcagtgcaggcatcggc aggtctggaaccttctgtctggctgatacctgcctcttgctgatggacaagaggaaagac ccttcttccgttgatatcaagaaagtgctgttagaaatgaggaagtttcggatggggctg atccagacagccgaccagctgcgcttctcctacctggctgtgatcgaaggtgccaaattc atcatgggggactcttccgtgcaggatcagtggaaggagctttcccacgaggacctggag cccccacccgagcatatccccccacctccccggccacccaaacgaatcctggagccacac aatgggaaatgcagggagttcttcccaaatcaccagtgggtgaaggaagagacccaggag gataaagactgccccatcaaggaagaaaaaggaagccccttaaatgccgcaccctacggc atcgaaagcatgagtcaagacactgaagttagaagtcgggtcgtggggggaagtcttcga ggtgcccaggctgcctccccagccaaaggggagccgtcactgcccgagaaggacgaggac catgcactgagttactggaagcccttcctggtcaacatgtgcgtggctacggtcctcacg gccggcgcttacctctgctacaggacgcgctggcgagatgctcgtgtgcagagagcactg gccgctagcccgatggtaggattcagttctgtggtgcatctgagccagtctcagaagaaa cagatcaaaggtttttaa >gi568815578f:50410528_50681460|GENSCAN_predicted_peptide_4|1007_aa MLEHGGLIVQEDEVDWSFFVTTMSVRLRFLSPGDTGAVGVVGRSASFAGFSSAQSRRIAN LPFPESAALTVSASLKMANLRRARFKRKSINRNSVRSRMPAKSSKMYGTLRKGSVCADPK PQQVKKIFEALKRGLKEYLCVQQAELDHLSGRHKDTRRNSRLAFYYDLDKVDELYEDYCI QCRLRDGASSMQRAFARCPPSRAARESLQELGRSLHECAEDMWLIEGALEVHLGEFHIRM KGLVGYARLCPGDHYEVLMRLGRQRWKLKGRIESDDSQTWDEEEKAFIPTLHENLDIKVT ELRGLGSLAVGAVTCDIADFFTTRPQVIVVDITELGTIKLQLEVQWKVGVRAGGHKAAAL SLVSWVNSVERPPAPSGSSPVCPFDTESFLVSPSPTGKFSMGSRKGSLYNWTPPSTPSFR ERYYLSVLQQPTQQALLLGGPRATSILSYLSDSDLRGPSLRSQSQELPEMDSFSSEDPRD TETSTSASTSDVGFLPLTFGPHASIEEEAREDPLPPGLLPEMAHLSGGPFAEQPGWRNLG GESPSLPQGSLFHSGTASSSQNGHEEGATGDREDGPGVALEGPLQEVLELLRPTDSTQPQ LRELEYQVLGFRDRLKPCRARQEHTSAESLMECILESFAFLNADFALDELSLFGGSQGLR AHTSHDSFPRGYAASPAPAPEGFCIRISPHGRAEAQFLLLASHNPTDEKDRPLPPPSSLK ASSRELTAGAPELDVLLMVHLQVCKALLQKLASPNLSRLVQECLLEEVAQQKHVLETLSV LDFEKVGKATSIEETCRRLLEQVVSCGGLLPGAGLPEEQIITWFQFHSYLQRQSVSDLEK HFTQLTKEVTLIEELHCAGQAKVVRKLQGKRLGQLQPLPQTLRAWALLQLDGTPRVCRAA SARLAGAVRNRSFREKALLFYTNALAENDARLQQAACLALKHLKGIESIDQTASLCQSDL EAVRAAARETTLSFGEKGRLAFEKMDKLCSEQREVFCQEADVEITIF >gi568815578f:50410528_50681460|GENSCAN_predicted_CDS_4|3024_bp atgttggaacatgggggtctcatcgtgcaggaagatgaggtggactggagctttttcgtg accaccatgtcggtgaggttgcggttcctgtcccctggggacacaggggccgtgggggtc gtgggccggagcgcctccttcgcaggcttcagcagtgcacagagccggaggatcgcaaac ctgccttttcccgagtctgcagccctgaccgtctctgccagcctcaagatggcaaacttg cggagggcccggttcaagagaaagtccatcaacaggaactccgtgagatcgcgaatgcct gcaaaatcctccaagatgtacggcacgctgcggaaggggtcggtctgtgcagacccgaag ccccagcaggtgaagaagatcttcgaagcattgaaaagaggcctcaaggagtatctgtgt gtgcagcaggctgagctggaccacctgtctggacgccacaaagacaccaggaggaattcc aggctggctttctattatgacctggacaaggtggatgagctgtacgaggactactgcatc cagtgccgcctgcgcgacggcgcctccagcatgcagcgggccttcgcccggtgccccccg agccgcgcagcccgagagagcctgcaggagctgggccgcagcctgcacgagtgcgccgag gacatgtggctcatcgagggggccctggaggttcacctgggcgagttccacatcaggatg aaaggcttggtgggctacgcacgcctctgtcccggagaccactatgaggtgctcatgcgt ctgggccgccagcgttggaagctcaagggtcggatcgagtcagatgacagccagacctgg gacgaagaggagaaggccttcatccccacgctgcatgagaacctggacatcaaggtgacg gagttgcggggcctgggctcgctggctgtgggtgcagtgacgtgtgacatcgccgacttc ttcacgacgcggccgcaggtcatcgtggtggacatcacggagttgggtaccatcaagctg cagctggaggtgcagtggaaggtgggtgtccgagctgggggtcacaaagctgctgccctc tccctcgtgagctgggtgaattctgttgagcgaccacctgcaccctcggggtcctcacct gtctgcccgtttgatactgagagcttcctggtgtcacccagccccacgggcaagttttct atgggcagcaggaagggctccttgtacaactggacacccccgagcacccccagcttccgg gagagatactacctgtctgtcctacagcagccaacacagcaggccttgctgctgggtggc ccaagggccacctccatcctcagctacctgtctgacagcgacctccggggtcccagccta agaagccagagtcaggagctgcctgagatggactccttcagctctgaggacccccgagac acggagaccagcacgtcggcgtccacctcagatgtgggcttcctgcccttgaccttcggt ccccacgcctccattgaagaggaggctcgggaggaccccctgcccccaggtctcctgcca gagatggcccacctctctggaggcccgtttgcagagcagcctggctggaggaacttagga ggggagagccccagcctgccacagggctccctgttccacagcggcacagcctcgagtagc cagaacggccacgaggaaggggcaaccggggacagagaggacgggcctggcgtggccctc gaggggcctctgcaggaggtcctggagttgctgaggcccacggactccacccagccccag ctccgggagctggagtaccaggtcctcggcttccgggaccggctgaagccctgcagagca cggcaggagcacacctcggccgagagcctgatggagtgcatcctggagagcttcgccttc ctcaatgccgacttcgccctggatgagctgtccctgtttgggggctcccagggtctccgg gcccacacctctcatgacagcttccctcgaggctacgcagcctcacctgccccagccccc gagggcttctgcatccgcatctcaccacatggaagagcagaggctcaattccttctcctc gcctctcacaatcccacggacgaaaaggaccggcccctgcccccaccgtcatcactgaaa gcgtcatccagggaactcacagccggtgccccagagctggacgtgctgctgatggtacac ctccaagtctgcaaagctctgctgcagaaactggcctcccctaatttatcaaggctggtc caggaatgcctcctggaagaagtggcacagcaaaagcacgttctggagacactttctgtc cttgactttgagaaggtcggcaaggcaacatccattgaagagacgtgccgcaggctcctg gagcaggtggtcagctgtggtgggctgctccccggagctgggctcccagaagaacagatc attacctggttccagtttcacagctacctgcagaggcagagcgtctctgacctggagaag cacttcacccagctcaccaaggaagtgacactcatcgaggagcttcactgtgcgggacag gccaaggtggtccggaagctgcaggggaagcggctgggccagctccagcctctgccccag accttaagagcctgggcgctgctccagctggacggcactccgagggtgtgcagggcggcc agcgctcgcctggctggtgcagtcaggaacagaagcttccgggaaaaggctttgctgttc tacaccaacgccctggcagagaacgacgcaaggctccagcaggccgcatgcctagcgctc aaacacctcaagggcattgaaagcatcgaccagactgccagcctgtgccagtctgacctg gaggccgtgcgggcggcagcccgggaaaccacactgtcgttcggtgaaaaaggacggtta gcttttgagaagatggacaagctctgctcagaacaaagagaagtcttttgccaggaggca gatgttgaaatcacaatattttaa >gi568815578f:50410528_50681460|GENSCAN_predicted_peptide_5|131_aa MPGLTLQGYVILQILVDLGDTLSPIRWRRHVTHMVWPLMTEQAMDTGPRRDHSLTGHEHE FRYMPQKDEPGWFCNQQPLQTSLGGSKELTPDEEKDQPQTHSSSSHFQTHGRRQDPLKEE PSEKKGKAKRN >gi568815578f:50410528_50681460|GENSCAN_predicted_CDS_5|396_bp atgccaggtcttactctccaaggctacgttatcctgcaaattctggtggacctgggagac accctatctccaatcaggtggaggaggcacgtgacccacatggtctggccactgatgact gaacaagctatggacaccggaccccggagagaccattcactcactggccacgaacatgag ttcagatacatgccccaaaaggatgagcctggctggttctgcaatcagcagcccctccag acatccctgggaggctccaaggagctgactcctgatgaggaaaaggatcagccgcagact cactcctcctccagccacttccagacccacgggaggcggcaggaccctctgaaggaggag ccctcagagaagaagggaaaagccaaaaggaattaa