GENSCAN 1.0 Date run: 3-Nov-116 Time: 21:44:08 Sequence gi568815584f:52630475_52871103 : 240629 bp : 39.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1686 1835 150 2 0 68 94 43 0.344 2.34 1.02 Intr + 3354 3472 119 2 2 76 71 116 0.779 7.04 1.03 Term + 4464 4641 178 0 1 80 48 60 0.688 -2.62 1.04 PlyA + 5072 5077 6 1.05 2.13 PlyA - 6395 6390 6 1.05 2.12 Term - 7805 7769 37 1 1 93 37 44 0.130 -4.37 2.11 Intr - 15813 15680 134 0 2 109 96 79 0.686 9.32 2.10 Intr - 21834 21765 70 2 1 84 86 21 0.496 -0.23 2.09 Intr - 22841 22595 247 0 1 91 38 126 0.494 3.50 2.08 Intr - 33373 33327 47 0 2 96 94 16 0.859 0.13 2.07 Intr - 36021 35901 121 1 1 46 44 149 0.902 4.93 2.06 Intr - 41229 41156 74 2 2 75 30 94 0.922 0.43 2.05 Intr - 41397 41321 77 0 2 67 20 195 0.943 7.99 2.04 Intr - 47998 47960 39 1 0 127 93 40 0.979 5.90 2.03 Intr - 51934 51851 84 1 0 112 47 54 0.766 2.80 2.02 Intr - 53433 53314 120 0 0 103 63 64 0.842 5.17 2.01 Init - 65007 64894 114 0 0 77 99 253 0.292 23.56 2.00 Prom - 70784 70745 40 -6.35 3.00 Prom + 73791 73830 40 -2.15 3.01 Init + 76704 76830 127 2 1 102 65 214 0.990 20.87 3.02 Intr + 77835 77914 80 1 2 45 96 111 0.949 5.95 3.03 Intr + 78009 78048 40 2 1 93 82 37 0.867 0.48 3.04 Intr + 78290 78342 53 0 2 79 90 78 0.813 4.81 3.05 Intr + 83407 83494 88 0 1 82 77 35 0.661 0.42 3.06 Intr + 87607 87668 62 2 2 102 98 16 0.776 1.53 3.07 Intr + 87755 87878 124 1 1 79 91 73 0.936 5.94 3.08 Intr + 90636 90716 81 1 0 69 82 50 0.687 1.19 3.09 Intr + 93491 93562 72 0 0 88 110 100 0.999 10.66 3.10 Term + 97025 97143 119 0 2 94 28 144 0.980 6.72 3.11 PlyA + 97195 97200 6 1.05 4.00 Prom + 97425 97464 40 -6.25 4.01 Init + 100001 100149 149 1 2 107 29 211 0.764 16.71 4.02 Intr + 127400 127450 51 2 0 101 110 10 0.357 1.60 4.03 Intr + 129208 129280 73 1 1 117 93 12 0.457 3.09 4.04 Intr + 138366 138459 94 2 1 111 110 28 0.443 5.92 4.05 Term + 140559 140632 74 1 2 36 38 101 0.205 -3.01 4.06 PlyA + 141520 141525 6 1.05 5.10 PlyA - 143201 143196 6 1.05 5.09 Term - 147984 147837 148 2 1 110 42 49 0.548 -1.11 5.08 Intr - 150266 150205 62 2 2 54 87 83 0.460 1.41 5.07 Intr - 151437 151310 128 1 2 83 95 145 0.997 14.18 5.06 Intr - 154190 154023 168 0 0 73 80 103 0.679 7.00 5.05 Intr - 163777 163622 156 2 0 81 37 71 0.002 0.36 5.04 Intr - 171120 171023 98 2 2 46 78 69 0.045 0.43 5.03 Intr - 189562 189512 51 0 0 104 80 60 0.014 3.90 5.02 Intr - 199375 199222 154 2 1 92 131 10 0.013 3.91 5.01 Init - 213376 213337 40 1 1 66 85 90 0.769 6.90 5.00 Prom - 216962 216923 40 -3.55 6.07 PlyA - 220899 220894 6 1.05 6.06 Term - 228076 227903 174 1 0 113 42 149 0.587 9.58 6.05 Intr - 229240 229099 142 0 1 84 52 193 0.999 14.73 6.04 Intr - 229991 229867 125 2 2 43 86 108 0.986 4.56 6.03 Intr - 234148 233927 222 1 0 79 80 152 0.994 10.90 6.02 Intr - 234379 234273 107 2 2 65 101 73 0.981 5.31 6.01 Init - 237652 237604 49 1 1 72 91 49 0.950 4.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 160448 160751 304 1 1 40 39 171 0.871 4.91 S.002 Init - 201355 201258 98 1 2 64 89 42 0.800 1.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:52630475_52871103|GENSCAN_predicted_peptide_1|148_aa GMSLCQTVVVGSVVILLYSSRACYNLVVVTISQDTLESPFNYGWDNLSDKAPAGMINSHS YSSRAYFFDNPRRYDSDDDLPRLGSSREGSLPNSQSLGWYGTMTGCGSSSYTVTPHLNGP MTDTAPLLFTCSNLDLNNHHSLYVTPQN >gi568815584f:52630475_52871103|GENSCAN_predicted_CDS_1|447_bp ggtatgtctctgtgccagactgtcgtcgtgggctctgtagtcattcttctgtactcttcc agagcttgttataatttggtggtggtcaccatatctcaggatacattagaaagtccattt aattatggctgggataatctttcagataaggcacctgctggcatgataaatagtcacagt tatagttccagagcttactttttcgacaatccaagacgatatgatagtgatgatgacctg ccaagactgggaagttcaagagaaggaagtttaccaaattcgcaaagtttgggctggtat ggcaccatgactgggtgtggcagcagcagttacacagtcactccccacctgaatggacct atgacagatactgctcctttgctctttacttgtagtaatttagatttgaacaatcatcat agcttatatgtgacaccacaaaactga >gi568815584f:52630475_52871103|GENSCAN_predicted_peptide_2|387_aa MGRGWGFLFGLLGAVWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFNNYR LFPRLQKLLESDYFRYYKVNLKRPCPFWNDISQCGRRDCAVKPCQSDEVPDGIKSASYKY SEEANNLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVD LLLNPERYTGYKGPDAWKIWNVIYEENCFKPQTIKRPLNPLASGQETWLEKKWGHNITEF QQRFDGILTEGEGPRRLKNLYFLYLIELRALSKVLPFFERPDFQLFTGNKIQDEENKMLL LEILHEIKSFPLHFDENSFFAGDKKEAHKLKTQGLGTALKILFSEKLIANMPESGPSYEF HLTRQEIVSLFNAFGRDMDEIGNNHSQ >gi568815584f:52630475_52871103|GENSCAN_predicted_CDS_2|1164_bp atgggccgcggctggggattcttgtttggcctcctgggcgccgtgtggctgctcagctcg ggccacggagaggagcagcccccggagacagcggcacagaggtgcttctgccaggttagt ggttacttggatgattgtacctgtgatgttgaaaccattgatagatttaataactacagg cttttcccaagactacaaaaacttcttgaaagtgactactttaggtattacaaggtaaac ctgaagaggccgtgtcctttctggaatgacatcagccagtgtggaagaagggactgtgct gtcaaaccatgtcaatctgatgaagttcctgatggaattaaatctgcgagctacaagtat tctgaagaagccaataatctcattgaagaatgtgaacaagctgaacgacttggagcagtg gatgaatctctgagtgaggaaacacagaaggctgttcttcagtggaccaagcatgatgat tcttcagataacttctgtgaagctgatgacattcagtcccctgaagctgaatatgtagat ttgcttcttaatcctgagcgctacactggttacaagggaccagatgcttggaaaatatgg aatgtcatctacgaagaaaactgttttaagccacagacaattaaaagacctttaaatcct ttggcttctggtcaagagacctggttagaaaagaaatggggacacaacattacagaattt caacagcgatttgatggaattttgactgaaggagaaggtccaagaaggcttaagaacttg tattttctctacttaatagaactaagggctttatccaaagtgttaccattcttcgagcgc ccagattttcaactctttactggaaataaaattcaggatgaggaaaacaaaatgttactt ctggaaatacttcatgaaatcaagtcatttcctttgcattttgatgagaattcatttttt gctggggataaaaaagaagcacacaaactaaagactcagggtttgggcactgctctgaag atcttattttctgagaaattgatagcaaatatgccagaaagtggacctagttatgaattc catctaaccagacaagaaatagtatcattattcaacgcatttggaagggacatggatgaa attggaaataatcattctcagtaa >gi568815584f:52630475_52871103|GENSCAN_predicted_peptide_3|281_aa MAIPGIPYERRLLIMADPRDKALQDYRKKLLEHKEIDGRLKELREQLKELTKQYEKSEND LKALQSVGQIVGEVLKQLTEEKFIVKATNGPRYVVGCRRQVIELPLTNPELFQRVGIIPP KGCLLYGPPGTGKTLLARAVASQLDCNFLKVVSSSIVDKYIGESARLIREMFNYARDHQP CIIFMDEIDAIDIDLPNEQARLDILKIHAGPITKHGEIDYEAIVKLSDGFNGADLRNVCT EAGMFAIRADHDFVVQEDFMKAVRKVADSKKLESKLDYKPV >gi568815584f:52630475_52871103|GENSCAN_predicted_CDS_3|846_bp atggccattcccggcatcccctatgagagacggcttctcatcatggcggaccctagagat aaggcgcttcaggactaccgcaagaagttgcttgaacacaaggagatcgacggccgtctt aaggagttaagggaacaattaaaagaacttaccaagcagtatgaaaagtctgaaaatgat ctgaaggccctacagagtgttgggcagatcgtgggtgaagtgcttaaacagttaactgaa gaaaaattcattgttaaagctaccaatggaccaagatatgttgtgggttgtcgtcgacag gtgatagaattacctcttacaaacccagagttatttcagcgtgtaggaataatacctcca aaaggctgtttgttatatggaccaccaggtacgggaaaaacactcttggcacgagccgtt gctagccagctggactgcaatttcttaaaggttgtatctagttctattgtagacaagtac attggtgaaagtgctcgtttgatcagagaaatgtttaattatgctagagatcatcaacca tgcatcatttttatggatgaaatagatgctattgatattgatttgccaaatgaacaagca agattagacatactgaaaatccatgcaggtcccattacaaagcatggtgaaatagattat gaagcaattgtgaagctttcggatggctttaatggagcagatctgagaaatgtttgtact gaagcaggtatgttcgcaattcgtgctgatcatgattttgtagtacaggaagacttcatg aaagcagtcagaaaagtggctgattctaagaagctggagtctaaattggactacaaacct gtgtaa >gi568815584f:52630475_52871103|GENSCAN_predicted_peptide_4|146_aa MEDVKLEFPSLPQCKEDAEVSRSRGCHAQASPCGSGRGATPVPNRLSRHLAAFVIAYIME TFGMKYRDAFAYVQERRFCINPNAGFVHQLQEYEAIYLAKLTIQMMSPLQIERSLSVHSG TTGSLKRTHEEEDDFGTMQVATAQNG >gi568815584f:52630475_52871103|GENSCAN_predicted_CDS_4|441_bp atggaggacgtgaagctggagttcccttcccttccacagtgcaaggaagacgccgaggtg agtcgctcccgtggctgccacgcacaggcctctccctgtggctccggccgaggggcgacc ccagtccccaaccgtcttagccgccaccttgcagcctttgttattgcatacattatggaa acatttggaatgaagtacagagatgcttttgcttatgttcaagaaagaagattttgtatt aatcctaatgctggatttgtccatcaacttcaggaatatgaagccatctacctagcaaaa ttaacaatacagatgatgtcaccactccagatagaaaggtcattatctgttcattctggt accacaggcagtttgaagagaacacatgaagaagaggatgattttggaaccatgcaagtg gcgactgcacagaatggctga >gi568815584f:52630475_52871103|GENSCAN_predicted_peptide_5|334_aa MVHQGLDNALDTRDIFPLTLSGPSLSTSPEPLRLWRAPGLPQAKHLPHSCFSPRLDHEDR WHSLRSQDFENPDSNKEIDMASCVTNTDKFVCDKSSSLLLEQLVTTSLQCRVVLMLGLQV LTMVPGQGILKLLIQANVPESLLNSITDIYVYPRNSLICANENIVVDLTRKMKPDETPMF DPSLLKEVDWSQNTATFSPAISPTHPGEGLVLRPLCTADLNRESFEHMKKSGDYYVTVVE DVTLGQIVATATLIIEHKFIHSCAKRGRVEDVVVSDECRGKQLGKLLLSTLTLLSKKLNC YKITLECLPQNVGFYKKFGYTVSEENYMCRRFLK >gi568815584f:52630475_52871103|GENSCAN_predicted_CDS_5|1005_bp atggtgcatcaggggcttgacaacgcactcgacaccagagatatttttcctctaactctt agtggtccttctttgagcacctccccagaacctcttaggctgtggagagcaccagggctc cctcaagccaagcacttgcctcacagctgcttttctccccgtctagaccacgaggacagg tggcactccctccggtcacaagattttgagaacccagattcaaataaagaaatagatatg gccagttgtgtgactaacactgacaaatttgtctgtgacaagagttcttccttactactt gagcagcttgtcaccacttcccttcagtgcagggtggttctaatgctgggattacaggtg ttaaccatggtacccggccaaggtattcttaaacttctcattcaggccaacgttcctgaa tcacttctcaactcaatcactgacatctatgtttatccacgtaatagccttatttgtgcc aatgagaatattgttgttgaccttactagaaaaatgaaacctgatgaaactcctatgttt gacccaagtctactcaaagaagtggactggagtcagaatacagctacattttctccagcc atttccccaacacatcctggagaaggcttggttttgaggcctctttgtactgctgactta aatagagaatcttttgagcatatgaagaaatctggggattattatgttacagttgtagaa gatgtgactctaggacagattgttgctacggcaactctgattatagaacataaattcatc cattcctgtgctaagagaggaagagtagaagatgttgttgttagtgatgaatgcagagga aagcagcttggcaaattgttattatcaacccttactttgctaagcaagaaactgaactgt tacaagattacccttgaatgtctaccacaaaatgttggtttctataaaaagtttggatat actgtatctgaagaaaactacatgtgtcggaggtttctaaagtaa >gi568815584f:52630475_52871103|GENSCAN_predicted_peptide_6|272_aa MRGSRRRNWQTLDDKNGCEVTPDVNISGQKFNIKLLIPVAEGMNEIWLRCDNEKQYAHWM AACRLASKGKTMADSSYNLEVQNILSFLKMQHLNPDPQLIPEQITTDITPECLVSPRYLK KYKNKQITARILEAHQNVAQMSLIEAKMRFIQAWQSLPEFGITHFIARFQGGKKEELIGI AYNRLIRMDASTGDAIKTWRFSNMKQWNVNWEIKMVTVEFADEVRLSFICTEVDCKVVHE FIGGYIFLSTRAKDQNESLDEEMFYKLTSGWV >gi568815584f:52630475_52871103|GENSCAN_predicted_CDS_6|819_bp atgagaggtagtcgcagaagaaactggcagacattagatgataaaaatggatgtgaagtt accccagatgtaaacatttcaggccaaaaatttaacattaaactcctgattccagttgca gaaggcatgaatgaaatctggcttcgttgtgacaatgaaaaacagtatgcacactggatg gcagcctgcagattagcctccaaaggcaagaccatggcggacagttcttacaacttagaa gttcagaatattctttcctttctgaagatgcagcatttaaacccagatcctcagttaata ccagagcagatcacgactgatataactcctgaatgtttggtgtctccccgctatctaaaa aagtataagaacaagcagataacagcgagaatcttggaggcccatcagaatgtagctcag atgagtctaattgaagccaagatgagatttattcaagcttggcagtcactacctgaattt ggcatcactcacttcattgcaaggttccaagggggcaaaaaagaagaacttattggaatt gcatacaacagactgattcggatggatgccagcactggagatgcaattaaaacatggcgt ttcagcaacatgaaacagtggaatgtcaactgggaaatcaaaatggtcaccgtagagttt gcagatgaagtacgattgtccttcatttgtactgaagtagattgcaaagtggttcatgaa ttcattggtggctacatatttctctcaacacgtgcaaaagaccaaaacgagagtttagat gaagagatgttctacaaacttaccagtggttgggtgtga