GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:29:29 Sequence gi568815579f:29508657_29715377 : 206721 bp : 50.61% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 17265 18009 745 2 1 85 101 296 0.278 23.50 1.02 Intr + 18555 18739 185 1 2 108 86 286 0.987 29.91 1.03 Intr + 19777 19806 30 0 0 139 94 20 0.972 6.03 1.04 Intr + 21163 21634 472 0 1 106 90 886 0.919 83.25 1.05 Term + 29521 29696 176 2 2 46 41 82 0.041 -2.78 1.06 PlyA + 31065 31070 6 1.05 2.06 PlyA - 31422 31417 6 1.05 2.05 Term - 34297 34284 14 2 2 127 50 7 0.125 -0.84 2.04 Intr - 36475 36362 114 2 0 106 14 73 0.159 2.12 2.03 Intr - 37192 37073 120 2 0 66 64 50 0.056 0.87 2.02 Intr - 53546 53426 121 2 1 79 68 66 0.060 3.77 2.01 Init - 55346 55320 27 2 0 90 45 44 0.145 -0.03 2.00 Prom - 57180 57141 40 0.74 3.04 PlyA - 59678 59673 6 1.05 3.03 Term - 67356 67235 122 1 2 29 47 126 0.352 1.24 3.02 Intr - 73256 73170 87 0 0 24 86 79 0.319 1.24 3.01 Init - 74104 73963 142 1 1 49 89 67 0.500 3.20 3.00 Prom - 76804 76765 40 -5.06 4.00 Prom + 81209 81248 40 -4.06 4.01 Init + 89380 89472 93 0 0 41 94 98 0.672 6.08 4.02 Intr + 97540 97669 130 0 1 76 63 86 0.055 5.17 4.03 Intr + 100001 100053 53 0 2 107 82 44 0.995 4.33 4.04 Intr + 101753 101976 224 1 2 76 76 436 0.998 38.13 4.05 Intr + 103206 103283 78 0 0 83 113 11 0.761 1.77 4.06 Intr + 103461 103522 62 0 2 91 91 24 0.769 1.38 4.07 Intr + 105215 105316 102 0 0 107 95 46 0.925 7.35 4.08 Intr + 106588 106674 87 2 0 107 5 82 0.251 1.74 4.09 Intr + 108068 108211 144 0 0 53 30 106 0.008 1.55 4.10 Intr + 108748 108793 46 2 1 111 80 20 0.011 0.97 4.11 Intr + 113184 113289 106 0 1 39 61 67 0.274 -0.68 4.12 Intr + 115591 115742 152 0 2 84 75 182 0.801 15.46 4.13 Intr + 133027 133225 199 1 1 13 -2 241 0.096 6.95 4.14 Intr + 149175 149277 103 2 1 73 64 42 0.016 0.05 4.15 Intr + 162955 163003 49 2 1 76 74 45 0.059 -0.36 4.16 Intr + 165168 165996 829 0 1 112 -11 1733 0.151 157.21 4.17 Intr + 166020 166211 192 2 0 71 95 41 0.311 2.79 4.18 Intr + 168544 168728 185 0 2 136 40 43 0.163 2.89 4.19 Term + 173554 173779 226 1 1 56 55 133 0.295 3.05 4.20 PlyA + 176482 176487 6 1.05 5.00 Prom + 178976 179015 40 -7.46 5.01 Init + 181255 181341 87 0 0 70 100 108 0.616 10.84 5.02 Term + 188403 189086 684 2 0 -15 47 300 0.509 9.24 5.03 PlyA + 189878 189883 6 1.05 6.03 PlyA - 190951 190946 6 1.05 6.02 Term - 194321 194056 266 0 2 102 47 505 0.996 43.47 6.01 Init - 199757 199598 160 2 1 102 111 199 0.809 23.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 96902 96977 76 1 1 86 58 11 0.838 -2.84 S.002 Term - 122307 122185 123 0 0 48 47 136 0.862 3.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:29508657_29715377|GENSCAN_predicted_peptide_1|535_aa MQRWRRLGAQPLTAPSLPLPSPSPRLSPAPGPPRRTRSPRAALGRTPGLRAAAERPLALR KEPRRNRRGRLREPKEVAIGSRTRRRQGGRSSRLRGTHSLGQGWVPARPSPCPARRAETE PRIPTVLWTTGQREALTDRQQPPGGTASPAHPAQRPPPEPHRASRRGSGADWRPPAARGG GAARGHGRLGGVLARAGADPSPRGRRLSAPGPPPPPRPPREMEQRNRLGALGYLPPLLLH ALLLFVADAAFTEVPKDVTVREGDDIEMPCAFRASGATSYSLEIQWWYLKEPPRELLHEL ALSVPGARSKVTNKDATKISTVRVQGNDISHRLRLSAVRLQDEGVYECRVSDYSDDDTQE HKAQAMLRVLSRFAPPNMQAAEAVSHIQSSGPRRHGPASAANANNAGAASRTTSEPGRGD KSPPPGSPPAAIDPAVPEAAAASAAHTPTTTVAAAAAASSASPPSGQAVLLRQRHGSVLC LFSGYIQPFDWNLESITSHSLCTSSVICYHQHDTDSYAREWLAGSGGIIRGKMLA >gi568815579f:29508657_29715377|GENSCAN_predicted_CDS_1|1608_bp atgcagcggtggaggcggcttggcgcccagccactgaccgccccctcccttcccctcccc tccccctctccccgcctctctccggctccgggtccgccacgccggacccgctctccccgc gctgcgctgggtcggacgccaggtctgcgcgccgcggctgagcgcccactcgccctgcgg aaagagccgcggaggaaccggcgggggcggctgcgggagccgaaggaggtggcaatcggg agcagaaccagaaggagacagggaggcagaagctcgcggctccgtggcacgcactccctc ggccagggatgggtcccggcgcggcccagcccctgcccggcccgccgggcagagactgaa ccgcggatccccaccgtcctgtggacgaccggacagagagaggcactgaccgatcgccag cagcctcccggtgggaccgcgtctcctgcacaccccgcgcagcgccccccgccggagccg caccgggcaagccggcgagggagcggggctgattggcggccgccggcggccaggggaggg ggcgccgcgcggggccatggcaggctcggaggcgtcctagcccgagccggagccgatccg agcccacgcggccgccgcctctccgctcccgggcccccgccgccaccgcgccccccgcgg gagatggaacagcggaaccggctcggtgccctcggatacctgccgcctctgctgctgcat gccctgctgctcttcgtggccgacgctgcattcacagaagtccccaaagatgtgacagta cgggagggagacgacatcgaaatgccctgcgcgttccgggccagcggagccacctcgtat tcgctggagattcagtggtggtacctcaaggagccaccccgggagctgctgcacgagctg gcgctcagcgtgccgggcgcccggagcaaggtaacaaataaggatgcaactaaaatcagc accgtacgcgtccagggcaatgacatctcacaccggcttcggctgtctgccgtgcggctg caggacgagggcgtgtacgagtgccgcgtgtcggactacagcgacgacgacacgcaggag cacaaggcccaggcgatgctgcgcgtgctctcgcgcttcgcgccgcccaacatgcaggcc gccgaggccgtgtcccacatccagagcagcggcccgcgtcgccacggcccagccagcgcc gccaacgccaacaacgcgggcgccgcgagccgtaccacctccgagcccggccgcggcgac aagagcccgccgcccgggagccctcccgccgccatcgatcccgcagtccccgaggccgcg gcagcctcggcggcccacacgcccaccaccacagtcgcggcagctgctgctgcctcgtca gcgtcgccgccatcgggacaggcggtcctgctgcgccagaggcacggctcggtattatgc ttgttttccggatacatccaaccatttgattggaatcttgagtccatcacttcccactct ctctgtacgagttccgtcatctgttatcatcagcacgacaccgattcctatgccagagag tggttggcaggatccggggggataatccgtggaaagatgttagcttaa >gi568815579f:29508657_29715377|GENSCAN_predicted_peptide_2|131_aa MVLTEGPGQKEEVVQIIWPESPGFHLTTVGNTGCVQTSECKVGGAIRGAESGGTCAGWFY QKDTCTCMFIIALMTHNSKHMEPTQMPINDHGSPSDIGPSLTQSGGSLCPTGSKASSISS GLYVDGRSSVA >gi568815579f:29508657_29715377|GENSCAN_predicted_CDS_2|396_bp atggtcctcaccgagggccccgggcagaaggaggaggttgtacaaatcatctggcctgaa agtccaggattccacctgaccactgtgggaaacacaggctgtgttcaaacatccgaatgc aaagtgggtggtgctatccgaggggcagaatcagggggtacatgtgcaggttggttctac caaaaagacacctgcacctgtatgttcatcatagcactcatgactcacaacagcaaacac atggaaccaacccagatgcccatcaacgaccatgggtctccttcagacattgggccctcg ctgacccagtctggaggctccctttgtcctacaggctccaaggcaagttccatttcttcg gggctgtatgtggatggccgtagttcagtggcctag >gi568815579f:29508657_29715377|GENSCAN_predicted_peptide_3|116_aa MANLLYLIIVTHKNGNARWFNQILLLIHEKPNIWDQQGSRKTFTIERGLDTADNDIPVIA LEELHDSPVTVGPPWQAAGEEGLTFAQTTVALVVGEIQLSVSNKTLRASGRDEEAS >gi568815579f:29508657_29715377|GENSCAN_predicted_CDS_3|351_bp atggcaaatttgctgtatttgatcattgtgacccataaaaatggcaatgccagatggttc aatcaaatacttttactaattcatgagaaacctaacatctgggaccaacagggtagcaga aagaccttcaccatagaacgtgggcttgacacagcagacaatgatattcctgtgatagcc ctagaggagttacatgattctccagtgactgttgggcctccatggcaagccgctggcgag gaaggattgacttttgcccaaacgacagtggctctggttgtaggggagatccagctgtct gtgtctaataagactctccgagcatcagggagagacgaggaggcctcttaa >gi568815579f:29508657_29715377|GENSCAN_predicted_peptide_4|1019_aa MFYHLVPDGKKPGATLKATSAPKGKANGGRQAHAPPRWASAGDVTHSAISELRESATAAA SASSESAGSGPRMKSVIYHALSQKEANDSDVQPSGAQRAEAFVRAFLKRSTPRMSPQARE DQLQRKAVVLEYFTRHKRKEKKKKAKGLSARQRRELRLFDIKPEQQRYSLFLPLHELWKQ YIRDLCSGLKPDTQPQMIQAKLLKADLHGAIISVTKSKCPSYVGITGILLQETKHIFKII TKEDRLKVIPKLNCVFTVETDGFISYIYGSKFQLRSRCSEAAQVLSPKQHAVTAASDEHT HGLGTWSPSTLPPRTRAPAQRLPCGYTVALMAHPGVISPSPQCNHICLFHIIFFHWNVSC SDAGTLPMVFAAITIPSALKGGSPTRRVEMAPAVQAAEQGAVDKASVAKEAGAAAEVFSG EKGAGRARRCQKRKKEKQKRRRRRRKEEGGEGGGKKKKRKKRKRRKQKKWKKKMEEEKEE EKEKEGKERRVLQGVGRCWLEASVSPMGFSGGLHGPLHDSAASFPWSLQYTAKISKLKIA VLTRQLETMVDHLANTEINSQRIAAVESCFGASGQPLALPGRVLLGEGVLTKECRKKAKP RIFFLFNDILVYGSIVLNKRKYRSQHIIPLEEVTLELLPETLQAKNRWMIKTAKKSFVVS AASATERQEWISHIEECVRRQLRATGRPPSTEHAAPWIPDKATDICMRCTQTRFSALTRR HHCRKCGFVVCAECSRQRFLLPRLSPKPVRVCSLCYRELAAQQRQEEAEEQGAGSPGQPA HLARPICGASSGDDDDSDEDKEGSRDGDWPSSVEFYASGLTPGLQNICPQASSTAQAPKR AAPEAAQGSGTPSHGGRCSGGEWLFLDSQCLFAGHCVLMASLQGWNRTAAAPSPKVLGPV RAHGLGWPRTFAEPSHLRSCSRTSWAMPAAGDTAGSETNKARTACCPRQHLGYGDLGSGI LEQHTGTLRTVCKLLIPLKYQNAFGFQQPPDLLGAPTMKQRPLNLACPSHNHLLAQPNP >gi568815579f:29508657_29715377|GENSCAN_predicted_CDS_4|3060_bp atgttttaccatttggttcccgatgggaagaagccaggggcaaccctcaaagccaccagt gctccaaaaggaaaggccaacgggggccgccaggcgcatgcgcccccgaggtgggcgagc gccggtgatgtcacgcatagcgccatctccgagctccgagagtctgcgacagcagctgcc agtgcgtcatcagagagcgccggaagcggtccgagaatgaagagtgtgatctaccatgca ttgtctcagaaagaggcgaatgactccgatgtccagccttcaggagcacagcgggccgag gccttcgtgagggccttcctgaagcgcagcacgccccgcatgagcccgcaggcccgcgag gaccagctgcagcgcaaggcggtggtcctggagtacttcacccgccacaagcgcaaggag aagaagaagaaagccaaaggcctctctgccaggcaaaggagggagctgcggctctttgac attaaaccagagcagcagagatacagccttttcctccctctccatgaactctggaaacag tacatcagggacctgtgcagtgggctcaagccagacacgcagccacagatgattcaggcc aagctcttaaaggcagatcttcacggggctattatttcagtgacaaaatccaaatgcccc tcttatgtgggtattacaggaatccttctacaggaaacaaagcacattttcaaaattatc accaaagaagaccgcctgaaagttatccccaagctaaactgcgtgttcactgtggaaacc gatggctttatttcctacatttacgggagcaaattccagcttcggtcaaggtgttctgag gcagctcaagtgctctcccccaagcagcacgcagtcaccgctgccagtgatgaacacacg cacgggctaggcacatggagcccaagcacactgccacctcggacacgggccccggcccag cgactgccttgtggatacactgttgccctcatggcccacccaggggtcatatctcccagt ccccagtgcaaccatatctgcctgtttcatatcattttcttccactggaatgtgagctgc tcggacgctgggacactgcccatggtgttcgccgccataaccattcccagtgccctgaag ggtggcagccccactaggcgagtggagatggctccagccgtccaagctgctgaacaaggg gctgtggacaaggccagcgtggccaaggaggcaggggctgcagctgaggtcttctcaggg gaaaagggagcaggcagagcaagacgctgtcagaaaaggaagaaagagaagcagaagagg aggaggaggaggagaaaagaagaaggaggtgaaggaggaggaaagaagaaaaagaggaag aagaggaagaggaggaagcagaagaagtggaagaagaagatggaggaggagaaagaggag gagaaggagaaggaggggaaagaaaggagagttttgcaaggggttggtcgctgttggctg gaggcctcagtgtctcccatgggcttctccggagggctgcatggacctcttcatgactca gcagccagcttcccctggagtttgcagtatacagccaagatcagcaagctaaaaatagca gttcttacccgccagctggagacgatggtggaccacttggccaacacggagatcaacagc cagcgcatcgcggcagtggagagctgcttcggggcctcggggcagccgctggcgctgcca ggccgagtgctgctgggcgagggcgtgctgaccaaagagtgccgcaagaaggccaagccg cgcatcttcttcctctttaacgacatcctggtgtatggcagcatcgtgctcaacaagcgc aagtaccgcagccagcacatcatccccctggaggaggtcacactggagctgttgccggag acgctgcaggccaagaaccgctggatgatcaagacggccaagaagtcctttgtggtgtcg gccgcctccgctacggagcgccaggaatggattagccacatcgaggagtgcgtgcggcgg caactgagggccacgggccgcccgcccagcacggagcacgcggcaccctggatccccgac aaggccacggacatctgcatgcgctgcacgcagacgcgcttctctgccctcacgaggcgc caccactgccgcaagtgcggcttcgtggtctgcgctgagtgctcgcgccagcgcttcctg ctcccgcgcctgtcccccaagcccgtgcgcgtctgcagcctctgctaccgcgaactggcc gcccagcagcggcaggaggaggcggaggagcagggcgcggggtccccagggcagccagcc cacctggcccggcccatctgcggagcgtccagtggagatgacgatgactccgacgaggac aaggagggcagcagggacggcgactggcccagcagcgtggagttctacgcctcggggctg acccccggcctgcagaacatctgtccccaagccagctccactgcccaggcccccaagagg gcagctccagaagctgcccagggctccgggaccccatcccatggtggcaggtgcagcggt ggggagtggctctttctggactcccagtgcctttttgctggacactgtgtccttatggct tcactgcagggctggaacagaactgctgctgccccaagtcccaaggtgttagggcctgta agggcccacggcttggggtggcccaggaccttcgcagagccttcacacctgcggtcttgc tcccgcaccagctgggccatgccagctgctggggacaccgctgggagtgagacaaacaag gcccgcacagcatgctgcccacggcagcacctgggctatggggacctgggctccgggatt ctggagcagcacactgggacacttcgtacagtttgcaaattactaatacccttaaaatac caaaatgcctttggatttcagcagccaccagacctcctgggggcccccaccatgaagcag aggcctttgaacctcgcctgccccagccataaccacttgctggcacaacccaatccctga >gi568815579f:29508657_29715377|GENSCAN_predicted_peptide_5|256_aa MLLAIIPYFPGIIASNMAAPSELSDEELKAPDTRHPDTWTPNTQTPRHSDTQTPRNLTPR HPDTQTPRNLTPRHPDTQTPDTQTPRNLTARHPDTQTPDTQTPGHSDTQTLRHPDIQTPD TQKPDTPPPGHPDILPPEHPATQSLGHLDTQPPHHPDTLPLQYPDTPTPRHPDTPLPGHL DNPPPGHLDTQTPRPSATQTPCHSDTQTPHHLDTWTPGHPDTRTLRHLTTQTPRNLTPRN LDTRTPRHLTPRHTDT >gi568815579f:29508657_29715377|GENSCAN_predicted_CDS_5|771_bp atgttgctggcaattatcccgtattttcctgggataattgccagcaacatggcagccccc tctgagcttagtgatgaagaattgaaggcacctgacaccagacacccagacacctggaca cccaacactcagacacccagacactcagatacccagacacccagaaacctgacacccaga cacccggatacccagacacccagaaacctgacacccagacacccagatacccagacaccc gacacccagacacccagaaacctgacagccagacacccggatacccagacacccgacacc cagacacccggacactcagacacccagacactcagacacccagacatccagacacctgat acccagaaacctgacaccccaccacccggacacccagacatcctgccacctgaacaccct gccactcagtcacttggacacctggatactcagccaccccaccatccagataccctgcca ctccaatacccagacaccccaacccccagacacccagacactccactacctggacacctg gacaacccaccacctggacacctggacacccaaacacccagaccctctgccacccagaca ccctgccactcagacacccagacaccccaccacctggacacctggacacccggacaccca gacacccggacactcagacacctcaccacccagacaccgagaaacctgacacccagaaac ctggacacccggacacccagacacctgacaccaagacacacagacacttga >gi568815579f:29508657_29715377|GENSCAN_predicted_peptide_6|141_aa MTIMVEDIMKLLCSLSGERKMKAAVKHSGKGALVTGAMAFVGGLVGGPPGLAVGGAVGGL LGAWMTSGQFKPVPQILMELPPAEQQRLFNEAAAIIRHLEWTDAVQLTALVMGSEALQQQ LLAMLVNYVTKELRAEIQYDD >gi568815579f:29508657_29715377|GENSCAN_predicted_CDS_6|426_bp atgactatcatggtggaggacatcatgaagctgctgtgctccctttctggggagaggaag atgaaggcggctgtcaagcactctgggaagggtgccctggtcacaggggccatggccttc gtcgggggtttggtgggcggcccaccgggactcgccgttgggggggctgtcggggggctg ttaggtgcctggatgacaagtggacagtttaagccggttcctcagatcctaatggagctg ccccctgccgagcaacagaggctctttaacgaagccgcagccatcatcaggcacctggag tggacggacgccgtgcagctgaccgcgctggtcatgggcagcgaggccctgcagcagcag ctgctggccatgctggtgaactacgtcaccaaggagctgcgggccgagatccagtatgat gactag