GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:03:53 Sequence gi568815593f:159816906_160072489 : 255584 bp : 43.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 35201 36318 1118 1 2 60 53 271 0.310 14.56 1.02 Intr + 61162 61337 176 1 2 23 116 72 0.261 3.08 1.03 Intr + 62043 62111 69 1 0 64 68 62 0.248 0.95 1.04 Term + 65715 65884 170 1 2 96 33 51 0.268 -1.56 1.05 PlyA + 66028 66033 6 1.05 2.00 Prom + 80960 80999 40 -2.16 2.01 Init + 100001 100949 949 1 1 55 111 1389 0.805 131.62 2.02 Intr + 102817 102932 116 2 2 64 110 10 0.419 0.97 2.03 Intr + 111389 111530 142 1 1 46 58 73 0.226 0.03 2.04 Term + 114786 114979 194 1 2 61 46 122 0.682 2.88 2.05 PlyA + 116914 116919 6 1.05 3.04 PlyA - 119242 119237 6 1.05 3.03 Term - 133732 133588 145 0 1 53 41 209 0.857 10.08 3.02 Intr - 133940 133808 133 0 1 -20 55 208 0.820 6.40 3.01 Init - 134284 133945 340 1 1 97 45 414 0.936 33.42 3.00 Prom - 134893 134854 40 -3.96 4.00 Prom + 142797 142836 40 -7.26 4.01 Init + 147854 148008 155 1 2 53 80 175 0.946 12.69 4.02 Intr + 153787 153875 89 1 2 44 91 -19 0.111 -6.39 4.03 Term + 154974 155587 614 1 2 108 49 1084 0.556 101.15 4.04 PlyA + 156965 156970 6 1.05 5.00 Prom + 157812 157851 40 -3.16 5.01 Init + 193624 193953 330 0 0 102 111 312 0.993 32.21 5.02 Term + 195111 195149 39 2 0 70 52 40 0.402 -4.21 5.03 PlyA + 196465 196470 6 1.05 6.04 PlyA - 197150 197145 6 1.05 6.03 Term - 207073 206793 281 2 2 30 44 287 0.958 14.11 6.02 Intr - 208795 208730 66 2 0 93 61 41 0.615 0.78 6.01 Init - 210710 210656 55 2 1 59 115 29 0.788 4.15 6.00 Prom - 214201 214162 40 -8.36 7.00 Prom + 214794 214833 40 -5.06 7.01 Init + 215105 215107 3 1 0 62 115 0 0.314 0.10 7.02 Intr + 217417 217464 48 0 0 90 73 50 0.398 2.58 7.03 Intr + 218235 218295 61 2 1 61 94 25 0.256 -1.29 7.04 Intr + 219786 219898 113 1 2 109 91 -8 0.324 1.70 7.05 Intr + 226228 226264 37 0 1 81 87 57 0.655 2.74 7.06 Intr + 232609 232757 149 2 2 100 53 47 0.909 2.35 7.07 Intr + 234224 234278 55 1 1 117 87 53 0.909 6.65 7.08 Term + 248027 248160 134 0 2 87 42 106 0.720 4.15 7.09 PlyA + 248618 248623 6 1.05 8.02 PlyA - 249689 249684 6 1.05 8.01 Term - 254972 254826 147 2 0 81 54 138 0.988 7.60 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:159816906_160072489|GENSCAN_predicted_peptide_1|510_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWENIPCSWVGRINVM KMAILPKVIYRFNAIPIKLPMTFLTELEKITLKFIWNQKRAHIAKSILSQKNKAGGITLP DFKLYYQATVTKTAWCWYQNRDIDQWNRTEPSEIMLHIYNNLIFDKPDKKKWGKDSLFNK WFWENWLAIRRKLKLDPLLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDF MSRTPKAMATKVKIDKWDQIKLKSFGTAKETTIRVNRQLTEWEKIFAIYSSDKGLISRIY NELKQIYKKKTNNPIKKWAKGMNRHFSKDIYAAKRHMKNCSPSLAIREMQIKTTMRYHLT PVRMAIKKSGNNRTPGLNEAQESEDQMPRQSQAAPTVRDARPSSPTVALLNQSGPEMSSS CYLGYVRARSQDFSDVAQNPAKSQLPPYILTPALAKRQAFTIDMFPSPNSLIIQLCLVIS KLQNRDELVASGLKLSKLPNLLLGVCSFCN >gi568815593f:159816906_160072489|GENSCAN_predicted_CDS_1|1533_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaatgggagaacattccatgctcatgggtaggaagaatcaatgtcatg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcctcacagaattggaaaaaattactttaaagttcatatggaaccaaaaaaga gcccatattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacct gacttcaaactatactaccaggctacagtaaccaaaacagcatggtgctggtaccaaaac agagatatagaccaatggaacagaacagagccctcagaaataatgctgcatatctacaac aatctgatctttgacaaacctgacaaaaagaaatggggaaaggattccctatttaataaa tggttctgggaaaactggctagccatacggagaaagctgaaactggatcccctccttaca ccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgtctagaacaccaaaagcaatggcaacaaaagtcaaaattgacaaatgggatcaaatt aaactaaagagcttcggcacagcaaaagaaaccaccatcagagtgaacaggcaacttaca gaatgggagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaag ggtatgaacagacacttctctaaagacatttatgcagccaaaagacacatgaaaaattgc tcaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcaca ccagttagaatggctatcaaaaagtcaggaaacaacaggactccagggctgaacgaggcc caagaatcagaggaccaaatgccgaggcagtctcaggcagctccgacagtccgggacgcc aggcccagcagtcctacagttgccctcttgaatcagtcgggtccagagatgagcagcagc tgttacctaggctatgtcagggccaggagccaagacttttctgatgtcgctcaaaaccca gcaaagagccagctgcctccttacatcctcaccccggccctggcaaagcgtcaggcattc actattgacatgtttccatcacccaatagcctgatcatacagctgtgcttggtaatatcc aagctgcagaacagggatgaactcgtggcttcagggctcaagctctctaaattgccaaac ttgctcctgggtgtctgttcattttgcaattag >gi568815593f:159816906_160072489|GENSCAN_predicted_peptide_2|466_aa MNPDLDTGHNTSAPAHWGELKNANFTGPNQTSSNSTLPQLDITRAISVGLVLGAFILFAI VGNILVILSVACNRHLRTPTNYFIVNLAMADLLLSFTVLPFSAALEVLGYWVLGRIFCDI WAAVDVLCCTASILSLCAISIDRYIGVRYSLQYPTLVTRRKAILALLSVWVLSTVISIGP LLGWKEPAPNDDKECGVTEEPFYALFSSLGSFYIPLAVILVMYCRVYIVAKRTTKNLEAG VMKEMSNSKELTLRIHSKNFHEDTLSSTKAKGHNPRSSIAVKLFKFSREKKAAKTLGIVV GMFILCWLPFFIALPLACRGCDQSMLRKEKASAGCRNGSKVQALAFRGRGNSSKQHHESA QDTVESVHIGRLCSYDSIDSTLLATLTRSSSDHPLNTQVPLSVLEARKSKIKEQEHSVSG EDLPIDGDFYVSSHGRRGKQALECLFYKYTDPIYQSSALVTYLPFT >gi568815593f:159816906_160072489|GENSCAN_predicted_CDS_2|1401_bp atgaatcccgacctggacaccggccacaacacatcagcacctgcccactggggagagttg aaaaatgccaacttcactggccccaaccagacctcgagcaactccacactgccccagctg gacatcaccagggccatctctgtgggcctggtgctgggcgccttcatcctctttgccatc gtgggcaacatcctagtcatcttgtctgtggcctgcaaccggcacctgcggacgcccacc aactacttcattgtcaacctggccatggccgacctgctgttgagcttcaccgtcctgccc ttctcagcggccctagaggtgctcggctactgggtgctggggcggatcttctgtgacatc tgggcagccgtggatgtcctgtgctgcacagcgtccattctgagcctgtgcgccatctcc atcgatcgctacatcggggtgcgctactctctgcagtatcccacgctggtcacccggagg aaggccatcttggcgctgctcagtgtctgggtcttgtccaccgtcatctccatcgggcct ctccttgggtggaaggagccggcacccaacgatgacaaggagtgcggggtcaccgaagaa cccttctatgccctcttctcctctctgggctccttctacatccctctggcggtcattcta gtcatgtactgccgtgtctatatagtggccaagagaaccaccaagaacctagaggcagga gtcatgaaggagatgtccaactccaaggagctgaccctgaggatccattccaagaacttt cacgaggacacccttagcagtaccaaggccaagggccacaaccccaggagttccatagct gtcaaactttttaagttctccagggaaaagaaagcagctaagacgttgggcattgtggtc ggtatgttcatcttgtgctggctacccttcttcatcgctctaccgcttgcctgcagaggc tgtgaccagagcatgctcaggaaggagaaggcctcagcaggatgcaggaatggatcaaag gtgcaggctttggcttttaggggcaggggaaatagctccaaacagcaccatgagtcagct caggatacagtagaatctgtccacattggcaggctctgctcttacgactctattgacagc acactcctggccacactcacccgcagctcatctgaccaccctttgaacactcaagtgccc ctctctgttctggaggctcggaagtccaagatcaaggagcaagaacattcagtgtctggt gaggacctgcccatagatggtgacttctatgtgtcctcacatggcagaaggggcaaacag gctctcgaatgcctcttttataagtacactgatcccatttatcaaagttctgccctggtg acctacttgcctttcacatga >gi568815593f:159816906_160072489|GENSCAN_predicted_peptide_3|205_aa MEKAGADLQGGATRVIISAPSADAPMFVMGMNHKKYDNRLNIVSYASCTTNCLVPLGEVI HDNFGIVERFQNTVHAITATQKIVDGLSGKLWCDGCGALQNITPASTGTAKAVGVISKLN KKLTDMGFCVHITNVSVMDLTCRLEKPAKYDDIKKVVKDTHSSTFDAGAGIALKDHFVKL ISWYDNEFGYSNRVVDLMAHIASKE >gi568815593f:159816906_160072489|GENSCAN_predicted_CDS_3|618_bp atggagaaggctggggctgacttgcagggaggagctacaagggtcatcatctctgccccc tctgctgatgcccccatgttcgtgatgggcatgaaccacaagaagtatgacaacagactc aatatcgtcagctatgcctcctgcaccaccaactgcttagtgcccctgggcgaggtcatc catgacaactttggtattgtggaaagattccagaatacagtccacgccatcactgccacc cagaagattgtggatggcctctctgggaaactgtggtgcgacggctgcggggctctccag aacatcacccctgcctctactggcactgccaaggctgtgggggtcatctctaagctgaac aagaagctcactgacatgggcttctgtgtccacatcaccaacgtgtcagtcatggacctc acctgccgtctcgaaaaacctgccaaatatgatgacattaagaaggtggtgaaagacacc cactcctccacctttgatgctggggccggcattgccctcaaagaccattttgtcaagctc atttcctggtatgacaacgaatttggctacagcaacagggtggtggacctcatggcccac atagcctccaaggagtaa >gi568815593f:159816906_160072489|GENSCAN_predicted_peptide_4|285_aa MPTQHIDLILFTTFFILVAVVLVVTAVLAVVTVAALKAVIIAILIECFQCASHEQSSSCP HTPHPCVNLELCSFLIFASLSGSLFSTLKPPDAVFKVVFWLGYFNSCLNPIIYPCSSKEF KRAFVRILGCQCRGRGRRRRRRRRRLGGCAYTYRPWTRGGSLERSQSRKDSLDDSGSCLS GSQRTLPSASPSPGYLGRGAPPPVELCAFPEWKAPGALLSLPAPEPPGRRGRHDSGPLFT FKLLTEPESPGTDGGASNGGCEAAADVANGQPGFKSNMPLAPGQF >gi568815593f:159816906_160072489|GENSCAN_predicted_CDS_4|858_bp atgcccacacagcacatagacctcatcctcttcaccaccttcttcatcttggtagcagtg gtactggtggtaacagcggtgctggcagttgtaacagtagcagcgctcaaagcagtcatc atagctatacttattgagtgctttcaatgtgccagccatgaacaaagctcctcgtgtccc cacaccccacacccctgtgtgaacttggaactatgcagcttcctaatttttgccagccta tcaggctccttgttctccaccctgaagccccccgacgccgtgttcaaggtggtgttctgg ctgggctacttcaacagctgcctcaaccccatcatctacccatgctccagcaaggagttc aagcgcgctttcgtgcgcatcctcgggtgccagtgccgcggccgcggccgccgccgacgc cgccgccgccgtcgcctgggcggctgcgcctacacctaccggccgtggacgcgcggcggc tcgctggagcgctcgcagtcgcgcaaggactcgctggacgacagcggcagctgcctgagc ggcagccagcggaccctgccctcggcctcgccgagcccgggctacctgggccgcggcgcg ccaccgccagtcgagctgtgcgccttccccgagtggaaggcgcccggcgccctcctgagc ctgcccgcgcctgagccccccggccgccgcggccgccacgactcgggcccgctcttcacc ttcaagctcctgaccgagcccgagagccccgggaccgacggcggcgccagcaacggaggc tgcgaggccgcggccgacgtggccaacgggcagccgggcttcaaaagcaacatgcccctg gcgcccgggcagttttag >gi568815593f:159816906_160072489|GENSCAN_predicted_peptide_5|122_aa MGEKSENCGVPEDLLNGLKVTDTQEAECAGPPVPDPKNQHSQSKLLRDDEAHLQEDQGEE ECFHDCSASFEEEPGADKVENKSNEDVNSSELDEEYLIELEKNMSDEEKQCAPPEQEEKG QK >gi568815593f:159816906_160072489|GENSCAN_predicted_CDS_5|369_bp atgggggagaagtcagagaactgtggggttccagaggatctgttaaatggtttgaaggtt acagatactcaggaagccgagtgtgctggccctccagttcctgatcccaaaaatcagcat tcccagagtaagctgctcagggatgatgaggcccatctccaggaggaccagggagaagag gagtgttttcatgactgcagtgcctcatttgaggaggagccaggagcggacaaggttgag aacaaatctaatgaagatgtgaattcctctgaactagatgaagaatacctaatagaactg gaaaaaaacatgtcggatgaagagaaacagtgtgcaccaccagaacaggaggaaaaaggg caaaagtga >gi568815593f:159816906_160072489|GENSCAN_predicted_peptide_6|133_aa MTVDGHSKPQEEMKNIRNGVEIKNCLQEVELSNAQKLLQKGGTKYSLIPDEEEEKEEAKS AEFEKPDPTRNPSRKRKEEKKKNWKSSDSDSSDSENDTGKRARHTSKDSKAAKKKKKKKH KKKHIGVERLGNM >gi568815593f:159816906_160072489|GENSCAN_predicted_CDS_6|402_bp atgacagtagatggtcactcaaaaccacaggaggaaatgaagaacatcagaaacggtgtg gaaataaagaattgtttgcaggaagtagaactgagtaatgcacagaagctgctgcaaaaa ggtgggaccaaatactctctgatacctgatgaggaagaggaaaaggaagaggcaaagtca gcagagtttgagaagcctgaccccacaaggaatccttctagaaaaagaaaggaggagaag aaaaagaattggaagtcatctgactctgacagctcagactctgagaatgatacaggcaag agggcaaggcacacatcaaaagacagcaaggcagcaaagaagaagaaaaagaagaagcac aagaagaagcacataggagtggaaagattaggcaacatgtga >gi568815593f:159816906_160072489|GENSCAN_predicted_peptide_7|199_aa MDNILSAFVFDEFLAIYKRREESTRLKEEGNEQFKKGDYIEAESSYSRALEMCPSCFQKE RSILFSNRAAARMKQDKKEMAINDCSKAIQLNPSYIRAILRRAELYEKTDKLDEALEDYK SILEKDPSIHQAREACMRLPKQIEERNERLKEEMLGKLKDLGNLVLRPFGLSTENFQIKQ DSSTGSYSINFVQNPNNNR >gi568815593f:159816906_160072489|GENSCAN_predicted_CDS_7|600_bp atggataacattctctcagcctttgtgtttgatgagtttttggccatctataaaagaaga gaagagagcactagactaaaggaggagggaaatgaacagtttaagaaaggagattatata gaagctgaaagttcttatagtcgagccctcgaaatgtgcccatcctgcttccaaaaggag aggtcgattctattttcaaatagagctgcagcaaggatgaaacaggacaagaaagaaatg gccatcaatgactgcagcaaagcaattcaattaaaccccagctatatcagggcaatattg aggagagcagagttgtatgagaagacggacaagctagatgaagccctggaagactataaa tctatattagaaaaagatccatcaatacatcaagcaagagaagcttgtatgagattacct aagcaaattgaagaacgtaatgaaagactaaaagaagagatgttaggtaaattaaaagat cttgggaacttggttctccgaccttttgggctctccacggaaaatttccagatcaaacag gattcctctaccggctcgtactccatcaatttcgttcaaaatccaaataataacagataa >gi568815593f:159816906_160072489|GENSCAN_predicted_peptide_8|48_aa GRDEHMEETGTVMKGSDHRKQQEGTYNPAGSDHQLQAHKAASEWGLRS >gi568815593f:159816906_160072489|GENSCAN_predicted_CDS_8|147_bp gggagggacgagcatatggaggaaaccggcacagtaatgaagggaagtgatcacagaaaa caacaggaaggaacatacaatccagctggcagcgaccatcagctgcaggctcataaggct gcatctgaatgggggctgcgcagctga