GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:36:38 Sequence gi568815595f:189689801_189994499 : 304699 bp : 37.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 32625 32761 137 2 2 46 67 105 0.015 3.86 1.02 Intr + 48842 48974 133 2 1 96 92 199 0.996 20.83 1.03 Intr + 92571 92624 54 2 0 58 92 77 0.030 3.36 1.04 Intr + 118472 118726 255 1 0 61 107 298 0.623 25.72 1.05 Term + 123199 123303 105 0 0 56 34 95 0.668 -1.67 1.06 PlyA + 123977 123982 6 1.05 2.00 Prom + 126730 126769 40 -5.95 2.01 Init + 128422 128486 65 0 2 41 108 33 0.312 1.27 2.02 Intr + 133595 133725 131 2 2 -26 78 176 0.139 4.62 2.03 Intr + 141640 141744 105 2 0 75 103 60 0.069 5.47 2.04 Term + 159605 159744 140 0 2 39 54 168 0.003 5.64 2.05 PlyA + 159972 159977 6 1.05 3.00 Prom + 160616 160655 40 -7.65 3.01 Init + 166795 166848 54 0 0 66 94 75 0.923 7.23 3.02 Intr + 174306 174618 313 2 1 20 98 272 0.605 16.23 3.03 Intr + 176882 176997 116 0 2 114 98 93 0.991 12.15 3.04 Intr + 177658 177825 168 0 0 77 67 54 0.717 1.42 3.05 Intr + 178033 178142 110 0 2 88 89 64 0.999 4.66 3.06 Intr + 178241 178320 80 2 2 57 98 61 0.989 2.28 3.07 Intr + 178780 178916 137 2 2 87 97 130 0.971 13.17 3.08 Intr + 183059 183195 137 1 2 102 79 137 0.982 12.65 3.09 Intr + 196594 196751 158 1 2 117 61 165 0.927 15.43 3.10 Intr + 199540 199684 145 2 1 89 109 28 0.966 3.42 3.11 Intr + 200989 201082 94 1 1 118 95 46 0.997 7.35 3.12 Intr + 204406 204698 293 0 2 103 38 370 0.029 28.51 3.13 Term + 216985 217246 262 1 1 55 37 182 0.017 3.81 3.14 PlyA + 217357 217362 6 -0.45 4.22 PlyA - 217442 217437 6 -0.45 4.21 Term - 218306 218110 197 0 2 50 41 149 0.244 2.99 4.20 Intr - 218942 218828 115 2 1 93 36 68 0.426 1.20 4.19 Intr - 219182 219093 90 2 0 91 57 62 0.384 2.67 4.18 Intr - 248387 248247 141 2 0 38 94 181 0.587 13.33 4.17 Intr - 253515 253389 127 2 1 100 51 71 0.403 4.36 4.16 Intr - 264842 264688 155 2 2 28 45 119 0.015 -0.45 4.15 Intr - 274298 274158 141 2 0 87 97 89 0.904 9.33 4.14 Intr - 278606 278486 121 1 1 71 54 85 0.563 2.98 4.13 Intr - 279522 279279 244 1 1 39 3 242 0.536 6.43 4.12 Intr - 280103 279710 394 2 1 68 10 239 0.040 7.40 4.11 Intr - 281091 281016 76 2 1 69 93 57 0.064 2.90 4.10 Intr - 282207 282090 118 1 1 107 48 87 0.091 5.20 4.09 Intr - 283224 283074 151 0 1 105 85 114 0.991 11.61 4.08 Intr - 284204 284109 96 2 0 110 37 130 0.985 9.39 4.07 Intr - 284885 284758 128 0 2 59 37 233 0.144 14.78 4.06 Intr - 293340 293246 95 2 2 28 115 72 0.787 2.59 4.05 Intr - 294790 294750 41 1 2 45 123 53 0.903 0.60 4.04 Intr - 297077 296988 90 2 0 124 111 57 0.995 10.87 4.03 Intr - 297869 297727 143 0 2 113 78 168 0.999 17.45 4.02 Intr - 299238 299107 132 1 0 74 88 60 0.800 4.30 4.01 Intr - 304483 304294 190 1 1 99 98 225 0.991 22.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 47995 48068 74 0 2 86 98 16 0.991 2.99 S.002 Init - 78670 78568 103 1 1 56 58 100 0.882 4.25 S.003 Init - 147012 146851 162 0 0 49 96 138 0.828 10.48 S.004 Term + 190264 190378 115 1 1 103 43 106 0.803 4.66 S.005 Term + 204406 204702 297 0 0 103 52 371 0.966 29.28 S.006 Term - 272007 271907 101 1 2 65 47 176 0.836 8.51 S.007 Term - 282207 282080 128 1 2 107 42 93 0.897 4.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:189689801_189994499|GENSCAN_predicted_peptide_1|227_aa MHDRIFGPEGLRAQGNQPLHFTEEELESQTGAGTCPRSQPVNALGLPICSVQPIDLNFVD EPSEDGATNKIEISMDCIRMQDSDLSDPMWLTASMTICHGGSCMEDMAPQYTNLGLLNSM DQQIQNGSSSTSPYNTDHAQNSVTAPSPYAQPSSTFDALSPSPAIPSNTDYPGPHSFDVS FQQSSTAKSATWTTSAGWLCCFPKPAQADVGSPCQCKERPFPDLRTC >gi568815595f:189689801_189994499|GENSCAN_predicted_CDS_1|684_bp atgcacgatagaatctttggaccagagggccttagagctcaggggaatcaaccccttcac ttcacagaggaggaacttgaatcccaaacaggggcagggacttgcccaagatcccagcca gtgaacgcattgggcctgcctatatgttcagttcagcccattgacttgaactttgtggat gaaccatcagaagatggtgcgacaaacaagattgagattagcatggactgtatccgcatg caggactcggacctgagtgaccccatgtggttgactgcgtctatgaccatttgccatggt ggcagctgtatggaggatatggcaccacagtacacgaacctggggctcctgaacagcatg gaccagcagattcagaacggctcctcgtccaccagtccctataacacagaccacgcgcag aacagcgtcacggcgccctcgccctacgcacagcccagctccaccttcgatgctctctct ccatcacccgccatcccctccaacaccgactacccaggcccgcacagtttcgacgtgtcc ttccagcagtcgagcaccgccaagtcggccacctggacgacttctgcaggctggctctgc tgctttccgaagcctgcccaagcagatgtaggctctccttgccagtgcaaggagaggccc tttccagatttaagaacctgttga >gi568815595f:189689801_189994499|GENSCAN_predicted_peptide_2|146_aa MFSHKNSVIIMDKISDQAIVINRWRLVGGACSKGEQFGECHGMQTHTVELSTQSVMWLTH QGKDRGHFHFHDIIRASQPHSKFSVVIFIFMLRMRTRKLTDLRVDSVTVELEDTQLGVCC LVSGEKLPHIWPQKSSSVLMIVAVVM >gi568815595f:189689801_189994499|GENSCAN_predicted_CDS_2|441_bp atgttttcccataagaacagtgtcataattatggataaaatttctgaccaagctattgtc atcaacagatggcggttggttggaggggcctgcagtaaaggagagcagtttggagaatgc cacggcatgcagacgcacactgtggagctgagcactcagtcggtgatgtggctcactcac caggggaaggacagaggtcattttcatttccatgatatcattcgagcttcacaaccacac tccaagttcagtgttgttatcttcatcttcatgttgaggatgaggacacggaagctcaca gatctccgggtagacagtgtcacagttgaattggaggacacccagcttggtgtctgctgc ttggtgagtggtgaaaaactcccacacatttggccacagaagtcatcttctgtgttgatg attgttgctgtggtaatgtga >gi568815595f:189689801_189994499|GENSCAN_predicted_peptide_3|688_aa MIIYVDRKEGIYKKLQELPVVNRQHAALKSGQIFIVLIWNVTISPVGSLLPFSTGPNSKQ YSTELKKLYCQIAKTCPIQIKVMTPPPQGAVIRAMPVYKKAEHVTEVVKRCPNHELSREF NEGQIAPPSHLIRVEGNSHAQYVEDPITGRQSVLVPYEPPQGKYSWSIATVFGNYPQFSL FSDIHEGGIAYLNYMMWISLQTNRIKDHKMLKPVSPKVGTEFTTVLYNFMCNSSCVGGMN RRPILIIVTLETRDGELVLCALNPCYKRLHKRSKKVETKEVGKSWADAALRPGSVLAQEE TGRRMKIASESSKFRTVQRTVMVRSAVRGRETYEMLLKIKESLELMQYLPQHTIETYRQQ QQQQHQHLLQKQTSIQSPSSYGNSSPPLNKMNSMNKLPSVSQLINPQQRNALTPTTIPDG MGANIPMMGTHMPMAGDMNGLSPTQALPPPLSMPSTSHCTPPPPYPTDCSIVSFLARLGC SSCLDYFTTQGLTTIYQIEHYSMDDLASLKIPEQFRHAIWKGILDHRQLHEFSSPSHLLR TPSSASTVSVGSSETRGERVIDAVRFTLRQTISFPPRDEWNDFNFDMDARRNKQQRIKEE GELEDLESRQCNSVCVQRLKNKARGGGCWWIVCILAGNWEYRCLKVKKMMEVAAQIGGKF VPPPPLALFRPSRDATHIGEDDLLYSVY >gi568815595f:189689801_189994499|GENSCAN_predicted_CDS_3|2067_bp atgatcatctatgtagatcgtaaggaaggcatctacaaaaagctacaagaactaccagta gtaaacaggcagcatgcagctctaaaaagtggacagattttcattgttctgatttggaat gtaacaatatctcctgttggttctctccttcctttctccactggccccaactctaagcag tattccactgaactgaagaaactctactgccaaattgcaaagacatgccccatccagatc aaggtgatgaccccacctcctcagggagctgttatccgcgccatgcctgtctacaaaaaa gctgagcacgtcacggaggtggtgaagcggtgccccaaccatgagctgagccgtgaattc aacgagggacagattgcccctcctagtcatttgattcgagtagaggggaacagccatgcc cagtatgtagaagatcccatcacaggaagacagagtgtgctggtaccttatgagccaccc cagggaaaatattcttggagcattgctactgtttttggcaattatccccagttttccctt ttcagtgatattcatgaaggtggtatagcctatttgaattacatgatgtggatcagctta caaacgaacaggatcaaagatcacaaaatgttaaagcctgtctcacctaaggttggcact gaattcacgacagtcttgtacaatttcatgtgtaacagcagttgtgttggagggatgaac cgccgtccaattttaatcattgttactctggaaaccagagacggagagcttgtcctttgt gctctaaatccttgctacaaacggttacataaaagatctaagaaagtggagacaaaggaa gtgggcaagtcctgggccgacgctgctttgaggcccggatctgtgcttgcccaggaagag acaggaaggcggatgaagatagcatcagaaagcagcaagtttcggacagtacaaagaacg gtgatggtacgaagcgccgtgaggggccgtgagacttatgaaatgctgttgaagatcaaa gagtccctggaactcatgcagtaccttcctcagcacacaattgaaacgtacaggcaacag caacagcagcagcaccagcacttacttcagaaacagacctcaatacagtctccatcttca tatggtaacagctccccacctctgaacaaaatgaacagcatgaacaagctgccttctgtg agccagcttatcaaccctcagcagcgcaacgccctcactcctacaaccattcctgatggc atgggagccaacattcccatgatgggcacccacatgccaatggctggagacatgaatgga ctcagccccacccaggcactccctcccccactctccatgccatccacctcccactgcaca cccccacctccgtatcccacagattgcagcattgtcagtttcttagcgaggttgggctgt tcatcatgtctggactatttcacgacccaggggctgaccaccatctatcagattgagcat tactccatggatgatctggcaagtctgaaaatccctgagcaatttcgacatgcgatctgg aagggcatcctggaccaccggcagctccacgaattctcctccccttctcatctcctgcgg accccaagcagtgcctctacagtcagtgtgggctccagtgagacccggggtgagcgtgtt attgatgctgtgcgattcaccctccgccagaccatctctttcccaccccgagatgagtgg aatgacttcaactttgacatggatgctcgccgcaataagcaacagcgcatcaaagaggag ggggagctggaggaccttgaaagcaggcagtgtaattcagtctgtgtccaaagactcaag aacaaagcgagaggaggagggtgttggtggattgtgtgcatccttgcagggaactgggaa taccgatgtctgaaagtcaagaagatgatggaggttgccgctcaaatagggggcaaattt gtccctcctccacctttggctctcttcaggccctcaagggatgccacccacattggtgaa gatgatcttctttactcagtctactga >gi568815595f:189689801_189994499|GENSCAN_predicted_peptide_4|994_aa ESYNAGVKHYEADDFEMAIRHFEQALREYFVEDTECRTLCEGPQRFEEYEYLGYKAGLYE AIADHYMQVLVCQHECVRELATRPGRLSPIENFLPLHYDYLQFAYYRVGEYVKALECAKA YLLCHPDDEDVLDNVDYYESLLDDSIDPASIEAREDLTMFVKRHKLESELIKSAAEGLGF SYTEPNYWIRYGGRQDENRVPSGVNVEGAEVHGFSMGKKLSPKIDRDLREGGPLLYENIT FVYNSEQLNGTQRVLLDNVLSEEQCRELHSVASGIMLVGDGYRGKTSPHTPNEKFEGATV LKALKSGYEGRVPLKSARLFYDISEKARRIVESYFMLNSTLYFSYTHMVCRTALSGQQDR RNDLSHPIHADNCLLDPEANECWKEPPAYTFRDYSALLYMNDDFEGGEFIFTEMDAKTVT SRVLAHCSFLSCADMASSTTTTAMKIGIIGGTGLDDPEILEGRTEKYVDTPFGKPFDVLI LGKIKNVDCVLLARHGRQHTILASKVNYQATIWALKEEGCTHVIGTTACGSLREEIQPGD IVIIDQFIDRTKSFMFCTWGVDVINMTTVPEVVLTKEAGICYASIAMVTDYDCWKEHEEA VLVDRDLQTPKENASKAKSLLLTAIPQIGSMEWDMDEAGNHHSQQTNTGTENQTPHVLTH KWELNNENTWTRRASIKPKCGRMISFSSGGENPHGVKAVTKGKRCAVALWFTLDPLYREL SAAHKAFIFTYIIVLNAYDTEHVHTLGTPGGILGMDRDVHFADEDMESSGERRGQKRQGG ADLLPVKCHIKEFHPDPHRCRHLRGIAVQIVLSLPFKDKVTSARAFLSPMRNVASSTVVE SHHTQFRALEVKAQQNERNWQTCKCKIHTHWSQVLGETYCKEIAFVGKSPWPLELGLIEG GFSRINFAPKSVLLAILEQHDYLMSLKNVEKSSHYEFAVVSILQLHSSLVVSSVSVDNEW SLYQLATKQKTRKLTICCSPQGIGAFLLLPHVFV >gi568815595f:189689801_189994499|GENSCAN_predicted_CDS_4|2985_bp gagagttacaatgcaggagttaaacattatgaggctgatgactttgagatggctatcagg cacttcgaacaagccttaagagaatatttcgttgaagatacagaatgccggaccctatgt gaggggcctcagagatttgaagaatatgagtatttagggtataaggctggtctgtatgaa gctattgcagatcactacatgcaggtgcttgtttgtcagcatgaatgtgtgagggaactt gccacccgccctggccgcctctctcccatcgagaattttcttcctctgcactatgattac ctacagtttgcctactatcgagttggtgagtatgtgaaagccctggagtgtgccaaagcc tatcttctatgccatccagatgatgaggatgtcctagacaatgtggattactatgagagt ctgctggatgatagcattgacccggcatccattgaggccagagaggatttaacaatgttt gtgaaacgtcataagctggagtctgagctgataaaatcagctgcagaaggtctggggttt tcatacactgaaccgaattattggatcagatatggaggacgacaggatgagaatcgggtc ccttcaggagtgaacgtagagggagcagaagttcatggattctcaatgggaaaaaagcta tcacccaagatagatcgagacctaagagaaggtggtcctctactctatgagaacatcaca ttcgtctacaactcggagcagctgaacgggactcagcgggttctcctggataacgtcctg tcggaagaacagtgccgggagctccacagcgtggccagtggaatcatgcttgttggtgat ggatacagaggaaaaacttcaccccatacacccaatgaaaagtttgaaggtgcaactgtc ctgaaagcactcaaatctggttatgaaggtcgagtcccactgaagagcgctcgtctgttt tatgacatcagcgaaaaggctcgaaggattgtagaatcttattttatgctgaactcaact ctgtatttttcctatacacacatggtctgccgaacagccctgtctggtcagcaggataga agaaatgacctcagtcatcccatccatgctgacaactgtttgttggatccagaggccaac gaatgctggaaggagcctcctgcttacacatttcgagactatagtgctctcctatatatg aatgatgactttgaaggaggagaattcatattcacagagatggatgctaagactgtgact tcccgagtgctcgcccactgcagcttcctttcctgtgcggacatggcctcaagcaccacc accactgccatgaagattggaataattggtggaacaggcctggatgatccagaaatttta gaaggaagaactgaaaaatatgtggatactccatttggcaagccatttgatgtcttaatt ttggggaagataaaaaatgttgattgtgtcctccttgcaaggcatgggaggcagcacacc atcctggcttccaaggtcaactaccaggcgaccatctgggctttgaaggaagagggctgt acacatgtcatagggaccacagcttgtggctccttgagggaggagattcaacctggcgat attgtcattattgatcagttcattgacaggaccaaaagcttcatgttctgcacctggggg gtggatgttatcaacatgaccacagttccagaggtggttctcactaaagaggctggaatt tgttatgcgagtattgccatggtgacagattatgactgctggaaggagcacgaggaagca gttttggtggaccgggacttacagaccccgaaagaaaatgctagtaaagccaaaagttta ctgctcactgccatacctcagataggatccatggaatgggacatggatgaagctggaaac catcattctcaacaaacaaacacaggaacagaaaaccaaacaccacatgttctcactcat aagtgggagctgaacaatgagaacacatggacacggagggcctctataaaaccaaaatgt gggcgcatgatcagcttctcatctggaggagagaaccctcatggggtgaaggcagtcacc aagggaaagaggtgtgctgtggctctgtggttcaccttggacccactttatagagaattg tctgcagctcacaaagcttttattttcacctatattattgttttgaatgcttatgacaca gagcatgttcacacactgggaaccccaggaggtattctgggcatggatcgtgatgtccat tttgcagatgaggacatggagagcagtggagagagaaggggacaaaagaggcagggagga gcagatcttctgccagtcaaatgtcacattaaagaattccacccagatccacacaggtgc agacacctgaggggcattgccgttcagatagtgctgtctctgccatttaaagacaaagtc acctcagcccgtgctttcctgtcaccaatgcgaaatgtagccagtagcacagtcgtagaa tcacatcacactcaatttcgtgctcttgaggtcaaggcccagcagaatgaaaggaactgg cagacttgcaaatgtaagattcatactcactggagccaagtgttaggggagacttactgt aaagaaatagcatttgtaggaaaatcaccttggcccctagagcttggtttaattgaaggt ggtttcagtaggattaactttgctccaaaaagtgtgcttttggctatattggagcagcat gactatttaatgtctttgaagaacgtagagaaaagcagtcactatgagttcgctgtggtc agcattctgcaacttcattcttctctcgtagtatccagtgtttctgtggacaatgaatgg tccctgtatcagctagccacaaaacagaaaactaggaaactcacaatttgctgttcaccc caaggcattggagccttcctgcttctcccacatgtcttcgtgtaa