GENSCAN 1.0 Date run: 11-Nov-116 Time: 16:59:28 Sequence gi568815593f:76616308_76833500 : 217193 bp : 42.23% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 2335 1507 829 2 1 89 68 521 0.467 39.73 1.01 Init - 6923 6860 64 2 1 58 91 98 0.261 6.49 1.00 Prom - 7256 7217 40 -7.05 2.00 Prom + 9902 9941 40 -7.35 2.01 Init + 10532 10534 3 1 0 26 95 0 0.230 -5.45 2.02 Intr + 11103 11193 91 2 1 91 75 140 0.999 11.65 2.03 Intr + 15552 15719 168 1 0 36 107 186 0.978 14.30 2.04 Intr + 24181 24278 98 2 2 32 49 84 0.317 -2.29 2.05 Intr + 24626 24796 171 1 0 39 91 181 0.984 12.72 2.06 Intr + 36443 36526 84 1 0 84 94 52 0.184 4.50 2.07 Intr + 38627 38696 70 1 1 69 106 44 0.240 2.14 2.08 Intr + 41679 41798 120 1 0 76 78 49 0.533 2.25 2.09 Intr + 42152 42360 209 0 2 90 78 299 0.917 26.97 2.10 Intr + 48719 48868 150 1 0 53 94 91 0.914 5.64 2.11 Intr + 52374 52537 164 2 2 119 53 70 0.993 4.45 2.12 Intr + 55452 55676 225 0 0 58 90 256 0.997 18.98 2.13 Intr + 57142 57282 141 1 0 70 97 123 0.997 9.95 2.14 Intr + 57645 57729 85 0 1 52 71 51 0.979 -1.30 2.15 Intr + 58170 58402 233 2 2 83 94 233 0.995 19.05 2.16 Intr + 60911 61043 133 2 1 81 89 157 0.994 14.83 2.17 Intr + 66808 66910 103 0 1 23 91 116 0.979 4.23 2.18 Intr + 67469 67610 142 0 1 81 106 139 0.996 13.49 2.19 Intr + 71470 71527 58 1 1 113 101 2 0.683 1.97 2.20 Intr + 72502 72687 186 0 0 28 36 133 0.409 1.06 2.21 Intr + 76817 76932 116 1 2 28 62 81 0.427 -2.17 2.22 Intr + 77048 77135 88 2 1 52 91 86 0.894 4.35 2.23 Intr + 79147 79359 213 0 0 79 2 226 0.869 11.09 2.24 Intr + 81680 81840 161 1 2 103 106 182 0.999 19.36 2.25 Intr + 84769 84906 138 1 0 65 100 123 0.999 9.86 2.26 Intr + 86175 86283 109 0 1 107 115 115 0.996 15.47 2.27 Term + 90893 91006 114 1 0 74 45 105 0.784 2.39 2.28 PlyA + 91801 91806 6 1.05 3.00 Prom + 95491 95530 40 -6.35 3.01 Init + 100001 100088 88 1 1 91 94 166 0.681 16.35 3.02 Intr + 116007 117057 1051 1 1 93 94 663 0.082 55.93 3.03 Term + 126212 126383 172 2 1 127 48 83 0.818 4.52 3.04 PlyA + 130441 130446 6 1.05 4.09 PlyA - 132913 132908 6 1.05 4.08 Term - 134496 134382 115 2 1 65 46 101 0.560 0.66 4.07 Intr - 135520 135338 183 0 0 94 60 135 0.622 9.28 4.06 Intr - 142430 142315 116 2 2 86 16 111 0.026 2.03 4.05 Intr - 152726 152670 57 2 0 72 84 55 0.002 1.66 4.04 Intr - 168315 168229 87 0 0 42 57 114 0.009 2.95 4.03 Intr - 190367 190107 261 2 0 54 6 171 0.055 2.26 4.02 Intr - 195976 195720 257 2 2 23 -5 166 0.157 -2.96 4.01 Init - 197182 197062 121 1 1 68 94 62 0.626 5.20 4.00 Prom - 197966 197927 40 -8.45 5.00 Prom + 201490 201529 40 -6.45 5.01 Init + 202876 202957 82 0 1 61 100 166 0.890 14.28 5.02 Intr + 216383 217184 802 0 1 137 -37 547 0.093 36.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 113715 113724 10 2 1 77 93 3 0.820 0.54 S.002 Term + 116007 117196 1190 1 2 93 49 693 0.903 56.80 S.003 Term + 203583 203716 134 1 2 65 46 96 0.813 0.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:76616308_76833500|GENSCAN_predicted_peptide_1|298_aa MKALIFAAAGLLLLLPTFCQSGMENDTNNLAKPTLPIKTFRGAPPNSFEEFPFSALEGWT GATITVKIKCPEESASHLHVKNATMGYLTSSLSTKLIPAIYLLVFVVGVPANAVTLWMLF FRTRSICTTVFYTNLAIADFLFCVTLPFKIAYHLNGNNWVFGEVLCRATTVIFYGNMYCS ILLLACISINRYLAIVHPFTYRGLPKHTYALVTCGLVWATVFLYMLPFFILKQEYYLVQP DITTCHDVHNTCESSSPFQLYYFISLAFFGFLIPFVLIIYCYAAIIRTLNAYDHRWLX >gi568815593f:76616308_76833500|GENSCAN_predicted_CDS_1|894_bp atgaaagccctcatctttgcagctgctggcctcctgcttctgttgcccactttttgtcag agtggcatggaaaatgatacaaacaacttggcaaagccaaccttacccattaagaccttt cgtggagctcccccaaattcttttgaagagttccccttttctgccttggaaggctggaca ggagccacgattactgtaaaaattaagtgccctgaagaaagtgcttcacatctccatgtg aaaaatgctaccatggggtacctgaccagctccttaagtactaaactgatacctgccatc tacctcctggtgtttgtagttggtgtcccggccaatgctgtgaccctgtggatgcttttc ttcaggaccagatccatctgtaccactgtattctacaccaacctggccattgcagatttt cttttttgtgttacattgccctttaagatagcttatcatctcaatgggaacaactgggta tttggagaggtcctgtgccgggccaccacagtcatcttctatggcaacatgtactgctcc attctgctccttgcctgcatcagcatcaaccgctacctggccatcgtccatcctttcacc taccggggcctgcccaagcacacctatgccttggtaacatgtggactggtgtgggcaaca gttttcttatatatgctgccatttttcatactgaagcaggaatattatcttgttcagcca gacatcaccacctgccatgatgttcacaacacttgcgagtcctcatctcccttccaactc tattacttcatctccttggcattctttggattcttaattccatttgtgcttatcatctac tgctatgcagccatcatccggacacttaatgcatacgatcatagatggttgtgn >gi568815593f:76616308_76833500|GENSCAN_predicted_peptide_2|1190_aa MDSESVSKVLWLDEIQQAVDDANVDKDRAKQWVTLVVDVNQCLEGKKSSDILSVLKSSTS NANDIIPECADKYYDALVKAKELKSERGVCLLWPGEEMGKKSEDLMAIAVFSLAVRGKRE DIIEEVTVGYIRENIWSASEELLLRFQATSSGPILREEFEARKSFLHEQEENVVKIQAFW KGYKQRKEYMHRRQTFIDNTDSIVKNNEIVKIQSLLRANKARDDYKTLAFTYFRLLPKDP LGWEKPLSASCSKNVSHLSSSSTRSQGRVGSENPPLTVIRKFVYLLDQSDLDFQEELEVA RLREEVVTKIRANQQLEKDLNLMDIKIGLLVKNRITLEDVISHSKKLNKKKGGEMEILNN TDNQGIKSLSKERRKTLETYQQLFYLLQTNPLYLAKLIFQMPQNKSTKFMDTVIFTLYNY ASNQREEYLLLKLFKTALEEEIKSKVDQVQDIVTGNPTVIKMVVSFNRGARGQNTLRQLL APVVKEIIDDKSLIINTNPVEVYKAWVNQLETQTGEASKLPYDVTTEQALTYPEVKNKLE ASIENLRRVTDKVLNSIISSLDLLPYGLRYIAKVLKNSIHEKFPDATEDELLKIVGNLLY YRYMNPAIVAPDGFDIIDMTAGGQINSDQRRNLGSVAKVLQHAASNKLFEGENEHLSSMN NYLSETYQEFRKYFKEACNVPEPEEKFNMDKYTDLVTVSKPVIYISIEEIISTHSLLLEH QDAIAPEKNDLLSELLGSLGEVPTVESFLGEGAVDPNDPNKANTLSQLSKTEISLVLTSK YDIEDGEAIDSRSLMIKWDCLVVGKQAQGIHSFCIMQPQESGGYYQYFTDEETRHRTGIN LFLGNTVTKWSRDSNPHDLKSSRAHTLDHKVLQLVEGKYNATTSSTESPDFDYQAFINFQ EPYSRPGEYPQECFILRTKKLIIDVIRNQPGNTLTEILETPATAQQEVDHATDMVSRAMI DSRTPEEMKHSQSMIEDAQLPLEQKKRKIQRNLRTLEQTGHVSSENKYQDILNEIAKDIR NQRIYRKLRKAELAKLQQTLNALNKKAAFYEEQINYYDTYIKTCLDNLKRKNTRRSIKLD GKGEPKGAKRAKPVKYTAAKLHEKGVLLDIDDLQTNQFKNVTFDIIATEDVGIFDVRSKF LGVEMEKVQLNIQDLLQMQYEGVAVMKMFDKVKVNVNLLIYLLNKKFYGK >gi568815593f:76616308_76833500|GENSCAN_predicted_CDS_2|3573_bp atggactctgagagtgtttccaaagtgctttggctggatgagatacagcaagccgtcgat gatgccaacgtggacaaggacagagcaaaacaatgggttactctggtggttgatgttaat cagtgtttggaaggaaaaaaatcaagtgatattttgtctgtattgaagtcttccacttct aatgcaaatgacataatcccggagtgtgctgacaaatactatgatgcccttgtgaaggca aaagagctcaaatctgaaagaggggtctgcctactctggcctggggaagaaatgggaaag aagtctgaggaccttatggccattgcagtcttttccttggcagtccgagggaagcgggag gacattattgaggaagtcacagtaggttacattcgtgagaatatatggtctgcttcagaa gagttgcttcttcgctttcaagccacaagctcaggacccatccttagggaagagtttgaa gctagaaaatcatttttgcatgaacaagaagagaatgtggtcaaaatacaggctttttgg aaaggatataaacaacggaaggagtatatgcacaggcggcaaacgttcattgataatact gattctattgtgaagaataatgaaattgtgaaaatacagtcactgttgagagcgaacaaa gctagagatgactacaaaacattggctttcacctacttccggctccttcccaaagatcct ttgggctgggagaaaccactttctgcttcctgcagcaagaatgtaagtcacttatctagc tcaagcacaaggtctcagggtcgagttggctctgaaaacccaccattaacagtaattcgc aaatttgtatacctgctggaccaaagtgatttggatttccaggaggaactagaggttgca cgattaagggaagaagtagtgaccaagatcagggccaatcaacagctggaaaaagacctg aacctgatggacatcaagattggactgctggtgaagaacaggatcacactagaggatgta atttcacatagtaaaaagctgaacaagaaaaaaggaggagaaatggaaatactgaataac accgacaaccaaggaataaaaagtttgagtaaggagaggagaaaaacactagaaacatat cagcagctgttttaccttttacagaccaaccctttatacttggctaagctgattttccag atgccacagaacaagtccactaaatttatggatactgttattttcacactatataattat gcctctaatcagcgagaagaatatctacttctcaagctttttaaaactgctctggaggaa gaaataaaatcaaaagtggaccaggtacaggacatagttactggtaaccctacagtcatc aagatggtcgtcagcttcaatagaggtgcccggggacagaacaccctgcgccaactcctg gctccagtggtaaaagagatcatcgacgacaagtcgctgattatcaacacaaaccctgta gaggtgtacaaggcttgggtgaaccaactagaaacacagactggagaggccagcaagttg ccttatgatgtgaccacagaacaagctctaacatacccagaagtgaaaaataaactggag gcttccattgagaacctgagaagggtcaccgacaaagtcctgaattctatcatttcttcc cttgatctactgccttatggattgaggtatatagccaaagtactgaagaattcgatccat gagaaattccccgatgcaacagaagatgagctattaaagattgttggaaacctcctgtac tatcggtacatgaatccagccattgtagctccagatggctttgatatcatcgacatgaca gctggaggtcagataaattctgaccaaaggagaaacttaggatcagtggccaaggttctt cagcacgcagcctccaacaagctgtttgaaggagaaaatgagcatctctcatctatgaac aattatttatcagagacgtatcaggaattcaggaaatatttcaaagaagcatgtaatgtc cctgagccagaagagaagtttaatatggacaaatacacagacctggtgacagtcagcaaa ccagtcatttatatttcaattgaagaaatcatcagcacacactcactcctgttggaacac caggatgcaattgcccctgagaaaaatgacttactgagtgaattgctggggtcgctggga gaggtgccaaccgtggaatcttttcttggggaaggagcagttgaccccaatgaccctaac aaggcaaatacactaagtcagctttcaaagaccgagatttctcttgtcttgacaagcaaa tatgacatagaggacggtgaagctatagatagccgaagcctcatgataaaatgggactgt ctagttgtaggaaaacaagctcagggcatccactcattctgtattatgcaaccccaagag agtggaggttattatcagtattttacagatgaggaaactaggcacagaacaggtataaac cttttcctgggtaacacagttactaagtggagcagagattcaaacccacatgatttgaaa tcctccagggcccacactcttgatcataaagttttacaacttgtcgagggcaagtataat gctacaacatcaagcacagaaagcccagactttgattatcaagcattcataaactttcag gagccttatagcagacctggtgaatatccgcaagaatgttttattttaaggaccaagaag ctgataattgatgtgatccggaaccagccagggaacacattgacagaaatcttagagaca ccagcaactgcgcaacaggaggtagaccatgccacggacatggtgagccgtgcaatgata gattccaggactccagaagaaatgaagcatagccaatctatgattgaagatgcacagctg cctcttgagcagaagaagaggaaaatccagaggaatcttcggacgttggaacagactgga cacgtgtcatccgaaaataaataccaagacattctcaatgagattgccaaggatattcga aatcaaagaatctatcgtaagcttcgaaaagctgaattggcaaaacttcagcagaccctg aatgcacttaacaagaaggcagcattttatgaagagcaaatcaattattatgacacctac ataaagacttgtttagacaacttaaaaagaaaaaatactcggagatcaattaaactagat ggaaaaggagaacccaaaggggcgaagagagcgaagccagtgaagtacactgcagcaaag ctgcatgagaaaggtgtcctgctagatatagatgatcttcaaacaaaccagtttaagaat gttacatttgatatcatagctactgaagatgtaggcattttcgatgtaagatcaaaattc cttggtgttgagatggaaaaggtgcaactcaatattcaggatttacttcagatgcaatat gaaggagtagctgtaatgaaaatgtttgataaggttaaagtgaatgtaaaccttctcata tacctgctgaacaagaagttctatggaaagtga >gi568815593f:76616308_76833500|GENSCAN_predicted_peptide_3|436_aa MGPRRLLLVAACFSLCGPLLSARTRARRPESKATNATLDPRSFLLRNPNDKYEPFWEDEE KNESGLTEYRLVSINKSSPLQKQLPAFISEDASGYLTSSWLTLFVPSVYTGVFVVSLPLN IMAIVVFILKMKVKKPAVVYMLHLATADVLFVSVLPFKISYYFSGSDWQFGSELCRFVTA AFYCNMYASILLMTVISIDRFLAVVYPMQSLSWRTLGRASFTCLAIWALAIAGVVPLLLK EQTIQVPGLNITTCHDVLNETLLEGYYAYYFSAFSAVFFFVPLIISTVCYVSIIRCLSSS AVANRSKKSRALFLSAAVFCIFIICFGPTNVLLIAHYSFLSHTSTTEAAYFAYLLCVCVS SISCCIDPLIYYYASSECQREKDSLNGSNLIHKTISEDTGGLKWQLPFFKDSARQIAAAP GYQKATVDIFPVSGSG >gi568815593f:76616308_76833500|GENSCAN_predicted_CDS_3|1311_bp atggggccgcggcggctgctgctggtggccgcctgcttcagtctgtgcggcccgctgttg tctgcccgcacccgggcccgcaggccagaatcaaaagcaacaaatgccaccttagatccc cggtcatttcttctcaggaaccccaatgataaatatgaaccattttgggaggatgaggag aaaaatgaaagtgggttaactgaatacagattagtctccatcaataaaagcagtcctctt caaaaacaacttcctgcattcatctcagaagatgcctccggatatttgaccagctcctgg ctgacactctttgtcccatctgtgtacaccggagtgtttgtagtcagcctcccactaaac atcatggccatcgttgtgttcatcctgaaaatgaaggtcaagaagccggcggtggtgtac atgctgcacctggccacggcagatgtgctgtttgtgtctgtgctcccctttaagatcagc tattacttttccggcagtgattggcagtttgggtctgaattgtgtcgcttcgtcactgca gcattttactgtaacatgtacgcctctatcttgctcatgacagtcataagcattgaccgg tttctggctgtggtgtatcccatgcagtccctctcctggcgtactctgggaagggcttcc ttcacttgtctggccatctgggctttggccatcgcaggggtagtgcctctgctcctcaag gagcaaaccatccaggtgcccgggctcaacatcactacctgtcatgatgtgctcaatgaa accctgctcgaaggctactatgcctactacttctcagccttctctgctgtcttctttttt gtgccgctgatcatttccacggtctgttatgtgtctatcattcgatgtcttagctcttcc gcagttgccaaccgcagcaagaagtcccgggctttgttcctgtcagctgctgttttctgc atcttcatcatttgcttcggacccacaaacgtcctcctgattgcgcattactcattcctt tctcacacttccaccacagaggctgcctactttgcctacctcctctgtgtctgtgtcagc agcataagctgctgcatcgaccccctaatttactattacgcttcctctgagtgccagagg gaaaaagatagcttaaatggctctaatcttatacataaaaccatttcagaagacactggg ggtttaaagtggcaactcccattttttaaagatagtgcccggcagatagcggcagcgcct ggttaccagaaagcaactgtggatatttttcctgtctctgggtctggctga >gi568815593f:76616308_76833500|GENSCAN_predicted_peptide_4|398_aa MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREHRVGEALLTSQTGRWGKGAPHISE EGRPGRDAPHFLDGMAAGKRRSSLPRWDGDRAETLLTFQTGQPRRGAPHVPDDRRPGRDA PHFPDGSLPPITPHLSAHIEQLLSWEWSCAGLSLAKETAAAQLKENRPAHSNRKVLGVKV KASKTKLAVDEPKEEGAMKRRMGRETKMQDTTENQNSNQESECYQNPGRLPRAVFQSRIP TRILDIRPLSDARFANIFSHSFRHLELPTFLASSQPDLSISTSHAAALKGADNVTPGVPW VTAAAASRAGRVFTMFSKTVRSQEPRGALAQHRDSRCQLQTPESASPERCLGTSVLHTHP SRLLKLPKHGPTLEPLHVLRFSPLIYLQIQLLIQMPFH >gi568815593f:76616308_76833500|GENSCAN_predicted_CDS_4|1197_bp atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcacca cccttaatccatttaaccctgagtggacacagcacctgtttcagagagcacagggttgga gaggcgctcctcacatcccagacagggcggtggggcaaaggcgctccccacatctcagaa gaagggcggccaggcagagatgctcctcacttcctagatgggatggcggccgggaagagg cgctcctcacttcctagatgggatggcgaccgggcagagacgctcctcactttccagact gggcagccacgcagaggggctcctcacgtcccagacgataggcggccaggcagagacgct cctcacttcccagacgggtctctaccccccatcactccccacctctctgcccatattgag cagctcctttcttgggagtggtcctgtgctggattgagtttagccaaggagactgctgct gcacagctcaaagaaaaccgacctgctcactccaacaggaaagtcctgggggtcaaagta aaagcatcaaagactaagttagcggtggatgagccaaaagaggagggagctatgaagagg aggatgggaagagaaaccaagatgcaagatactacagaaaaccagaattcaaatcaagaa tcagaatgctaccagaatcctggaagactccctcgtgctgtcttccagtcacgaattccc accaggattttggatattagacctttgtcagatgcacggtttgcaaatattttctcacat tctttcaggcacctggagctgccgacattcttagcttcctctcagccagacctctccatt tctacctcccatgctgctgctctcaaaggtgctgacaatgtcacacctggtgtcccctgg gtgacagcagcagctgccagcagagcaggcagggtcttcacgatgttttccaaaactgtc aggtcacaagaaccacgtggagcacttgctcaacatagagattccagatgccagctgcag actcctgaatcagcatctccagagagatgcctgggcacctctgtgcttcacacacacccc agcaggctgctgaagttacccaaacacggccccaccttggaacctttgcatgtgctgcgc ttttctccactgatctatctgcagatccagctcctcatccagatgccttttcattaa >gi568815593f:76616308_76833500|GENSCAN_predicted_peptide_5|295_aa MRSPSAAWLLGAAILLAASLSCSGTIQGTSRSSKGRSLIGKVDGTSHVTGKGVTVETVFS VDEFSASVLTGKLTTVFLPIVYTIVFVVGLPSNGMALWVFLFRTKKKHPAVIYMANLALA DLLSVIWFPLKIAYHIHGNNWIYGEALCNVLIGFFYGNMYCSILFMTCLSVQRYWVIVNP MGHSRKKANIAIGISLAIWLLILLVTIPLYVVKQTIFIPALNITTCHDVLPEQLLVGDMF NYFLSLAIGVFLFPAFLTASAYVLMIRMLRSSAMDENSEKKRKRAIKLIVTVLAI >gi568815593f:76616308_76833500|GENSCAN_predicted_CDS_5|885_bp atgcggagccccagcgcggcgtggctgctgggggccgccatcctgctagcagcctctctc tcctgcagtggcaccatccaaggaaccagtagatcctctaaaggaagaagccttattggt aaggttgatggcacatcccacgtcactggaaaaggagttacagttgaaacagtcttttct gtggatgagttttctgcatctgtcctcactggaaaactgaccactgtcttccttccaatt gtctacacaattgtgtttgtggtgggtttgccaagtaacggcatggccctgtgggtcttt cttttccgaactaagaagaagcaccctgctgtgatttacatggccaatctggccttggct gacctcctctctgtcatctggttccccttgaagattgcctatcacatacatggcaacaac tggatttatggggaagctctttgtaatgtgcttattggctttttctatggcaacatgtac tgttccattctcttcatgacctgcctcagtgtgcagaggtattgggtcatcgtgaacccc atggggcactccaggaagaaggcaaacattgccattggcatctccctggcaatatggctg ctgattctgctggtcaccattcctttgtatgtcgtgaagcagaccatcttcattcctgcc ctgaacatcacgacctgtcatgatgttttgcctgagcagctcttggtgggagacatgttc aattacttcctctctctggccattggggtctttctgttcccagccttcctcacagcctct gcctatgtgctgatgatcagaatgctgcgatcttctgccatggatgaaaactcagagaag aaaaggaagagggccatcaaactcattgtcactgtcctggccatn