GENSCAN 1.0	Date run: 7-Nov-116	Time: 14:38:33

Sequence gi568815597r:35084459_35293049 : 208591 bp : 42.47% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   9456   9625  170  1  2   78   38   150 0.819   7.74
 1.02 Intr +  20099  20311  213  1  0   61   80    66 0.290   1.09
 1.03 Intr +  25836  25989  154  2  1   52   95    34 0.412  -0.78
 1.04 Intr +  27314  27454  141  0  0   35   93   109 0.493   5.50
 1.05 Intr +  27629  27672   44  0  2   56   95    51 0.962  -0.36
 1.06 Intr +  28519  30513 1995  0  0   79   14   503 0.843  28.37
 1.07 Term +  37838  38050  213  1  0   47   48   230 0.607  11.15
 1.08 PlyA +  38647  38652    6                               1.05

 2.22 PlyA -  39191  39186    6                               1.05
 2.21 Term -  42798  42792    7  2  1  115   53     0 0.563  -4.44
 2.20 Intr -  43976  43814  163  0  1   63  103    47 0.547   1.81
 2.19 Intr -  45410  45309  102  0  0  100  113    65 0.849   9.43
 2.18 Intr -  49137  49003  135  1  0   20   95   115 0.223   5.02
 2.17 Intr -  51107  51055   53  1  2   47  110    60 0.180   1.73
 2.16 Intr -  51953  51683  271  0  1   93   18   121 0.079   1.28
 2.15 Intr -  58544  58372  173  1  2   22   38   221 0.013   9.16
 2.14 Intr -  64784  64647  138  1  0   54   57   101 0.009   2.26
 2.13 Intr -  92941  92861   81  0  0   74   97    72 0.117   4.53
 2.12 Intr -  94219  94161   59  1  2   54   72    35 0.033  -4.74
 2.11 Intr - 102664 102543  122  2  2   89   50   110 0.450   6.59
 2.10 Intr - 102793 102745   49  1  1   54  107    53 0.996   1.13
 2.09 Intr - 103632 103515  118  2  1   50  100   189 0.982  15.85
 2.08 Intr - 104629 104545   85  2  1   41   83   104 0.983   3.16
 2.07 Intr - 104924 104728  197  1  2   30   93   239 0.958  16.74
 2.06 Intr - 106135 106040   96  0  0   18   69   166 0.983   5.91
 2.05 Intr - 106537 106236  302  1  2   58  105   440 0.994  37.31
 2.04 Intr - 107071 106883  189  1  0   64   70   173 0.997  11.96
 2.03 Intr - 108738 107764  975  0  0  -20  105   400 0.048  21.12
 2.02 Intr - 126418 126287  132  1  0   19   69   133 0.061   4.42
 2.01 Init - 159447 159340  108  0  0   53   94    55 0.104   2.87
 2.00 Prom - 169770 169731   40                              -3.45

 3.06 PlyA - 169786 169781    6                               1.05
 3.05 Term - 177300 177108  193  2  1  -19   38   301 0.990  10.31
 3.04 Intr - 179371 179315   57  0  0   74  110    71 0.967   4.98
 3.03 Intr - 184186 184165   22  2  1   53  115    45 0.043  -0.62
 3.02 Intr - 195043 194884  160  1  1   -8   64   171 0.303   3.74
 3.01 Intr - 200182 200068  115  0  1   19   43   147 0.203   2.83

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  53169  53008  162  0  0   45  114    80 0.815   6.08
S.002 Sngl -  60896  60678  219  2  0   89   37   194 0.909   9.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:35084459_35293049|GENSCAN_predicted_peptide_1|976_aa
XIWKLFFRKKPISLELENSFASDTKMKEPLLGGECDKAVASQLGLLDEIKTEPDNAQIQY
EVKYQNVKHNLCSNACLSKFHSANNFIMNCCENCGTYCYTSSSLSHILQMEGQSHYFNSS
KSITAYKQKPAKPLISVPCKPLKPSDEMIETTSDLGKTELFCSINCFSAYSKAKMESSSV
SVVSVVHDTSTELLSPKKDTTPVISNIVSLADTDVALPIMNTDVLQDTVSSVTATADVIV
DLSKSSPSEPSNAVASSSTEQPSVSPSSSVFSQHAIGSSTEVQKDNMKSMKISDELCHPK
CTSKVQKVKGKSRSIKKSCCADFECLENSKKDVAFCYSCQLFCQKYFSCGRESFATHGTS
NWKKTLEKFRKHEKSEMHLKSLEFWREYQFCDGAVSDDLSIHSKQIEGNKKYLKLIIENI
LFLGKQCLPLRGNDQSVSSVNKGNFLELLEMRAKDKGEETFRLMNSQVDFYNSTQIQSDI
IEIIKTEMLQDIVNEINDSSAFSIICDETINSAMKEQLSICVRYPQKSSKAILIKERFLG
FVDTEEMTGTHLHRTIKTYLQQIGVDMDKIHGQAYDSTTNLKIKFNKIAAEFKKEEPRAL
YIHCYAHFLDLSIIRFCKEVKELRSALKTLSSLFNTICMSGEMLANFRNIYRLSQNKTCK
KHISQSCWTVHDRTLLSVIDSLPEIIETLEVIASHSSNTSFADELSHLLTLVSKFEFVFC
LKFLYRVLSVTGILSKELQNKTIDIFSLSSKIEAILECLSSERNDVYFKTIWDGTEEICQ
KITCKGFKVEKPSLQKRRKIQKSVDLGNSDNMFFPTSTEEQYKINIYYQGLDTILQNLKL
CFSEFDYCKIKQISELLFKWNEPLNETTAKHVQEFYKLDEDIIPELRFYRHYAKLNFVID
DSCINFVALKEDLDALKEKFRTMESNQKSSFQEIPKLNEELFSKQKKLEKIESGEMSLNK
VWINITEMNKQCRVGS

>gi568815597r:35084459_35293049|GENSCAN_predicted_CDS_1|2931_bp
ngaatctggaaactgttcttcaggaagaaacccattagtttggaactggagaattccttt
gcatcagatactaaaatgaaagaaccacttttaggtggtgagtgtgacaaggcagtggca
tcacagctggggctgctagatgaaattaagacagaacccgacaatgctcaaattcagtat
gaagtaaaataccaaaatgtgaaacataatctttgcagtaatgcctgcctttcaaagttt
cactctgctaacaacttcatcatgaactgctgtgagaactgtggcacttactgttacacc
agctctagtctgtcccacatacttcagatggaaggacagtctcattactttaatagttca
aagagtattacagcatataagcagaaacctgccaaaccacttatatctgttccttgcaaa
ccattgaagccctcagatgaaatgattgagactacgagtgatttggggaagacagagctt
ttctgctctattaattgtttctctgcatacagtaaagctaagatggaatcttcttcagta
agtgttgtttctgtggtgcatgatacttcaacagagcttctttctccaaagaaagatacg
actccagttataagcaatatagtgtcattggcagacaccgatgttgccttgcccatcatg
aacactgatgtcttacaagatacagtttcttcagtaacagcaacagcagatgtcattgtg
gatctttctaagagttcacctagtgaacccagtaatgctgttgctagtagtagtacggaa
cagccaagcgtttcaccatcttcatcagtattcagtcagcatgcaattggttccagtaca
gaagtacaaaaagacaatatgaaatctatgaaaataagtgatgaactatgtcacccaaaa
tgtacatccaaagtacaaaaagttaaaggtaaatcacgaagtattaaaaaatcttgttgt
gcagattttgagtgtttggaaaacagtaaaaaagatgtggcattctgttattcatgccag
ttgttctgccaaaaatattttagctgtggaagagagtcatttgcaacccacggaacttct
aattggaaaaaaaccctggaaaaattcagaaagcatgaaaaaagtgaaatgcatttgaag
tcattggaattttggagagaataccaattttgtgatggagctgtcagtgacgatttatct
attcattcgaaacagattgagggaaataaaaagtacctaaagcttataattgaaaatatt
ttatttcttggaaagcagtgtttacccttaagaggaaacgaccagtcagtttcatctgtg
aataaaggcaattttttagaattgttagaaatgagagcaaaagataaaggagaagaaaca
tttcgacttatgaattcacaagttgacttctataacagtacacaaattcaaagtgatatt
atcgaaataataaagactgaaatgttgcaggatattgtgaatgagatcaatgactcctca
gcattttcaatcatatgtgatgagacaatcaatagtgccatgaaagaacagctttcaatt
tgtgtaagatacccacaaaaatcatcaaaggctatcttaattaaggaaagattcttgggt
tttgttgatactgaggagatgactgggacccacttacataggactatcaaaacttatctg
cagcaaattggagttgatatggataaaatacatggccaggcctatgatagcaccactaat
ttgaagataaaatttaataaaatagcagcagaattcaagaaagaagaaccaagagcttta
tacatacattgttatgcacactttttggatttatcaataattaggttttgtaaagaagta
aaagaactccgaagtgctctaaaaactctcagttctttgttcaacactatttgtatgtct
ggggaaatgttggcaaattttcgaaacatttataggctaagtcaaaacaaaacatgcaag
aaacatatatcacaatcatgttggacagtccatgatcgtacattactatctgtgattgac
agtcttccagagattattgaaacattggaagttatagcaagccattcttcaaatacaagt
ttcgccgatgaattgagtcatttgctgacattggtttccaaatttgaatttgtcttttgt
ttgaaattcctgtatcgagtgctgagtgttacaggaattctttccaaagagcttcaaaat
aaaaccatagacattttttctttgtcttcaaaaatagaagcaattttggaatgtttatca
tctgaaagaaatgacgtatactttaaaacaatctgggatggaacagaggaaatatgtcaa
aaaataacctgtaaaggttttaaagttgaaaaaccttctcttcagaaaagaagaaaaatt
cagaaatcagtagatcttggcaattcagataatatgttttttcctacttcaacagaagaa
caatataaaattaatatctattaccaaggattagatactatattacaaaatttaaagtta
tgtttttcggagtttgattattgcaaaataaagcaaatttcagaactgttatttaaatgg
aatgaaccattaaatgaaacaacagcaaaacatgttcaggaattttataaacttgatgag
gacattatcccagaacttagattttatcgacattatgcaaagcttaactttgtcatagat
gatagttgcataaacttcgttgctctcaaggaggatctggatgccctcaaggaaaaattt
cgaacaatggaatctaatcagaaaagctcattccaggaaatccccaaacttaatgaagaa
ctattcagcaagcaaaaaaaacttgagaagattgaatctggagagatgagtttgaacaaa
gtctggataaacatcacggaaatgaataagcagtgcagggtcgggtcatga

>gi568815597r:35084459_35293049|GENSCAN_predicted_peptide_2|1184_aa
MRDHVEWSQVNPAEAILDQPAASQPCSCLQIHYKVQKHQILTPRWALKRQQARGLWSKFS
TQYKGSGGGLQGGRCRTAEEAGPCQLAYIRRGQRRRAACVIRHFVRSKVASTFPERLLRF
CLDRPLTTDMSRDRFRSRGGGGGGFHRRGGGGGRGGLHDFRSPPPGMGLNQNRGPMGPGP
GQSGPKPPIPPPPPHQQQQQPPPQQPPPQQPPPHQPPPHPQPHQQQQPPPPPQDSSKPVV
AQGPGPAPGVGSAPPASSSAPPATPPTSGAPPGSGPGPTPTPPPAVTSAPPGAPPPTPPS
SGVPTTPPQAGGPPPPPAAVPGPGPGPKQGPGPGGPKGGKMPGGPKPGGGPGLSTPGGHP
KPPHRGGGEPRGGRQHHPPYHQQHHQGPPPGGPGGRSEEKISDSEGFKANLSLLRRPGEK
TYTQRCRLFVGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRALAEIAKAE
LDDTPMRGRQLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVIVDDRGRST
GKGIVEFASKPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKLAQKNPMYQ
KERETPPRFAQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEMEDAYHEHQ
ANLLRQDLMRRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIRQREMEEQMRR
QREESYSRMGYMDPRERDMRMGGGGAMNMGDPYGSGGQKFPPLGGGGGIGYEANPGVPPA
TMSGSMMGSDMAYSEGECLLPQKKIGSLLSQWIDTSQDITQRESIKEGSHKQTDNVSRSE
VIGGWSRQLLRQLLRQLKPALWNIPAGDSDQTSPSDVDPESVPGQLTLAAQDKWTSQQDA
DHPTWLILRRDQAAAEDDSPALVTPEADQSTHGRSLRTHRANPSDLSPPRQAEPRPNSSS
ASAPPPYNPFITSPPHTWSGLQFRSSTSLLPPAQQFLLKKVAGAKGILKVNAPFSLSNLS
ENQLAFRPFFIKSPLNQHGRRVSGNSHMEGSSIINASLIKTLLKAALLPKEAGVIHCKGH
QKASDPIALDNAYAVSLASNKEGKFSSFHDDGKQVNSFAGFELASNREGHSWRNWIFYQG
FDLNGVLSFKELNVTYVANKSPLEKTGLILSSTQSLYRVPDLWT

>gi568815597r:35084459_35293049|GENSCAN_predicted_CDS_2|3555_bp
atgagagatcatgtggaatggagccaagtcaacccagctgaggccatcctagaccagcca
gctgccagtcaaccctgtagctgcttgcagattcattacaaagtccagaagcatcagatc
ctcactcccaggtgggctctgaagaggcagcaggcacgtggcttgtggtctaaattcagc
acgcagtacaaaggcagtgggggagggctgcaaggtgggcgatgccggactgcagaggag
gcggggccgtgccagctggcctatataagacgaggacaaaggcggcgcgccgcctgtgtc
atccgccattttgtgagaagcaaggtggcctccacgtttcctgagcgtcttcttcgcttt
tgcctcgaccgccccttgaccacagacatgtctcgggatcggttccggagtcgtggcggt
ggcggtggtggcttccacaggcgtggaggaggcggcggccgcggcggcctccacgacttc
cgttctccgccgcccggcatgggcctcaatcagaatcgcggccccatgggtcctggcccg
ggccagagcggccctaagcctccgatcccgccaccgcctccacaccaacagcagcaacag
ccaccaccgcagcagccaccgccgcagcagccgccaccgcatcagccgccgccgcatcca
cagccgcatcagcagcagcagccgccgccaccgccgcaggactcttccaagcccgtcgtt
gctcagggacccggccccgctcccggagtaggcagcgcaccaccagcctccagctcggcc
ccgcccgccactccaccaacctcgggggccccgccagggtccgggccaggcccgactccg
accccgccgcctgcagtcacctcggcccctcccggggcgccgccacccaccccgccaagc
agcggggtccctaccacacctcctcaggccggaggcccgccgcctccgcccgcggcagtc
ccgggcccgggtccagggcctaagcagggcccaggtccgggtggtcccaaaggcggcaaa
atgcctggcgggccgaagccaggtggcggcccgggcctaagtacgcctggcggccacccc
aagccgccgcatcgaggcggcggggagccccgcgggggccgccagcaccacccgccctac
caccagcagcatcaccaggggcccccgcccggcgggcccggcggccgcagcgaggagaag
atctcggactcggaggggtttaaagccaatttgtctctcttgaggaggcctggagagaaa
acttacacacagcgatgtcggttgtttgttgggaatctacctgctgatatcacggaggat
gaattcaaaagactatttgctaaatatggagaaccaggagaagtttttatcaacaaaggc
aaaggattcggatttattaagcttgaatctagagctttggctgaaattgccaaagccgaa
ctggatgatacacccatgagaggtagacagcttcgagttcgctttgccacacatgctgct
gccctttctgttcgtaatctttcaccttatgtttccaatgaactgttggaagaagccttt
agccaatttggtcctattgaaagggctgttgtaatagtggatgatcgtggaagatctaca
gggaaaggcattgttgaatttgcttctaagccagcagcaagaaaggcatttgaacgatgc
agtgaaggtgttttcttactgacgacaactcctcgtccagtcattgtggaaccacttgaa
caactagatgatgaagatggtcttcctgaaaaacttgcccagaagaatccaatgtatcaa
aaggagagagaaacccctcctcgttttgcccagcatggcacgtttgagtacgaatattct
cagcgatggaagtctttggatgaaatggaaaaacagcaaagggaacaagttgaaaaaaac
atgaaagatgcaaaagacaaattggaaagtgaaatggaagatgcctatcatgaacatcag
gcaaatcttttgcgccaagatctgatgagacgacaggaagaattaagacgcatggaagaa
cttcacaatcaagaaatgcagaaacgtaaagaaatgcaattgaggcaagaggaggaacga
cgtagaagagaggaagagatgatgattcgtcaacgtgagatggaagaacaaatgaggcgc
caaagagaggaaagttacagccgaatgggctacatggatccacgggaaagagacatgcga
atgggtggcggaggagcaatgaacatgggagatccctatggttcaggaggccagaaattt
ccacctctaggaggtggtggtggcataggttatgaagctaatcctggcgttccaccagca
accatgagtggttccatgatgggaagtgacatggcatatagtgaaggtgaatgtcttctc
cctcagaaaaaaattggttccttgctgtcccagtggatagatacttctcaagacatcaca
cagcgtgagtcaatcaaggagggaagccacaagcagactgacaacgtttctagaagcgag
gtgattggtggatggtcaaggcagctccttaggcagcttcttaggcagcttaagcctgcc
ctgtggaacatccctgcgggggactccgaccagaccagcccaagcgacgtggatcctgag
agcgttcctggtcagctaacactggcagctcaggacaagtggaccagccagcaggatgca
gatcacccaacctggctgatcctgagacgagaccaagctgctgctgaggacgacagccct
gctctggtcactccggaggctgaccagtctacgcacggccgaagcttgaggactcatcga
gcaaacccatctgacctctcccctcctcgccaggccgagccacgtcccaattcttcctca
gcctccgctcctccaccctataatccttttatcacctcccctcctcacacctggtccggc
ttacagtttcgttcctcaactagccttctcccacctgcccagcaatttcttcttaaaaag
gtggctggagctaaaggcatactcaaggttaatgctcctttttctttatccaacctctcc
gaaaatcagttagcgtttaggccctttttcatcaaaagccccctaaaccagcacggacgc
cgagtttcgggtaactctcacatggaggggtcctccatcattaatgcctctttaataaaa
actctgctcaaggccgctttacttccaaaggaagctggagtcattcactgcaaaggccat
caaaaggcatcagatcccattgctctagacaacgcttatgctgtttcacttgcttccaac
aaggaaggcaaattttccagcttccatgatgatggaaaacaggttaactcctttgccggg
tttgagcttgcttccaatagggaaggacacagttggagaaactggatattttaccaaggc
tttgacttgaatggtgtgctttcctttaaggaattgaacgtgacttatgtagccaataaa
agccccttggaaaaaactggcctcatactttcatctacacagtccctgtacagggttcct
gacctgtggacatga

>gi568815597r:35084459_35293049|GENSCAN_predicted_peptide_3|182_aa
XHPDYDTVLRQPKLIRWSTDFQQGFQDHSMGKEQALQQKRAGHVAGQPYAEDTSSVTHGS
HQSSQQKRGMEMGLSVRNCEESSRLMIWTPLISPGPERARISHKGARVDVSYEALWQSRR
RRKRRKKEKRKKRRKEEEGGGEEGGGGEEGEEGEGREEEEEGKKEKERKKEREREKMKDE
HS

>gi568815597r:35084459_35293049|GENSCAN_predicted_CDS_3|549_bp
nnccacccagactatgatactgtgttacgacagcccaaactgatacgatggtcaactgac
tttcaacaaggtttccaggaccactcaatggggaaagaacaggctcttcaacaaaagaga
gcaggccacgtggctggacaaccatatgctgaagatactagcagtgtgactcatggatcc
catcaatcatctcaacagaaacgaggaatggagatggggttatcagtaaggaactgtgaa
gaatcctctcgtctaatgatatggacccccttgatatcccccggtcctgagcgtgcgcga
ataagccataaaggagcaagggtggatgtaagttacgaggctttgtggcagtccaggaga
aggagaaaaaggaggaagaaggagaagaggaagaagaggaggaaggaggaagaaggagga
ggagaagaaggaggaggaggagaagaaggagaagaaggagaaggaagagaggaggaggaa
gaaggaaagaaagaaaaagaaagaaagaaagaaagagaaagagaaaagatgaaagatgag
cacagttaa