GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:00:25 Sequence gi568815589f:96046009_96247853 : 201845 bp : 46.40% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1935 2014 80 0 2 145 80 41 0.890 8.19 1.02 Intr + 3190 3355 166 2 1 80 30 48 0.278 -2.78 1.03 Intr + 5568 5697 130 0 1 44 99 55 0.259 2.90 1.04 Intr + 13215 13459 245 2 2 80 39 163 0.334 6.90 1.05 Intr + 24249 24415 167 0 2 81 31 136 0.253 6.80 1.06 Intr + 30041 30150 110 0 2 72 50 73 0.390 1.90 1.07 Intr + 34213 34396 184 0 1 79 105 41 0.113 4.26 1.08 Intr + 36325 36530 206 2 2 52 75 48 0.199 -1.18 1.09 Intr + 39567 39795 229 2 1 55 78 117 0.245 4.84 1.10 Intr + 47704 47800 97 2 1 102 88 123 0.436 12.77 1.11 Term + 48990 49068 79 0 1 90 37 88 0.899 1.14 1.12 PlyA + 50685 50690 6 1.05 2.00 Prom + 51139 51178 40 -6.26 2.01 Init + 53216 53329 114 1 0 86 77 58 0.166 4.72 2.02 Intr + 55526 55741 216 1 0 49 29 130 0.023 1.90 2.03 Intr + 70297 70404 108 0 0 120 39 85 0.002 7.28 2.04 Intr + 71745 71853 109 2 1 3 80 89 0.001 -0.54 2.05 Intr + 82296 82477 182 1 2 82 73 71 0.424 4.59 2.06 Intr + 94902 95082 181 2 1 81 52 84 0.588 3.54 2.07 Intr + 100300 100970 671 2 2 74 35 660 0.764 50.72 2.08 Term + 101057 101848 792 1 0 23 41 475 0.750 29.60 2.09 PlyA + 101935 101940 6 1.05 3.00 Prom + 102379 102418 40 -1.66 3.01 Init + 106873 106919 47 0 2 71 97 39 0.816 3.26 3.02 Intr + 121158 121308 151 0 1 97 37 90 0.133 4.96 3.03 Intr + 121894 121995 102 0 0 19 66 97 0.153 0.97 3.04 Intr + 122213 122282 70 1 1 33 52 71 0.389 -3.25 3.05 Intr + 122921 123115 195 0 0 91 94 65 0.577 6.79 3.06 Term + 123355 123416 62 2 2 65 43 62 0.577 -2.63 3.07 PlyA + 123877 123882 6 1.05 4.14 PlyA - 124300 124295 6 1.05 4.13 Term - 126267 125979 289 2 1 35 48 155 0.392 1.05 4.12 Intr - 126579 126501 79 1 1 77 92 49 0.391 2.81 4.11 Intr - 127877 127793 85 2 1 92 45 85 0.278 3.99 4.10 Intr - 133852 133806 47 2 2 106 77 10 0.011 -0.17 4.09 Intr - 135074 134964 111 0 0 61 34 111 0.010 3.35 4.08 Intr - 137889 137774 116 2 2 86 18 73 0.004 0.19 4.07 Intr - 159810 159739 72 2 0 87 41 74 0.085 1.12 4.06 Intr - 163699 163641 59 1 2 77 105 37 0.177 1.98 4.05 Intr - 168478 168424 55 0 1 107 97 49 0.548 6.68 4.04 Intr - 186929 186733 197 2 2 102 58 57 0.017 2.31 4.03 Intr - 189562 189461 102 1 0 98 77 95 0.928 9.77 4.02 Intr - 198386 198321 66 2 0 100 94 86 0.924 9.50 4.01 Intr - 200582 200548 35 0 2 93 116 0 0.298 1.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 61337 61272 66 2 0 135 39 81 0.956 5.84 S.002 Intr + 137836 137906 71 2 2 125 94 40 0.832 7.20 S.003 Term + 139367 139447 81 1 0 113 54 55 0.802 2.29 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:96046009_96247853|GENSCAN_predicted_peptide_1|564_aa XDGFDISNCSMQPGRPGFSGCAGDAGPGWHQTPAHQTSYHQLEVAFWFMEQDGTVKSVVL MGQSQVSAPAPPEPKEASGGPARILLIGAFYRTLIGAFYRVLIGAFYRALIGAFHRALIG AFYNPLFSKQLDVYISEFDASGVLSSEVSAFSEEMNNLEATAQKPSLAHLYKKKEVLVCA TMDDPNQIVLSERSQTWKTAYNMIPHQTSNQAGKFQQKEIPVNEGKRHGTQEIVPTTHES GNSKSRNNSCAADPDISHRRGGLSNTIHSFHGLKILDTNWGNCLVYQDFLNPLCGRQATQ DIRYYLFSLTPTCPVLFQISAPEIVTLFLKVACSELCICIFRHPRKYGSSCSNHGNDILS KAPSSHIHITATSAPIAAHCSRLDCGEVCIHLSWETSEWLFSSQHLNGSKNLDCPLEYSG CGILGMSAFCLITDSGSGHSPHLTFKASGRGPCAGSDHTLTGMQPHLPLHVQLRPPGVRG VFSFPHLTWSPPVSLPPDAGSSLVVTECAAMKHTVEGDMTPMTREQVEAELLHAGFVCSG SFPNGTLRGKELVHVHAATICPQR >gi568815589f:96046009_96247853|GENSCAN_predicted_CDS_1|1695_bp nntgacggatttgacataagcaactgctccatgcagccaggccggccaggcttctcaggc tgtgctggggacgcagggcccggatggcaccagacacccgcccaccagacttcataccac cagctggaagtggctttctggtttatggagcaagacggaactgtaaagtcagtcgttctc atgggccagagtcaagtcagcgcccccgccccacccgagccaaaggaggcttctggaggc cccgccagaattctgctgattggtgcgttttacagaacactgattggtgctttttacaga gtgctgattggtgcattttacagagcactgattggtgcctttcacagagcgctgattggc gcgttttacaatcctctgttctcaaaacaactggatgtgtacatttctgagtttgatgcc agtggggttttgagctcagaggtctcagccttcagtgaagaaatgaacaacctagaggcc actgcccagaagccatctttggctcatttatacaagaagaaggaagtgctggtgtgtgct acaatggatgaccccaaccaaattgtgctaagtgaaagaagccagacatggaagaccgca tacaatatgattccacaccaaacgagtaatcaagcaggaaagttccagcaaaaagaaatt ccagtaaatgaaggaaaaagacatggaacccaggaaatagtgcctacaacccatgagagt ggtaacagcaagtccaggaataacagctgtgctgcggaccctgatatcagccataggcga ggaggcctttctaacaccattcatagcttccatggattgaagatcctagacacaaactgg ggcaactgcctggtctaccaggatttccttaaccctctgtgtgggcggcaagccacccag gatattcggtattacctattttccctaactcctacttgtcctgtactttttcaaatatca gctcccgaaattgtcaccctcttccttaaggtggcttgctctgagctctgtatctgcatt ttccgacatcccagaaagtacggtagttcctgttctaatcatggcaatgacatactcagt aaagccccttccagccacattcacatcactgcaacctctgctcccattgctgcccattgc agtcgactggactgcggagaggtgtgcatccacctttcctgggaaacatctgaatggctt ttttccagccagcacctaaacggctctaagaacctggattgtcccctggaatattctgga tgtggtatcttgggtatgtctgctttctgtctcatcactgacagcggctcaggccacagt cctcacctcacattcaaggcctccggccgtgggccttgtgctgggagtgaccacactctc accggcatgcagccgcacctgcccctccacgtccagctgcgtccacctggggtcagggga gtcttcagcttcccgcacctgacctggagccctcccgtgagcctccctccagatgcgggc agctcacttgtggtcactgagtgtgctgcaatgaagcacacagttgagggggacatgacc cccatgacccgtgagcaggtagaagcagaattgctgcatgcgggcttcgtgtgttcagga agttttccgaatggcactctgcgtgggaaggagcttgtgcacgtgcacgctgccaccatc tgccctcagcgctga >gi568815589f:96046009_96247853|GENSCAN_predicted_peptide_2|790_aa MAHAGKEQVLRSSPSHLPENCCSGHCGTIGSVDATGMVEIRHSKSQDEIGGRHKIQITKT LLIKQVAVKKPAKTHQNQDGDESDSGRPHCYTPISAMTVYRCHSNVWKLPALTQGWMGAG AGGPGIASGLHNKTLSTYESNFQSPSERDKKTLAHKSEENMKGVTAERGQLNDCKRKSDL AVTSPRKSYPGSSQLRLKHDLFGKPLIPLGYVPIATFCKTHQTCNEFSIVWLLHEMGSSM KARCCRSGVLKAGGLARNREAWAAARGGTSSWSTLDQSRPGQSLARSPSEVELDPLHAQA EEQGNLPYDVTEESIKEFFRGLNISAVRLPCEPSNPVRLKGFGYAEFEDLDSLLSALSLK EESLGNRRIRVDVADQAQDKDRDDCSFGRDRNRDSDKTGTDWRARPATDSFDDYPPRRGD DSFGDKYRDRYDSDRYRDGYRDGYRDGQRRDMDPYGGRDRYDDRGSRDYDRGYDSRIGSG RRAFGSGYRRDDVSEEAGTTMKTDMTDGMIGLGAPEMITLWMIIGVMSTRAASIFGGAKP VDTAAREREVEERLQKEQEKLQHQLDEPKLERRPPERHPSWRSEETQERERSRTGSESSQ TGTSTTSGRSKSAQDARRRENEKSLENETLNKEEDCHSPTSKPPKPDQPLKVMPAPPPKE NAWVKRSSNPPARSQSSDTEQQSPTSGEGKVAPAQPSEEGPARKDENKVDGMNVPKGQTG NSSRGPGDGGNKDHWKESDRKDGKKDQDSRSAPEPKKPEENPASKFSSASKYAALSVDGE DENEGEDYAE >gi568815589f:96046009_96247853|GENSCAN_predicted_CDS_2|2373_bp atggcccacgcagggaaggagcaggtcttgcgctcctctccatcccacttgccagaaaat tgctgcagtggacactgtggcactataggttctgtggacgccacgggcatggtggagatt aggcattctaagtcacaggatgagataggaggtcggcacaagatacagatcacaaagacc ttgctgataaaacaggttgcagtaaagaagccggccaaaacccaccaaaaccaagatggt gacgagagtgactctggtcgtcctcactgctacactcccatcagcgccatgacagtttac agatgccatagcaacgtctggaagttacctgctttgacccagggctggatgggagctgga gcaggaggcccaggcattgcatctggtctccacaacaagacgctcagtacctacgagtcc aactttcagtctccaagtgaaagggacaaaaagacattggcgcacaaaagcgaagagaac atgaaaggagtcacagcagagagaggtcaactgaatgactgcaagaggaagtcagatcta gcagtgacttcaccacgaaaatcctatcccgggagttctcagttgcgacttaaacatgac ttatttgggaagcctttgatccccctcggatatgttcccattgcaaccttctgtaaaacc catcagacttgcaatgaattttctattgtctggcttctccatgaaatgggaagctccatg aaagctaggtgctgtcgaagtggcgtcctgaaagccgggggccttgcccggaacagggag gcttgggctgcggccaggggaggcaccagcagctggagcacactagaccagagccggcca ggacagagcctggcccgatctccctcggaggtagaactggacccccttcatgctcaggct gaggagcagggaaacctaccctatgatgttacagaagagtcaattaaggaattctttcga ggattaaatatcagtgcagtgcgtttaccatgtgaacccagcaatccagtgaggttgaaa ggttttggttatgctgaatttgaggacctggattccctgcttagtgccctgagtctcaag gaagagtctctaggtaacaggagaattcgagtggacgttgctgatcaagcacaggataaa gacagggatgattgttcttttggccgtgatagaaatcgggattctgacaaaacaggtaca gactggagggctcgtcctgctacagacagctttgatgactacccacctagaagaggtgat gatagctttggagacaagtatcgagatcgttatgattcagaccggtatcgggatgggtat cgggatgggtatcgggatggccaacgccgggatatggatccatatggtggccgggatcgc tatgatgaccgaggcagcagagactatgatagaggctatgattcccggataggcagtggc agaagagcatttggcagtgggtatcgcagggatgatgtctcagaggaggcggggaccact atgaagaccgatatgacagacgggatgatcggtcttggagctccagagatgattactctc tggatgattataggcgtgatgtccactcgagctgcttctatctttggaggggcaaagccc gttgacacagctgctagagaaagagaagtagaagaacggctacagaaggaacaagagaag ttgcagcatcagctggatgagccaaaactagaacgacggcctccggagagacacccaagc tggcgaagtgaagaaactcaggaacgggaacggtcgaggacaggaagtgagtcatcacag actgggacctccaccacatctggcagaagtaagtcagcccaggatgcacgaaggagagag aatgagaagtctctagaaaatgaaacactcaataaggaggaagattgccactctccaact tctaaacctcccaaacctgatcagcccctaaaggtaatgccagcccctccaccaaaggag aatgcttgggtgaagcgaagttctaaccctcctgctcgatctcagagctcagacacagag cagcaatcccctacaagtggtgagggaaaagtagctccagctcaaccatctgaggaagga ccagcaaggaaagatgaaaacaaagtagatgggatgaatgtcccaaaaggccaaactggg aactctagccgtggtccaggagatggagggaacaaagaccactggaaggagtcagatagg aaagatggcaaaaaggatcaagactccagatctgcacctgagccaaagaaacctgaggaa aatccagcttccaagttcagttctgcaagcaagtatgctgctctctctgttgatggtgaa gatgaaaatgagggagaagattatgccgaatag >gi568815589f:96046009_96247853|GENSCAN_predicted_peptide_3|208_aa MNKHRDFQVALIERDSEKCRALLSFEPNARAKAGLLLEVRLPQLRSSHARRKRQQASSWT FFQVPQRLYIQLEESEVNMIITKANIPPMTSQERKRCVEMTQHEKDSIERAPVQAASGIH WEPGTLQYTEEIKASFQKPAVQLRTIQACACTAPPKDLPMQMAGPLSTASVSAGLGQGLR ISISNKAPGKALVTVSSGTAQLPGMTKR >gi568815589f:96046009_96247853|GENSCAN_predicted_CDS_3|627_bp atgaataaacacagagactttcaagttgccttgattgaacgggacagcgagaaatgcaga gctctcctgtcatttgaacctaatgctcgggccaaggcaggcctgctcctcgaggtccgc ctacctcagcttagatccagccatgcgcgcaggaagcggcagcaggccagctcatggacg ttcttccaggtccctcaaaggctgtacatccagctagaagaatccgaagtcaacatgata atcacaaaggcaaacattccaccaatgacaagtcaagagcgcaagagatgtgttgagatg acccagcacgagaaggacagcattgaaagagcccctgtccaggcggcttctggcattcac tgggaaccagggaccctacagtacacagaagagataaaagcaagcttccagaaacctgca gtccagcttcgcactatacaagcctgcgcgtgcacggcgcctcccaaggacttgcccatg cagatggctgggcccctttccacagcatctgtttccgcaggtctgggtcagggcctgaga atcagcatttctaacaaggctccagggaaggccctggtcacagtctcctcagggacagct cagctgcctggaatgaccaagcgctga >gi568815589f:96046009_96247853|GENSCAN_predicted_peptide_4|437_aa XDTANSETYGIKAFVCAFSKALQEEYKAKEVIIQAGFLSLIPAWAFYSGAFQRLLLTHYV AYLKLNTKPSICATSQEAEGRRVPFGWHWEQENEVIMGCCSKKYWQLLLGRLPGVSSLSC SCGWEPEHPTSKTLLMHSASHVGIGDEADYVAEVNAGCAPVECKGILRTSKREIRSNTKM KVLEMQPNNKKKPMAGVIIRLQRATSWRCSLNVTCEEEQWSRGMQRKKRYRGRRDCGMYS QEITDSQDAWQQVGGQNFWRWNLLDSVSPEKVPSEAPCYPSTMGIVMSMRTVCSSKRDLP AAMAVSEAKGKGKPKEYSGQANFPDSRNQCKVGHKDPQGLVINAAMPSKAHGPLYPLPVE PEPKTLPFHLTNVFIEDLEYSKSYSRPVDFKCGLQTSSFGITQKPVSNANSQAPSLAKPE ALGVSPASVSNKPTRRL >gi568815589f:96046009_96247853|GENSCAN_predicted_CDS_4|1314_bp natgacacagctaattctgaaacatatggaatcaaggcgtttgtgtgcgcattttccaag gccctgcaagaggaatataaagcaaaagaagtcatcatccaggcgggctttctgagcctg atcccggcctgggccttctacagcggtgccttccaaaggctgctcctgacacactatgtg gcatacctgaagctcaacaccaagccttccatctgtgctacctcccaggaagccgaaggc cgcagagtccctttcggatggcactgggagcaggaaaatgaggtgattatgggctgctgc tccaagaagtattggcagctgttgctggggcggctccctggggtgtcatccctttcttgc tcttgtggatgggaaccagagcaccccacttcaaagactctgctcatgcactctgccagt catgtgggcattggcgatgaggcagactatgttgccgaagttaatgcaggatgtgctcca gtggaatgtaaaggtatcctacgaacaagcaagagagagattagatcaaacaccaagatg aaagtgttggagatgcagcccaataacaaaaagaaacccatggctggggtaattataagg cttcagcgagctacgtcctggaggtgctccctgaatgttacctgtgaggaggagcagtgg agcaggggaatgcagaggaagaagaggtatcgtggaagaagggattgtgggatgtacagc caggagatcacagactctcaggatgcctggcagcaagtggggggccagaatttctggcgg tggaacttactggacagcgtcagccctgagaaggttccctctgaagctccctgttaccca agcacaatgggaattgtaatgagcatgcggacagtctgctcctcaaagagggacttgcct gcagccatggctgtttcagaggccaaaggaaaagggaaaccgaaagaatacagcggccaa gcaaactttcctgacagcagaaatcaatgcaaggtgggccataaagacccacaaggcctt gttataaatgctgcaatgccgtccaaagctcacggccctctctatcctttaccagtagaa ccagaacccaagacactgccttttcatttaaccaacgtatttattgaggacctggaatac tccaagtcttattctaggccagtggatttcaaatgtggtctccagaccagcagcttcggc atcacccagaaacctgtgagcaatgcaaactctcaggcccccagccttgctaagccagaa gctctgggggtcagccctgcatctgtgtctaacaagcccaccaggcgactctga