GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:32:13

Sequence gi568815577f:36906773_37118204 : 211432 bp : 43.73% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -   7914   7774  141  0  0   93   92    42 0.593   5.65
 1.05 Intr -  15715  15680   36  1  0  101   79    32 0.081   2.06
 1.04 Intr -  23661  23479  183  0  0  122  105   -23 0.132   2.78
 1.03 Intr -  30620  29677  944  0  2   61   78   796 0.647  66.78
 1.02 Intr -  32222  32060  163  2  1  107   75    88 0.553   8.95
 1.01 Init -  59866  59672  195  1  0   90   80   338 0.961  30.13
 1.00 Prom -  81770  81731   40                              -2.86

 2.00 Prom +  85330  85369   40                              -6.26
 2.01 Sngl +  88049  88303  255  1  0   87   55   265 0.643  18.21
 2.02 PlyA +  88912  88917    6                               1.05

 3.00 Prom +  88950  88989   40                              -4.66
 3.01 Init +  97708  97742   35  0  2   67   89    48 0.472   2.15
 3.02 Term +  99172  99322  151  1  1   38   48   149 0.494   3.28
 3.03 PlyA + 101209 101214    6                              -0.45

 4.04 PlyA - 102639 102634    6                               1.05
 4.03 Term - 103219 103002  218  2  2   46   38   275 0.614  15.61
 4.02 Intr - 103989 103852  138  1  0   42   91    57 0.684   1.84
 4.01 Init - 109520 109514    7  2  1   97   98     0 0.488   2.93
 4.00 Prom - 117624 117585   40                               2.44

 5.00 Prom + 136135 136174   40                              -1.36
 5.01 Init + 143451 143618  168  2  0   94   45   158 0.575  11.63
 5.02 Intr + 166335 166592  258  2  0   39  100   281 0.951  22.06
 5.03 Intr + 166652 166701   50  1  2   76   89    17 0.838  -1.92
 5.04 Intr + 166814 166925  112  2  1   39   90    76 0.262   3.28
 5.05 Intr + 167089 167203  115  0  1   84   23    19 0.015  -4.98
 5.06 Intr + 167262 167396  135  1  0   76   43    65 0.023   1.34
 5.07 Intr + 180475 180629  155  2  2   98  103   136 0.935  15.89
 5.08 Intr + 188578 188672   95  0  2   69   82    52 0.232   1.46
 5.09 Intr + 194834 194906   73  2  1  102   78    10 0.053   0.81
 5.10 Term + 203395 203745  351  0  0   70   42   328 0.119  20.89
 5.11 PlyA + 204203 204208    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 158940 158810  131  1  2   79   43    92 0.946   2.14
S.002 Intr - 160608 160490  119  2  2   64   93    78 0.945   6.01
S.003 Init - 160863 160856    8  0  2   75  107     0 0.910   1.06
S.004 Init + 178795 178855   61  0  1   81   72    35 0.866   2.61
S.005 Sngl + 203416 203745  330  0  0   71   42   314 0.879  21.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815577f:36906773_37118204|GENSCAN_predicted_peptide_1|554_aa
MLITLCYLYLWARWGRRPAELVRATVRRLRASRCSFTFCGAAAQPPGARVCLSRGGRVFC
VSDSQIVKWSDCCLPLACRPGDPYRLIAEASVDNFSKLGVAFMEDRLHMDNGLVPQKIVS
VHLQDSTLKEVKDQVSNKQAQILEPKPEPSLEIKPEQDGMEHVGRDDPKALGEEPKQRRG
SASGSEPAGDSDRGGGPVEHYHLHLSSCHECLELENSTIESVKFASAENIPDLPYDYSSS
LESVADETSPEREGRRVNLTGKAPNILLYVGSDSQEALGRFHEVRSVLADCVDIDSYILY
HLLEDSALRDPWTDNCLLLVIATRESIPEDLYQKFMAYLSQGGKVLGLSSSFTFGGFQVT
SKGALHKTVQNLVFSKADQSEVKLSVLSSGCRYQEGPVRLSPGRLQGHLENEDKDRMIVH
VPFGTRGGEAVLCQVHLELPPSSNIVQTPEDFNLLKSSNFRRYEVLREILTTLGLSCDMK
QVPALTPLYLLSAAEVQHAEFGHTAIQNRFRFYSEPVWLGGGVMPIKPALLEMTYSLPAF
TVLLKPFVVFASWQ

>gi568815577f:36906773_37118204|GENSCAN_predicted_CDS_1|1662_bp
atgctcatcacgctgtgctacctgtacctgtgggcgcgctggggtcgccggccggctgag
ctcgtgcgcgccacggtgcggcggctgcgtgcctcgcgctgttccttcaccttctgcggc
gcggccgcgcagcccccgggcgcccgcgtgtgcctgagccgtggcggccgcgtcttctgc
gtcagcgacagccagattgtcaagtggtcagactgttgtttgccattagcttgcagacct
ggggatccttatcggctaattgctgaagcaagtgtggacaacttcagcaagctgggggtg
gcgttcatggaagatagactccacatggataatggactggtaccccaaaagattgtgtcg
gtgcacttgcaggactccactctgaaggaagttaaggatcaggtctcaaacaagcaagcc
cagatcctagagccgaagcctgaaccttctcttgagattaagcctgagcaggacggtatg
gagcatgttggcagagatgacccaaaggctcttggtgaagaacccaaacaaaggagaggc
agtgcctctgggagtgagcctgctggggacagtgacaggggagggggccccgttgagcat
tatcacctccatctgtctagttgccacgagtgtctggaacttgagaacagcaccattgag
tcagtcaagtttgcgtctgccgagaacattccagaccttccctacgattatagcagcagt
ttggagagtgttgctgatgagacctcccccgaaagagaagggaggagagtcaacctcacg
ggaaaggcacccaacatcctcctctatgtgggctccgactcccaggaagccctcggccgg
ttccacgaggtccggtctgtgctggccgactgtgtggacattgacagttatattctctac
cacctgctggaggacagtgctctcagagacccgtggacggacaactgtctgctgttggtc
attgctaccagggagtccattcccgaagacctgtaccagaagttcatggcctatctttct
cagggagggaaggtgttgggcctgtcttcatccttcacctttggtggctttcaggtgaca
agcaagggtgcactgcacaagacagtccagaacttggttttctccaaggctgaccagagc
gaggtgaagctcagcgtcttgagcagtggctgcaggtaccaggaaggccccgtccggctc
agccccggcaggctccagggccacctggagaatgaggacaaggacaggatgattgtgcat
gtgccttttggaactcgcgggggagaagctgttctttgccaggtgcacttagaactacct
cccagctccaacatagtgcaaactccagaagattttaacttgctcaagtcaagcaatttt
agaagatacgaagtccttagagagattctgacaacccttggcctcagctgtgacatgaaa
caagttcctgccttaactcctctttacttgctgtcagctgcggaggtacagcatgccgag
tttggccacacagctatccagaatagattccgcttctacagtgagcctgtttggcttggt
ggaggtgtgatgcctataaaaccagccttattagaaatgacttattcactccctgctttt
actgttctactcaagccttttgtggtcttcgctagttggcag

>gi568815577f:36906773_37118204|GENSCAN_predicted_peptide_2|84_aa
MRTLWINRITAATQEHGPKYPAFTGNLIKCQVKRHKKVPANLAIYEPETFKSLAALANRR
RQEEFAAALGDGKAAYFSRMVQYR

>gi568815577f:36906773_37118204|GENSCAN_predicted_CDS_2|255_bp
atgaggaccctctggattaatcgaattacagctgctacccaggaacacggcccgaagtac
ccagcgttcactggcaatttaattaagtgccaggtgaagcgccacaagaaagtcccagcg
aatctggccatctacgaaccagagacttttaaatctttggctgccttggccaataggagg
cgacaggaagaatttgctgctgccttaggagatgggaaggcagcatatttttccagaatg
gtacagtaccgctga

>gi568815577f:36906773_37118204|GENSCAN_predicted_peptide_3|61_aa
MSYCSLNFLELRVAAPHPTRFATGSSPGVGASGSPLSSTEGKAGARGPEDSVDVPEDPLP
L

>gi568815577f:36906773_37118204|GENSCAN_predicted_CDS_3|186_bp
atgtcttactgcagcctcaacttcctcgagctcagggtcgccgcgccccaccccacccgc
ttcgccacgggttcgagccctggcgtcggggcgtccgggagcccactgtccagcaccgaa
ggcaaggccggtgcacgcggacccgaggattcggtagatgtccccgaagacccgctgccg
ctctaa

>gi568815577f:36906773_37118204|GENSCAN_predicted_peptide_4|120_aa
MEGLRSGSLADRAGPEEQTVPDVFQETASLTVPASGQEGNATEIKKGAGNPGRCTVPEHR
LALVALRNQLGSAKSGLRARGEDWRALQPPGNNHSSHTHIRGQNYFSVTGYLLISEKVNV

>gi568815577f:36906773_37118204|GENSCAN_predicted_CDS_4|363_bp
atggaggggctaagaagtgggagcctggcagatagggcaggcccagaagaacaaacagtc
cctgatgttttccaggaaactgcctccctgacagtgcccgcaagtggccaggagggaaac
gcaacagagatcaagaaaggtgcagggaacccagggcggtgcacagtaccagagcacaga
ctggcgctggtcgctctgcgcaaccagctgggctctgcaaagagcgggctccgggctagg
ggtgaggattggcgggctctacagcccccagggaacaaccactcttcccacacccacatc
cgtggacagaactacttcagcgtgacaggctacctgctgatcagcgagaaagttaacgtt
taa

>gi568815577f:36906773_37118204|GENSCAN_predicted_peptide_5|503_aa
MEIQKIIQGYYEHLYAHKLENLEEMDKFLEKHNPPSFNQEELDTLNRPITSSKIEMAHND
PPRVGASERGGGRDPAARDGEGGGGGVRLRTRAASPRDAPRFNAATRSLPPSADVEGRRW
RRRRRRLLLLLPASEARGRRARPRCRGSGAFHTPSVGVCGFAPLVWCPFASEALGCRCEG
VSSPFVRNPPLPVPRPRAWPRLLQLRPPPRALLAQVCVLACLRVTQSVCAWVLLSFVGER
ERSCTPLAVCACHDSVATCFRVNWQSSPGPVPTERTQEGDLCTMDNFAEGDFTVADYALL
EDCPHVDDCVFAAEFMSNDYVRVTQLYCDGVEGELMKMKGNEEFSKERFDIAIIYYTRAI
EYRSLCIPIIFYAYTSIYVEPDEIAIFERSSSPAMEQSWTENDFDELREEGFRRSNYSEL
KEEVRTNGKEVRNFEKKLDEWITRITNAEKSLKDLMELKTTARELCDKCTNLSNRCDQLE
ERVSAMEDEMNEMKHEDKFREKE

>gi568815577f:36906773_37118204|GENSCAN_predicted_CDS_5|1512_bp
atggaaatacaaaagatcattcaaggctactatgaacacctttatgcacataaactagaa
aacctagaagagatggataaattcctggaaaaacacaaccctccaagcttcaatcaggaa
gaattagataccctgaacagaccaataacaagcagcaagattgaaatggcgcacaacgac
cctccccgggtgggagccagcgagcgcgggggcgggcgtgatcccgcggctcgcgacggc
gaaggagggggcggtggggtgcgactgcgcacgcgcgctgccagcccacgtgacgcacct
cgcttcaacgccgccacccgctccctcccgccgtcggctgacgtggagggccggaggtgg
cggcggcggcggcggcggctgctgctgctgctgcccgcgtccgaggctcgcgggcggcgg
gcccggccgagatgccgggggagcggggccttccacaccccctccgtgggtgtgtgtggg
tttgcccccttggtctggtgccccttcgcgagcgaggccctgggatgccggtgcgagggt
gtgagtagtccctttgtgcgcaaccccccgctgcccgtgcctcggccgcgcgcgtggccg
cgccttctgcaattgcgacccccgcctagagccttgctcgcgcaagtgtgcgtccttgcg
tgtctccgagtgacgcagagtgtctgcgcatgggtcctgttgagtttcgttggtgagcgt
gaacgttcgtgcacaccccttgctgtgtgtgcgtgtcacgactctgttgccacctgtttt
cgtgtgaactggcagtccagtccagggcctgtccccacggaacggacccaggagggggac
ttgtgcaccatggacaattttgctgagggagatttcactgtggcggattatgccttgtta
gaagattgccctcacgtggatgattgtgtctttgctgctgaatttatgagcaatgattat
gttcgtgtgactcagctttactgtgatggggtggaaggagaactaatgaaaatgaaagga
aatgaagagttttccaaagaaagatttgatatagctattatctattacaccagagccatt
gaatatagatccttgtgtattcccataatattttatgcctatacaagcatttatgttgaa
ccagatgaaattgccatttttgaacgcagctcctcaccagcaatggaacaaagctggacg
gagaatgactttgacgagttgagagaggaaggcttcagaagatcaaactactccgagcta
aaggaggaagttcgaaccaatggcaaagaagttagaaactttgaaaaaaaattagatgaa
tggataactagaataaccaatgcagagaagtccttaaaggacctgatggagctgaaaacc
acggcacgagaactatgtgacaaatgcacaaacctcagtaaccgatgcgatcaactggaa
gaaagggtatcagcgatggaagatgaaatgaatgaaatgaagcatgaagataagtttaga
gaaaaagaataa