GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:07:54 Sequence gi568815590r:66076890_66277477 : 200588 bp : 40.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2973 3035 63 2 0 71 82 35 0.478 2.40 1.02 Intr + 3488 3659 172 1 1 124 81 185 0.687 19.99 1.03 Intr + 36143 36495 353 0 2 59 40 255 0.446 11.92 1.04 Intr + 36722 36978 257 1 2 51 55 255 0.548 13.82 1.05 Intr + 37702 37890 189 1 0 17 105 180 0.545 10.38 1.06 Intr + 50145 50273 129 0 0 73 50 72 0.350 0.79 1.07 Intr + 50316 50551 236 0 2 25 58 186 0.265 5.71 1.08 Intr + 51410 51587 178 0 1 32 92 198 0.536 12.66 1.09 Intr + 58101 58266 166 0 1 98 87 198 0.796 19.74 1.10 Intr + 58868 58963 96 1 0 88 71 36 0.688 1.19 1.11 Intr + 60206 60301 96 1 0 31 105 169 0.943 12.19 1.12 Intr + 72756 72989 234 2 0 28 111 246 0.776 17.86 1.13 Intr + 73328 73350 23 1 2 92 69 16 0.970 -4.48 1.14 Intr + 73453 73577 125 1 2 58 68 109 0.974 5.21 1.15 Intr + 75488 75738 251 0 2 27 91 373 0.996 27.73 1.16 Intr + 77158 77445 288 0 0 46 92 302 0.745 22.92 1.17 Term + 77491 77607 117 0 0 85 49 39 0.699 -2.74 1.18 PlyA + 78468 78473 6 1.05 2.05 PlyA - 80117 80112 6 1.05 2.04 Term - 80309 80248 62 0 2 128 47 54 0.941 2.29 2.03 Intr - 82049 81894 156 0 0 29 55 160 0.203 5.86 2.02 Intr - 87055 86955 101 0 2 14 97 75 0.149 -0.17 2.01 Init - 97750 97545 206 1 2 71 68 151 0.671 9.78 2.00 Prom - 98466 98427 40 -8.35 3.02 PlyA - 98619 98614 6 1.05 3.01 Sngl - 100588 99998 591 1 0 75 41 810 0.998 68.94 3.00 Prom - 101788 101749 40 -7.65 4.04 PlyA - 102282 102277 6 -0.45 4.03 Term - 103076 102944 133 1 1 85 45 131 0.960 5.08 4.02 Intr - 104379 104253 127 1 1 81 111 19 0.350 2.32 4.01 Init - 111250 111205 46 1 1 62 107 36 0.316 3.80 4.00 Prom - 114984 114945 40 -6.05 5.00 Prom + 115165 115204 40 -5.05 5.01 Init + 115654 115702 49 0 1 44 77 43 0.011 -0.04 5.02 Intr + 115891 116078 188 2 2 32 84 116 0.006 4.09 5.03 Intr + 123338 123449 112 1 1 114 73 50 0.174 5.13 5.04 Term + 125949 126124 176 1 2 87 43 117 0.242 4.04 5.05 PlyA + 126771 126776 6 1.05 6.00 Prom + 130589 130628 40 -3.25 6.01 Init + 132693 132806 114 2 0 67 81 53 0.208 2.76 6.02 Intr + 137698 137790 93 0 0 115 48 51 0.363 3.04 6.03 Term + 138977 139087 111 1 0 69 40 88 0.431 -0.32 6.04 PlyA + 141655 141660 6 1.05 7.04 PlyA - 142681 142676 6 1.05 7.03 Term - 144319 144175 145 0 1 106 48 161 0.996 10.30 7.02 Intr - 145898 145720 179 2 2 -10 75 141 0.227 0.80 7.01 Init - 150978 150865 114 0 0 103 75 77 0.293 8.16 7.00 Prom - 159123 159084 40 -5.05 8.00 Prom + 159418 159457 40 -5.05 8.01 Init + 159778 159829 52 0 1 74 74 23 0.404 0.87 8.02 Intr + 160483 160859 377 2 2 68 89 224 0.252 14.31 8.03 Intr + 160899 160959 61 2 1 74 82 47 0.614 0.09 8.04 Term + 172172 172335 164 0 2 55 49 172 0.962 7.12 8.05 PlyA + 173050 173055 6 1.05 9.00 Prom + 194762 194801 40 -1.45 9.01 Init + 196283 196424 142 1 1 64 79 59 0.928 2.94 9.02 Intr + 199018 199167 150 2 0 88 38 150 0.817 9.21 9.03 Term + 199713 199936 224 1 2 99 42 136 0.922 6.20 9.04 PlyA + 200325 200330 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 116517 116365 153 0 0 96 43 89 0.884 2.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:66076890_66277477|GENSCAN_predicted_peptide_1|990_aa MREDARDLRRDAIQDLVTSRNALFVIVGLLTGCYFCCCLCCCCNCCCGHCRPESSVPEED FYVSPEDLEEQIKSDMEKVAVGLLCVENTLGNHFKGQRRLVKEKYDKGGRWNRRVLASRT PDTWVEGLSRVSQASEPERSVFLRTRRAHRPALGLCAPRITHAQCNLRLPGLALRFFCAE PQGTPTHVHVPSIAQLPFALRERVRAGSQRSCGSALGRATAGSRLLRLLNPSLALPGPRV TQEAAPAPGGQAGTFSSEPRAASATRGLATAAVRRWGVRGKSSGRPTGKTKQETENLETD ADYFERLLNRGSRNCSAPSQVRRADCLALGRSEVDRTRQKVALERIHPPVTGLYGHVSPP SRDRRLLKTAPAPTPNQGLKQCPPPRETKHLLEPLAAPFPGSTLGDSEEMSASLNYKSFS KEQQTMDNLEKQLICPICLEMFTKPVVILPCQHNLCRKCASDIFQVEQASNPYLPTRGGT TMASGGRFRCPSCRHEVVLDRHGVYGLQRNLLVENIIDIYKQESTRPEKKSDQPMCEEHE EERINIYCLNCEVPTCSLCKVFGAHKDCQVAPLTHVFQRQKGLSHGPLILSRQKDSVSLY TLPTCQPGFIHEQSELSDGIAILVGSNDRVQGVISQLEDTCKTIEECCRKQKQELCEKFD YLYGILEERKNEMTQVITRTQEEKLEHVRALIKKYSDHLENVSKLVESGIQFMDEPEMAV FLQNAKTLLKKISEASKAFQMEKIEHGYENMNHFTVNLNREEKIIREIDFYREDEDEEEE EGGEGEKEGEGEVGGEAVEVEEVENVQTEFPGEDENPEKASELSQVELQAAPGALPVSSP EPPPALPPAADAPVTQGEVVPTGSEQTTESETPVPAAAETADPLFYPSWYKGQTRKATTN PPCTPGSEGLGQIGPPGSEDSNVRKAEVAAAAASERAAVSGKETSAPAATSQGPTGTGHS KKKKGFQRGREKPQPGEKYRNNSSPNVGLF >gi568815590r:66076890_66277477|GENSCAN_predicted_CDS_1|2973_bp atgagggaggatgcaagagacttgaggagagacgctatccaggacttggtgaccagcagg aatgccctgtttgtcatcgttggcctcttgacgggctgctacttttgctgctgcctgtgc tgctgctgcaactgctgctgtggacactgccggcccgagtcatcagtgccagaagaggac ttctatgtgtccccagaggatctggaggagcagatcaagtctgacatggaaaaagtcgcc gttggattactgtgtgttgagaacacactcggcaaccactttaaaggacaacgcaggctg gtaaaggaaaagtacgacaagggggggcggtggaatcgcagggtcttggcatcgcggacc ccagacacctgggttgagggcctttcccgggtcagtcaggctagcgagccggagcgttct gtctttctgcgcacgcgtagagcacacaggccggctctggggctctgcgctcctcggatt acgcatgctcagtgcaatcttcggttgcctggactagcgctccggtttttctgtgctgaa cctcaggggacgccgacacacgtacacgtcccttcgatagctcagctgccattcgccctg cgggaacgtgtccgggcaggttcccagcgcagctgtgggtctgcgcttggccgagcgact gccgggtcacgacttctgcgtcttcttaacccgtctttggcattgcccgggccccgagtc acacaggaggcagcgccggctccagggggccaggcggggaccttctcctcagagccccgg gcagcttctgcgacccgagggctcgcaacggctgccgtgaggaggtggggggtccgcggg aagagttcggggagacccacagggaaaacaaaacaagaaacagagaacttggaaacggac gctgattactttgaacgtttgctcaaccgaggaagcaggaactgttcggccccatctcaa gtccgcagggcggactgcctggctctgggcagatccgaggtggataggacaaggcaaaag gtagccctggagaggatccatcctcctgtcactggcttatatggacacgtcagcccaccg tcacgtgaccgcaggctgctaaaaacagctccagcacccactccaaaccagggcctgaaa caatgtcctccaccgagagaaacgaaacacttgctggagccactcgcagcacccttccct ggcagcacacttggggacagcgaggagatgagcgcatctctgaattacaaatctttttcc aaagagcagcagaccatggataacttagagaagcaactcatctgtcccatctgcttagag atgttcacgaaacctgtggtgattctcccttgtcagcacaacctgtgtaggaaatgtgcc agtgatattttccaggtagaacaggcctctaacccgtatttgcccacaagaggaggtacc accatggcatcagggggccgattccgctgcccatcctgtagacatgaagtggttttggat agacatggggtatatggacttcagaggaacctgctggtggaaaatatcattgacatctac aagcaggagtccaccaggccagaaaagaaatccgaccagcccatgtgcgaggaacatgaa gaggagcgcatcaacatctactgtctgaactgcgaagtacccacctgctctctgtgcaag gtgtttggtgcacacaaagactgccaggtggctcccctcactcatgtgttccagagacag aagggcctcagccatggccccctaattctgagcagacagaaagacagtgtctccctctac acattaccaacctgccaacctggtttcattcatgagcagtctgagctcagtgatggcatc gccatcctcgtgggcagcaacgatcgagtccagggagtgatcagccagctggaagacacc tgcaaaactatcgaggaatgttgcagaaaacagaaacaagagctttgtgagaagtttgat tacctgtatggcattttggaggagaggaagaatgaaatgacccaagtcattacccgaacc caagaggagaaactggaacatgtccgtgctctgatcaaaaagtattctgatcatttggag aacgtctcaaagttggttgagtcaggaattcagtttatggatgagccagaaatggcagtg tttctgcagaatgccaaaaccctgctaaaaaaaatctcggaagcatcaaaggcatttcag atggagaaaatagaacatggctatgagaacatgaaccacttcacagtcaacctcaataga gaagaaaagataatacgtgaaattgacttttacagagaagatgaagatgaagaagaagaa gaaggcggagaaggagaaaaagaaggagaaggagaagtgggaggagaagcagtagaagtg gaagaggtagaaaatgttcaaacagagtttccaggagaagatgaaaacccagaaaaagct tcagagctctctcaggtggagctgcaggctgcccctggggcacttccagtttcctctcca gagccacctccagccctgccacctgctgcggatgcccctgtgacacagggggaggttgta cccactggctctgagcagaccacagagtctgaaactccagtccctgcagcagcagaaact gcggatcccttgttttaccctagttggtataaaggccaaacccggaaagccaccaccaac ccaccttgcaccccagggagcgaaggtctggggcaaatagggcctccaggttctgaggat tcgaatgtacggaaggcagaagtggcagcagccgcagcgagtgagagggcagctgtgagt ggtaaggaaactagtgcacctgcagctacttctcagggtcccactggaacaggccacagc aagaagaaaaagggttttcaaaggggcagagagaaacctcagccaggggaaaagtacaga aacaattcatcccccaatgttggcttgttttga >gi568815590r:66076890_66277477|GENSCAN_predicted_peptide_2|174_aa MHLSQADRGAAVGMNIIHLGSSTRKRRRYGELAQNQLHCHSLELQPVPGGEEPQIQSAME ENGNQRKKSFQTHKAKTDTVRIDKSTILVKDLKIPLSEADETVRSVSNFRRLQSGVIMAV EFMESEGGNSSCVNIQLNECLGRNLENGLGNWDLCRHVDVSIHDMGSVDHSLEA >gi568815590r:66076890_66277477|GENSCAN_predicted_CDS_2|525_bp atgcatctcagccaggcagacagaggggcagcagttggaatgaatatcattcatttaggg agttcaaccaggaaaaggagaagatatggcgagctggctcagaatcagctccactgccac tcgctggagctgcagcctgtccctggaggggaggagcctcaaatccaatctgcaatggaa gaaaatgggaatcagagaaaaaagagtttccaaacacataaggcaaaaactgatactgta agaatagacaaatccacaattctagtcaaagatctcaaaattcctctctcagaagctgat gaaacagtgaggtctgtaagtaactttaggcgacttcagagtggagtcatcatggctgtg gagttcatggaatctgaaggaggtaatagcagctgtgtaaacattcagctaaatgagtgc ctagggagaaatctggaaaatggtttgggaaattgggatttatgcaggcacgtggatgtc agcatacatgacatgggctcagtggatcacagccttgaagcatga >gi568815590r:66076890_66277477|GENSCAN_predicted_peptide_3|196_aa MRLPLLVSAGVLLVALLPCPPCRALLSRGPVPGARQAPQHPQPLDFFQPPPQSEQPQQPQ ARPVLLRMGEEYFLRLGNLNKSPAAPLSPASSLLAGGSGSRPSPEQATANFFRVLLQQLL LPRRSLDSPAALAERGARNALGGHQEAPERERRSEEPPISLDLTFHLLREVLEMARAEQL AQQAHSNRKLMEIIGK >gi568815590r:66076890_66277477|GENSCAN_predicted_CDS_3|591_bp atgcggctgccgctgcttgtgtccgcgggagtcctgctggtggctctcctgccctgcccg ccatgcagggcgctcctgagccgcgggccggtcccgggagctcggcaggcgccgcagcac cctcagcccttggatttcttccagccgccgccgcagtccgagcagccccagcagccgcag gctcggccggtcctgctccgcatgggagaggagtacttcctccgcctggggaacctcaac aagagcccggccgctcccctttcgcccgcctcctcgctcctcgccggaggcagcggcagc cgcccttcgccggaacaggcgaccgccaactttttccgcgtgttgctgcagcagctgctg ctgcctcggcgctcgctcgacagccccgcggctctcgcggagcgcggcgctaggaatgcc ctcggcggccaccaggaggcaccggagagagaaaggcggtccgaggagcctcccatctcc ctggatctcaccttccacctcctccgggaagtcttggaaatggccagggccgagcagtta gcacagcaagctcacagcaacaggaaactcatggagattattgggaaataa >gi568815590r:66076890_66277477|GENSCAN_predicted_peptide_4|101_aa MHKQYIRIHGLKEVGVCVPLLEKKVQRLLLHYANNMGLQSKELVPNSHFTTNLLSDFRAV VDWYRPKMEVEICIASFVTLATKEAAEVIRNEEQEISDKQL >gi568815590r:66076890_66277477|GENSCAN_predicted_CDS_4|306_bp atgcataagcagtatattaggattcatggactaaaagaagtgggagtttgtgtgcctctg ctggaaaagaaagtccaaaggttattgttacattatgcaaataatatgggcttgcaatca aaagagctggttcctaattctcactttaccactaacttgctgagtgacttcagagcagtt gtggattggtatagacccaaaatggaagttgaaatatgcattgcatcatttgtcacatta gctacaaaggaagcagcagaagtaatcagaaatgaggagcaggaaatttccgataaacaa ctctga >gi568815590r:66076890_66277477|GENSCAN_predicted_peptide_5|174_aa MQSIVPECVCEGVAKGDSKFFSFWTLGLTPVFCQGLLDLRPQTEGCTVSFPTFEVFGLRL ASLLLSLQTAYCGTSPCDHMWGYQALLDQLKPFLMRDVPCANEAALLIHLVLSTGGMRML SAQAAIQSGNSRNTLEMPLTPAKENSQYLSSQNVLSTQVTCIKLMLGAAGLQNF >gi568815590r:66076890_66277477|GENSCAN_predicted_CDS_5|525_bp atgcaaagtattgttcctgagtgtgtttgtgagggtgttgccaaaggagactccaagttc ttcagcttttggactcttggacttacaccagtgttttgccaggggctcttagatcttcgg ccacagactgaaggctgcactgtcagcttccctacttttgaggtttttggactcagactg gcttccttgctgctcagcttgcagacggcctattgtgggacttcaccttgtgatcatatg tggggatatcaagcactcctggatcagttgaagccattcctgatgagagatgttccttgt gccaatgaggctgctcttctcattcatttagtcctctccactggtggcatgagaatgctt tcagcacaggcagccattcagagtgggaactccaggaacactcttgagatgccactcaca cctgccaaagagaactcccaatacctgtcttcccagaatgtgctgtccactcaagttacc tgcatcaaactcatgcttggtgctgctgggctgcagaatttctaa >gi568815590r:66076890_66277477|GENSCAN_predicted_peptide_6|105_aa MLLPENRGANTAEEWSRGLQMAPGAEIKCAAHLIKTGEPITTAFTISSSGMFLRQLKIVS KNVSQKREKGQLIKDYLKNILEEKNQAMDKVERICLAVKLHNAKK >gi568815590r:66076890_66277477|GENSCAN_predicted_CDS_6|318_bp atgctgctccctgagaacagaggagcaaacacagctgaggaatggagtagaggtttgcag atggccccaggagctgaaattaaatgtgctgctcacctcatcaagacaggggaacccatc acgactgcgttcaccatttcttcaagtggaatgttccttcgacaacttaaaatagtttca aaaaatgtttcacagaaaagagaaaagggccaattaatcaaggattatttgaaaaacatt ttggaagaaaaaaatcaagcaatggataaggttgagcggatttgcctagctgtcaaattg cataatgccaaaaagtag >gi568815590r:66076890_66277477|GENSCAN_predicted_peptide_7|145_aa MAPEVPSVAHAIALKGASSKLWQHSCSAESADTQIQPHTVEEGIKKRRQVGMLEWVCYAR PGDRSEDYIPREGPDDTVVTMSVGKSESTGLVDGALLRKSDLWDPWRSCFLNLCKVDLAV WVTVTAVAMKRHNSDLGSPAAENID >gi568815590r:66076890_66277477|GENSCAN_predicted_CDS_7|438_bp atggctccagaagtcccaagtgtggcccatgccatagctttgaagggtgcaagcagtaag ctttggcagcattcatgtagtgctgaatctgcagacactcagattcaaccccatacagta gaggaaggcataaaaaagcgcaggcaagtaggtatgctagagtgggtgtgttatgcaaga ccaggagaccgatcagaggattatattcccagagagggcccagatgacacggtggtcacc atgtctgttgggaaatctgagtctacagggcttgttgatggtgctcttctgaggaagagt gacctatgggatccatggaggagctgcttcctgaatctgtgtaaagtagatcttgccgtg tgggtaactgtgactgcggtggccatgaagaggcacaactcagacctcggatctcctgct gcagagaacattgactga >gi568815590r:66076890_66277477|GENSCAN_predicted_peptide_8|217_aa MPNVGDGLMGAANHHGTGRAWLVLPAWIPRLSNSSQVRSSKGCVSKRVWGSGHGAQSGTP AAAVGQVHQVLAQAQALCEAMARPGMLQAASVAVTREHSGTRKLGDSRNHRAPKTESQPW LKELPGLGSLKGYSSLLLFNCNMLFQPCHLAGPKFVSCVQEEQVCWTSQRNLHQNRENEQ AFIAMAQLYKKKWRDVVGSQGHRMEGPAGAVAEKHKL >gi568815590r:66076890_66277477|GENSCAN_predicted_CDS_8|654_bp atgcctaatgtaggtgatgggttgatgggtgcagcaaaccaccatggcacaggccgagct tggctcgtgctaccagcctggatcccacgcctgtcaaactcaagccaggtgcggagtagc aaggggtgtgtgagcaagcgagtgtgggggtctggccacggtgcacagtcaggtacgccg gctgctgcagtggggcaggtgcaccaggtgctggcacaggcacaggctctctgtgaggct atggccagaccaggcatgctacaagcagcttccgtggctgtcaccagggaacacagtggc acccggaagcttggagactccaggaaccacagggccccaaagactgagtcacagccctgg ctcaaggagctcccaggtctgggctccctaaaaggctacagttctctccttctcttcaac tgcaacatgctctttcagccctgccacttggcaggtcccaagtttgtgtcctgtgtccag gaagaacaagtgtgctggaccagtcaaagaaacctgcatcaaaaccgagagaatgaacaa gccttcattgccatggcacagttatataaaaagaaatggagagatgttgtgggaagtcag ggacaccgaatggaaggaccggctggagccgtggcagagaaacataaattgtga >gi568815590r:66076890_66277477|GENSCAN_predicted_peptide_9|171_aa MVNFMCNLTGVGDTEIVGETISWCVYEVFQKRLAFESVDSVNTIPHHRLFFHGSVSIRNY SVFTGLFPPEYELCESGALPLRLFKLSQCPAQSPLALPVHISSHTDSIPHLQGSRPSLIQ LRSLGKVPTEDAGSLGAWTMFFLLICTSTRVHRARGTASGLQPPLSKILVE >gi568815590r:66076890_66277477|GENSCAN_predicted_CDS_9|516_bp atggttaattttatgtgcaacctgactggagtaggggacaccgagatagttggagaaact atttcttggtgtgtctatgaggtgtttcagaagagattagcatttgaatcagtggactca gtaaacacgatccctcaccacaggttattttttcatggcagtgtcagcatccgaaactac tctgtcttcactgggctcttccctccagaatatgagctctgtgagagcggggccttgccc cttcgcctcttcaagctgtcacagtgtccagcccagagtccacttgcacttcctgtccac atttctagccacacagactctatcccccacctgcagggctcccggccttccctgattcag ctgaggagccttggcaaggtgcccacagaggatgctgggtctctgggagcttggaccatg ttcttcctgctcatctgcacgtcaaccagggtgcatcgagcacgtggcacagcatctggc ctgcagccacccctcagtaaaattcttgttgaatga