GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:25:43 Sequence gi568815581f:1170564_1371268 : 200705 bp : 51.11% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2303 2379 77 1 2 78 89 44 0.224 4.01 1.02 Intr + 3987 4169 183 0 0 87 95 32 0.333 3.22 1.03 Intr + 7382 7539 158 2 2 110 16 74 0.386 2.57 1.04 Intr + 8890 9181 292 2 1 76 49 136 0.647 5.24 1.05 Intr + 11704 11748 45 1 0 127 110 29 0.720 7.01 1.06 Intr + 18459 18580 122 0 2 74 76 36 0.066 1.64 1.07 Intr + 21340 21469 130 2 1 107 95 67 0.991 9.56 1.08 Intr + 22730 22922 193 2 1 35 75 139 0.817 7.32 1.09 Term + 27629 27871 243 1 0 71 37 122 0.769 1.63 1.10 PlyA + 29272 29277 6 1.05 2.05 PlyA - 29376 29371 6 1.05 2.04 Term - 30688 30587 102 1 0 89 44 73 0.100 1.58 2.03 Intr - 46335 46219 117 0 0 65 47 112 0.433 5.97 2.02 Intr - 58212 58097 116 1 2 48 84 11 0.201 -2.63 2.01 Init - 59067 58230 838 0 1 57 80 1300 0.979 119.09 2.00 Prom - 63105 63066 40 -3.01 3.00 Prom + 66325 66364 40 -2.91 3.01 Sngl + 79422 79856 435 2 0 55 42 222 0.608 10.93 3.02 PlyA + 82494 82499 6 1.05 4.00 Prom + 89228 89267 40 -4.61 4.01 Init + 100001 100565 565 1 1 69 15 666 0.924 50.67 4.02 Term + 100738 100949 212 2 2 45 50 174 0.939 7.08 4.03 PlyA + 101235 101240 6 1.05 5.00 Prom + 102576 102615 40 -5.41 5.01 Init + 109439 109825 387 1 0 96 115 440 0.989 44.33 5.02 Term + 110256 110330 75 2 0 70 55 0 0.292 -6.96 5.03 PlyA + 113252 113257 6 1.05 6.04 PlyA - 114116 114111 6 1.05 6.03 Term - 116340 116222 119 1 2 26 48 158 0.587 4.61 6.02 Intr - 117870 117727 144 1 0 -58 56 184 0.424 1.36 6.01 Init - 118154 117872 283 2 1 64 3 371 0.310 23.40 6.00 Prom - 120870 120831 40 -7.00 7.03 PlyA - 121040 121035 6 1.05 7.02 Term - 123191 122867 325 1 1 47 44 469 0.541 32.99 7.01 Init - 123775 123735 41 1 2 70 97 -21 0.491 -3.39 7.00 Prom - 124308 124269 40 -6.70 8.00 Prom + 124425 124464 40 -8.19 8.01 Init + 124871 125060 190 1 1 60 94 336 0.387 28.55 8.02 Intr + 134217 134489 273 1 0 77 65 149 0.683 9.55 8.03 Intr + 157249 157340 92 2 2 93 56 38 0.405 1.31 8.04 Term + 158370 158948 579 2 0 18 49 385 0.772 22.50 8.05 PlyA + 159604 159609 6 -3.94 9.03 PlyA - 159750 159745 6 1.05 9.02 Term - 161321 161217 105 2 0 85 50 68 0.893 1.41 9.01 Init - 164323 164267 57 1 0 110 92 54 0.961 9.26 9.00 Prom - 169211 169172 40 -2.31 10.06 PlyA - 173093 173088 6 1.05 10.05 Term - 174936 174884 53 1 2 118 53 98 0.787 7.18 10.04 Intr - 183784 183648 137 0 2 57 115 143 0.998 14.62 10.03 Intr - 190735 190529 207 0 0 75 111 147 0.997 14.42 10.02 Intr - 191445 191339 107 0 2 97 51 30 0.990 -0.29 10.01 Intr - 194495 194296 200 0 2 91 81 182 0.214 17.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 27379 27450 72 0 0 70 40 160 0.873 8.47 S.002 Term + 37818 37970 153 2 0 87 39 94 0.829 2.63 S.003 Sngl - 98096 97827 270 2 0 60 46 192 0.831 5.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:1170564_1371268|GENSCAN_predicted_peptide_1|480_aa MGELQNSPPGTNSHLEKSAKNIEQSRWGTESIKGMKISQENEIALGNVAWGKRSELQKAG SSNNRWSRSAFHDGQATAGSCLAQQGRGRSRWSGGTWHRTYAQGIRLCPFDVEKPKQNEP SQGQLLGGGEEGTWMEEEADQPGSWVPTPIPIPNPTPIRTPISPPPALPGPARCLGPDPD LGVPIPILGSRSPSWGPDPDPGVPPPPGTRTAVEGVDPGQARQAPVAERLHPAAAARSEN RGPEQLSNPLRGSWHLVLSWSAQSHEHTVRCQELARETSTHDSCSPERRVKKGGKERSTE HSLNPAAGSATNLLCDSEELPFPVCPLMFSPRHEEMIQDSPALPASLLNALRPTCPEHRQ YLRSRHPVKHVTFTLRKQPLYVEATVTTPALQKRLKFTLLRPHMRLYMQAGHISFQAHKQ PRLPAKTTNPNVGTRGTLPKAELLLIFSVTFRTKKEEVLSGQGELGLRHQQTATSHLASA >gi568815581f:1170564_1371268|GENSCAN_predicted_CDS_1|1443_bp atgggtgagctgcagaactctccccctggtacaaacagtcacttggagaaaagtgccaag aatatagaacagagcagatggggcactgaatctataaagggcatgaaaatatcacaagag aatgaaatcgcccttggaaatgttgcttggggtaagagaagtgagctccaaaaagcaggc tctagtaacaacagatggtcacgttctgccttccatgacggccaagccactgcaggctct tgcttggctcagcagggaaggggaaggagcagatggagcggcggcacctggcacaggacc tatgcccaggggatccggctttgcccctttgatgttgagaaaccaaagcagaatgaacca tcacaaggacagctcctggggggaggagaggaaggaacatggatggaggaggaggcggac cagcccggctcctgggtcccgaccccgatcccgattcccaacccgaccccgatccggacc ccgatctcgcccccgcccgcgctccccggaccagcccggtgcctgggtcccgatcccgat cttggggtcccgatcccgatcctggggtcccgatctccatcctggggtcccgatcccgat cctggggtcccgcccccgcccggcacacgtactgctgtagagggtgtcgatccaggacag gcgcggcaggccccggtggctgagcggctccatcccgcggcggcggctcggtcagaaaac cgaggccctgagcagctgagtaacccactcagaggctcttggcaccttgtcctgagctgg tcagctcagtctcatgaacacacagtccggtgccaggagctggcacgagaaactagcaca catgactcgtgttctcccgaaaggagagtgaagaagggagggaaagagcgctcgaccgag cacagcctgaatccagctgctggttctgccactaacttgctgtgtgactctgaagaactg cctttccctgtctgtcccttgatgttctcacctcgacacgaagagatgattcaggactcc cccgctctcccggcgagtttgctgaacgcccttcggccgacgtgtcccgagcatcgccag tatttacgttctaggcaccctgttaagcacgtaaccttcactcttcgcaagcagcctctc tatgtggaagccacagtcaccacgcccgctttacagaagaggctcaagttcacactgcta aggccacacatgaggctgtacatgcaggcaggccacatttccttccaggcccacaaacaa cctcgacttcctgcaaagaccactaaccccaacgtcggtacacgtggcacgttgcccaag gccgagctgctgctgatttttagtgtgactttcaggaccaaaaaggaagaggttctgtct ggacaaggagagcttggcctgagacaccaacagacggcaacttcccacctggcctctgcc tga >gi568815581f:1170564_1371268|GENSCAN_predicted_peptide_2|390_aa MWDPRAFERRWRAEFPGAEAPVPRLESVRDAERELERRRLNLERLQQVLAEEQLKASLPQ AALARGGGGDSGEPGRPDPEAESPRGDPGRPDPEAESPRGDLPAGPGAAGEAGGGRSWAD AVHRQLLQPQLRFRAREDDAPLGDPQAAPGTDEGGDGGDHDFEMVDFNEKFILSHWLVAP CRFGTRERARSPRRWMHLIPGGRRPQRGTRDLEQREAEAKGRPGSPLAAEETPGREHLPP WRRRRFLRVPERDSPGHSSPERDSDGSRHSSDREDDFSADPARPRRAPPTPGCRDPFPAD SAALFNLELGAGERAGRRRAQSGFLNAELVGNLCGLAVSTQKLIIHINGLKELLFTENPH PDLKHCALHPEQQPRGHRPREAIADVNSLE >gi568815581f:1170564_1371268|GENSCAN_predicted_CDS_2|1173_bp atgtgggacccccgcgcgttcgagcggcgctggcgggccgagttccccggggcggaggcg ccggtcccgaggctggagtcggtgcgggacgcggagcgggagctggagcgccgcagactc aacctggagcggctgcaacaggtgctggcggaggagcagctcaaagcctcgctcccgcag gccgcgctggcccggggcggcggcggagactccggggagcccgggcgccccgaccccgag gccgagtctccccgcggggaccccgggcgccccgaccccgaggccgagtctccccgcggg gacctgcccgccgggcccggggcggccggagaagccggcgggggaaggagctgggccgac gccgtccaccggcagctcctgcagccccagctgcggttccgggcccgggaggacgacgcg cccctcggggacccccaggcggctcccggcacggacgaggggggcgacggcggcgaccac gacttcgagatggtggatttcaacgagaagttcatcctcagccactggctggtggcccct tgccggttcgggacccgcgagcgggcgcgcagcccccgccgctggatgcacctgatcccc ggggggcggcggccgcagcgggggacccgcgacctggagcagcgggaggccgaggccaag ggccgcccgggctcgcccctcgccgcggaggagacccccgggcgcgagcacctgcccccg tggcggcgccgcaggttcctgcgggtgcccgagcgggactcgcccggccacagctcgccg gagagagacagcgacggcagccggcacagctccgaccgcgaggacgacttctccgcagac cccgctcgaccccgccgcgcgccccccaccccgggctgccgggaccccttcccggccgac tccgcggccctgtttaacctcgagctcggggccggggagcgggcgggaaggaggagggca cagagtggcttcctgaatgccgagcttgttggtaacctttgtgggctggcagtgagcacg cagaaattaattattcatatcaatgggctgaaggagctgttgttcactgagaacccacat cctgacctaaagcattgtgctttgcatccggagcagcagcctcgtggacaccggccccgt gaagctattgctgatgtgaacagtcttgagtga >gi568815581f:1170564_1371268|GENSCAN_predicted_peptide_3|144_aa MGTLAGDCGGKIMGTLAGDCGGKIMGTLAGGCGGKIMGTLAGGCGGKIMGTLAGGCGGKI MGTLAGGCGGKIMGTLAGGCGGKIMGTLAGGCGGKIMGTLAGCCGGKIMGTLAGGCGGEE KSRERQTEFHSLQCTQLPLQQLCA >gi568815581f:1170564_1371268|GENSCAN_predicted_CDS_3|435_bp atggggacccttgctggggactgtggagggaagatcatggggacccttgctggggactgt ggagggaagatcatggggacccttgctgggggctgtggagggaagatcatggggaccctt gctgggggctgtggagggaagatcatggggacccttgctgggggctgtggagggaagatc atggggacccttgctgggggctgtggagggaagatcatggggacccttgctgggggctgt ggagggaagatcatggggacccttgctgggggctgtggagggaagatcatggggaccctt gctgggtgctgtggagggaagatcatggggacccttgctgggggctgtggaggggaagaa aagtcacgtgaacgacagacggaattccattccctccagtgcacacagctgccgctgcag caattatgtgcttaa >gi568815581f:1170564_1371268|GENSCAN_predicted_peptide_4|258_aa MLRGAPGLGLTARKGAEDSAEDLGGPCPEPGGDSGVLGANGASCSRGEAEEPAGRRRARP VRSKARRMAANVRERKRILDYNEAFNALRRALRHDLGGKRLSKIATLRRAIHRIAALSLV LRASPAPRGPCGHLECHGPAARGDTGDTGASPPPPAGPSLARPDAARPSVPSAPRCASCP PHAPLARPRRPGWQLDYKTRDCKGVLDRAELDRLRPGVEEKELAREKELRDPGLRPHIPA WGREKELRGLGLRPHIPA >gi568815581f:1170564_1371268|GENSCAN_predicted_CDS_4|777_bp atgctgcggggcgcgccaggactaggcctcacggcgcggaagggggccgaggactctgcg gaggacttggggggcccctgccccgagcccgggggcgattcgggggtgctgggggcgaac ggcgcttcctgcagccggggcgaggcggaggagccggcgggcaggaggcgcgcgcggccg gtgcggtccaaggcgcggcgcatggccgccaacgtgcgggagcgcaagcgcatcctagac tacaacgaggccttcaacgcgctgcgccgggcgctgcggcacgacctgggcggcaagagg ctctccaagatcgccacgctgcgcagggccatccaccgcatcgccgcgctctccctggtc ctgcgcgccagccccgcgccccgcgggccctgcggacacctggagtgccacggcccggcc gcgcgcggggacaccggggacacaggcgccagccccccgccgcctgcagggcccagcctc gcgcgcccagacgccgcccgcccctcggtgccgtccgcgccccgctgcgcctcgtgcccc ccgcacgcgcccctggcacggcccaggaggccgggctggcagctggactataaaacccgg gactgcaaaggcgtcttggacagagcagaactggaccggctgagacctggcgtggaagag aaggagctcgccagagagaaggagctccgggacccggggttgcgcccccacatcccagcc tggggcagagagaaggagctccggggcctggggctgcgcccccacatcccagcctag >gi568815581f:1170564_1371268|GENSCAN_predicted_peptide_5|153_aa MAHPVQSEFPSAQEPGSAAFLDLPEMEILLTKAENKDDKTLNLSKTLSGPLDLEQNSQGL PFKAISEGHLEAPLPRSPSRASSRRASSIATTSYAQDQEAPRDYLILAVVACFCPVWPLN LIPLIISIMATWTCSQPLPAPVSKSRSGFPAVP >gi568815581f:1170564_1371268|GENSCAN_predicted_CDS_5|462_bp atggcccacccggtgcagtccgagtttccttcagcacaggagccaggctccgccgcattc ctggacctgccggagatggagatactcctcaccaaggcagagaacaaggatgacaagacc ctgaatctgtccaagaccctctcggggcctctggatctggagcagaacagccagggccta cccttcaaggccatctccgaggggcacctggaggccccactgcctcggtccccctcccgg gccagctcaaggagggcgtcctccatcgccaccacctcctatgcccaagaccaagaagcc cccagagattacctcatcctggccgtcgtcgcctgcttctgccccgtctggcccctcaac ctcatccccctcatcatttccatcatggccacctggacatgctcccagcctctgccggct ccggtttccaagagtcgcagcggcttccctgccgtaccctga >gi568815581f:1170564_1371268|GENSCAN_predicted_peptide_6|181_aa MGNPVGDEEPVGGGEPVGDEKLEGDGEPMGDGEPVGDEKLEGDEEPMGDGEPIGDGEPVG DEKLEGDGEPVGDGKPMGDGEPVGDEEPVGDGEPMGDEEPVGVGEPVGDEEPMGDGEPMG DGEPVGDEEPVGDGEPVGDGEPAGANHTRLANNNTPQADNNTPQADNNTPQADNTPRADN T >gi568815581f:1170564_1371268|GENSCAN_predicted_CDS_6|546_bp atggggaacccggtgggggatgaggagcccgtggggggtggggaacccgtgggggatgag aagctggagggggatggggaacccatgggggatggggaacccgtgggggatgagaagctg gagggggacgaggagcccatgggggatggggaacccattggggatggggaacccgtgggg gatgagaagctggagggggatggggagcccgtgggggatgggaaacccatgggggatggg gaacccgtgggggatgaggaacccgtgggggatggggaacccatgggggatgaggaacct gtgggggttggggaacctgtgggggatgaggagcccatgggggatggggagcccatgggg gatggggaacccgtgggggatgaggagcccgtgggggatggggaacccgtgggggatggg gaacccgctggagcaaatcacacccggctggccaataacaacactccacaggccgacaac aacaccccacaggccgacaacaacaccccacaggccgacaacaccccacgggctgataac acctga >gi568815581f:1170564_1371268|GENSCAN_predicted_peptide_7|121_aa MWLHGGLVLCITPRSVWDSVTSSNSTCDIIKLNPQHQTQLMTSSNPAHDIIKPNLRHHQT QPTTSSNSTHDIIKPNPRHHQTQLVTSSNPTCDIIKHNPRHHQTQPTTSSNPTHDIIKLN S >gi568815581f:1170564_1371268|GENSCAN_predicted_CDS_7|366_bp atgtggcttcacgggggccttgttttatgtatcactcctcgctccgtctgggacagcgtg acatcatcaaattcaacctgtgacatcatcaaactcaacccacaacatcaaactcaactc atgacatcatcaaacccagcccatgacatcatcaaacccaacctgcgacatcatcaaaca caacccacgacatcatcaaactcaacccacgacatcatcaaacccaacccacgacatcat caaactcaactcgtgacatcatcaaacccaacctgcgacatcatcaagcacaacccacga catcatcaaactcaacccacgacatcatcaaacccaacccacgacatcatcaaactcaac tcgtga >gi568815581f:1170564_1371268|GENSCAN_predicted_peptide_8|377_aa MKLTHSLPGSRGLSVLSPQSRSSMQQGNVDGARRLGRLARLLSITLIIMGIVIIMVAVTV NFTGIPAPGVGPGSYAAFVLTAKWALLYRQKHENCKRLPFLGCLRLDAQPGVLRVVGFQR SWALPEQGTVCEAVLGRPARYALLGSFPGGGSTAGGDGVSICKPGPSADAGSACTFTSDF PASRTRNTGRTLTQYPTETTGRTLTQYPTGTTGRTLARYPTETAGRTLTQYPTETMGRTL TQYSRGPTGRTLTQYPTETTGRTLTKYTTETAGRTLTQYLRGPTGRTLSQYPTETARRTL TRYSTEIARRTLTQYPTETTGRTLTQYPTETAGRTLTQYPTVLRKRVSPRSCRGARSGSM GSAGGDSGPPLKMGFGS >gi568815581f:1170564_1371268|GENSCAN_predicted_CDS_8|1134_bp atgaagctgacccacagccttcccggttcccggggtctctctgtgctctctccgcagtct cgaagcagcatgcaacagggcaacgtggacggcgcccggaggctgggccgcctggctcgg ctgctcagcattaccctcatcatcatgggcatcgtcattatcatggtggccgtgaccgtc aacttcacaggaattcctgctcccggagttggcccaggttcatatgctgcctttgtgctg actgcaaagtgggccctcctatacagacaaaaacatgaaaactgcaaacgcctccccttc ctgggatgtttgcggctggacgcccagcctggtgtgctgagggtagttggatttcagagg tcctgggcactgcctgagcagggcacagtctgcgaggccgtcctgggcagacctgcccgc tacgccttgttggggtcctttcccggaggtggcagcacagccggtggggacggagtgtcc atctgcaaaccaggaccctcggccgacgctgggtctgcctgcaccttcacctcagacttc ccagcctccagaacgagaaacactggaaggactctgacccagtaccccacagagaccact ggaaggactctgacccagtaccccacagggaccactggaaggactctggcccggtacccc acagagactgctggaaggactctgacccagtaccccacagaaaccatgggaaggactctg acccagtactccagagggcccactggaaggactctgacccagtacccgacagagaccact ggaaggaccctgaccaagtacaccacagagactgctggaaggactctgacccagtacctc agagggcccactggaaggactctaagccagtaccccacagagaccgctagaaggactctg actcggtactccacagagattgctagaaggactctgacccagtaccccacagagaccact ggaaggactctgacccagtaccccacagagacagctggaaggactctgacccagtacccc acagttctaaggaagagggttagtccacggagctgcagaggtgcccgctccggctctatg gggtcagcaggaggtgactcaggaccacctctgaagatgggctttgggtcctga >gi568815581f:1170564_1371268|GENSCAN_predicted_peptide_9|53_aa MAPYEFLISVLEGIKADTQSCLLIFLSSLQLVITLFKDTVQLPVVCDRVWWYP >gi568815581f:1170564_1371268|GENSCAN_predicted_CDS_9|162_bp atggccccatacgagtttctaatctcggtgctcgagggaattaaggcagatactcagagc tgcctccttatcttcctcagcagcttacaacttgtcatcactctcttcaaggacacggtg cagctccctgttgtttgtgaccgtgtgtggtggtacccatga >gi568815581f:1170564_1371268|GENSCAN_predicted_peptide_10|234_aa XMVESMKKVAGMDVELTVEERNLLSVAYKNVIGARRASWRIISSIEQKEENKGGEDKLKM IREYRQMVETELKLICCDILDVLDKHLIPAANTGESKVFYYKMKGDYHRYLAEFATGNDR KEAAENSLVAYKAASDIAMTELPPTHPIRLGLALNFSVFYYEILNSPDRACRLAKAAFDD AIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDMQGDGEEQNKEALQDVEDENQ >gi568815581f:1170564_1371268|GENSCAN_predicted_CDS_10|705_bp naaatggtggagtcaatgaagaaagtagcagggatggatgtggagctgacagttgaagaa agaaacctcctatctgttgcatataagaatgtgattggagctagaagagcctcctggaga ataatcagcagcattgaacagaaagaagaaaacaagggaggagaagacaagctaaaaatg attcgggaatatcggcaaatggttgagactgagctaaagttaatctgttgtgacattctg gatgtactggacaaacacctcattccagcagctaacactggcgagtccaaggttttctat tataaaatgaaaggggactaccacaggtatctggcagaatttgccacaggaaacgacagg aaggaggctgcggagaacagcctagtggcttataaagctgctagtgatattgcaatgaca gaacttccaccaacgcatcctattcgcttaggtcttgctctcaatttttccgtattctac tacgaaattcttaattcccctgaccgtgcctgcaggttggcaaaagcagcttttgatgat gcaattgcagaactggatacgctgagtgaagaaagctataaggactctacacttatcatg cagttgttacgtgataatctgacactatggacttcagacatgcagggtgacggtgaagag cagaataaagaagcgctgcaggacgtggaagacgaaaatcagtga