GENSCAN 1.0	Date run: 6-Nov-116	Time: 22:33:34

Sequence gi568815596f:98420049_98687605 : 267557 bp : 43.92% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   1491   1530   40                              -2.76
 1.01 Init +  13418  13490   73  1  1   39   43   102 0.650   2.03
 1.02 Term +  16630  16721   92  2  2  126   42    38 0.413   0.88
 1.03 PlyA +  23364  23369    6                               1.05

 2.02 PlyA -  23399  23394    6                               1.05
 2.01 Sngl -  25425  24577  849  0  0   67   42   314 0.909  19.59
 2.00 Prom -  44780  44741   40                              -3.96

 3.14 PlyA -  46729  46724    6                               1.05
 3.13 Term -  47592  47413  180  0  0   -6   38   137 0.170  -3.09
 3.12 Intr -  50473  50341  133  0  1  110   89    65 0.989   9.45
 3.11 Intr -  52128  51941  188  0  2   65   37   120 0.790   3.09
 3.10 Intr -  54671  54612   60  2  0   64   94    46 0.485   1.73
 3.09 Intr -  57323  57145  179  0  2   74  -18   138 0.212   1.44
 3.08 Intr -  61278  61184   95  2  2   82   60    41 0.084   0.31
 3.07 Intr -  64438  64320  119  1  2   92   53    25 0.137  -1.34
 3.06 Intr -  69146  69085   62  0  2  145   65    42 0.435   6.05
 3.05 Intr -  70294  70134  161  0  2  110   82    -1 0.665   1.13
 3.04 Intr -  85402  85302  101  1  2  114   82    78 0.873   8.71
 3.03 Intr -  86663  86552  112  1  1   88   46    18 0.536  -2.02
 3.02 Intr -  88066  87896  171  0  0  128   91    94 0.986  12.76
 3.01 Init -  91750  91581  170  1  2   62   90    65 0.335   3.11
 3.00 Prom -  92568  92529   40                              -8.26

 4.00 Prom +  96938  96977   40                              -2.46
 4.01 Init + 100001 100138  138  1  0   79   80   127 0.513  11.21
 4.02 Term + 100216 100434  219  0  0   60   36   110 0.381   0.04
 4.03 PlyA + 100508 100513    6                               1.05

 5.00 Prom + 101117 101156   40                              -2.86
 5.01 Init + 114937 115082  146  0  2   74   22   116 0.369   3.27
 5.02 Intr + 117815 117926  112  2  1   68   96   185 0.893  17.68
 5.03 Intr + 118389 118472   84  2  0  122  119   -62 0.519   0.32
 5.04 Intr + 118843 118933   91  0  1   39   64    84 0.904   0.67
 5.05 Intr + 119480 119627  148  0  1   79   94   255 0.999  24.49
 5.06 Intr + 123829 123959  131  1  2  104   90   190 0.999  21.14
 5.07 Intr + 125921 126025  105  0  0   87   82    20 0.729   1.49
 5.08 Intr + 126538 126646  109  2  1   20   98   180 0.997  11.54
 5.09 Intr + 130367 130471  105  2  0   64   69    77 0.861   2.73
 5.10 Intr + 132738 132921  184  0  1   45   97   231 0.974  19.49
 5.11 Intr + 134223 134441  219  2  0   84   92   294 0.998  27.90
 5.12 Intr + 135505 135643  139  0  1   11   86   269 0.995  19.04
 5.13 Intr + 143417 143589  173  0  2  129   78   337 0.999  36.46
 5.14 Intr + 144592 144715  124  0  1  117   89   313 0.995  34.46
 5.15 Intr + 145592 145718  127  0  1  118    5   199 0.999  14.34
 5.16 Intr + 145981 146121  141  1  0  105   92   266 0.887  28.17
 5.17 Intr + 148523 148620   98  2  2   43   87   154 0.820  10.45
 5.18 Intr + 152767 152879  113  2  2  119   69    92 0.983  10.50
 5.19 Intr + 156941 157095  155  1  2   72  109   226 0.987  21.97
 5.20 Term + 167428 167560  133  1  1  104   47   106 0.608   5.66
 5.21 PlyA + 167592 167597    6                               1.05

 6.03 PlyA - 167660 167655    6                               1.05
 6.02 Term - 173183 173020  164  0  2   35   38   135 0.448   1.30
 6.01 Init - 174433 174346   88  1  1   78   64    43 0.403   1.70
 6.00 Prom - 174557 174518   40                              -2.46

 7.04 PlyA - 175320 175315    6                               1.05
 7.03 Term - 181630 181526  105  1  0   99   54    29 0.777  -0.99
 7.02 Intr - 184143 184060   84  0  0   78  103    29 0.853   3.42
 7.01 Init - 188357 188259   99  2  0   95   74   140 0.975  13.57
 7.00 Prom - 201471 201432   40                              -3.96

 8.10 PlyA - 202354 202349    6                               1.05
 8.09 Term - 214820 214764   57  2  0   90   42    58 0.618  -0.91
 8.08 Intr - 216547 216469   79  0  1   81   86    64 0.922   5.05
 8.07 Intr - 219953 219760  194  2  2   83   81   121 0.862   9.19
 8.06 Intr - 224005 223875  131  2  2   77   92    66 0.404   6.31
 8.05 Intr - 225164 225136   29  1  2   35   93    36 0.139  -3.44
 8.04 Intr - 236417 236304  114  1  0   58  105    62 0.699   4.46
 8.03 Intr - 238216 238170   47  1  2   71   89    53 0.875   0.91
 8.02 Intr - 243131 242998  134  0  2   84   76    64 0.890   5.16
 8.01 Intr - 258423 258256  168  1  0  109   87   112 0.841  13.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:98420049_98687605|GENSCAN_predicted_peptide_1|54_aa
MDGSSQRENEEDVKVETPDKTIKSRRETPNGHATGALDHSPFCQGPLDRPLREL

>gi568815596f:98420049_98687605|GENSCAN_predicted_CDS_1|165_bp
atggatggcagcagtcaaagagagaatgaggaagatgtaaaagtggaaacccctgataaa
accatcaaatctcgccgtgaaactccaaacggtcatgcaactggagccttggaccacagc
cccttctgccagggacccttagataggcctctgagggagctctga

>gi568815596f:98420049_98687605|GENSCAN_predicted_peptide_2|282_aa
MHVYFHPVMFPPQTRQGLQLQEPVDPALEPHRAGTASASRGHAPARHPAPCWGGRVFTGL
GVGRARIVLKRRHRRWTTGTVRQCNFHENRGRRPHPPQRGPGRAPAPAVRPRPRPPRRPR
RLLPGPAHLPGAPRPPAPAAPLSRRAARQSPAAAAAPPPGCEVAGCGPGLRCWSSCPGPR
SPTRRQTSSEPSTRAAALASRRRSRRRAQPRPGPAPLSRMRAEGPARALRPPPHYQPELG
PPSETVGPVTELLAPIPTARFPQDPTLSCDCFTCKEFLNFWT

>gi568815596f:98420049_98687605|GENSCAN_predicted_CDS_2|849_bp
atgcatgtttactttcacccggttatgtttccaccccaaacacggcaaggcctccagctt
caggagcccgtagacccagccctggagccgcatcgggcaggcaccgcgtctgcgtcccgc
ggccacgcgcccgctcggcacccggctccctgctggggcggccgcgtcttcacggggctg
ggggtcggccgggctcggattgttctcaaaagacgccaccgcaggtggaccactggcacc
gttaggcagtgcaatttccatgaaaaccgcgggcgccgtccacatccgccccagcgggga
cctggacgcgcccccgcgcccgctgtccgtccccggccccggcccccgcggcggccgcgt
cgcctcctgcccggcccggcccacctacctggcgctccccggcccccggcgccagccgcc
ccgctgtcccgccgggctgcgcgccagagccccgcagcagcagcagcgccgccgcctggc
tgtgaagtcgcgggctgcgggcccggcctcaggtgctggtcctcctgccccggcccacgg
agcccgacccggcggcaaaccagcagcgaaccctccacgcgcgccgcagccctagccagc
cgccgccgctctagacgccgggcgcagccccgccctggccccgcccccttgtcgcgcatg
cgcgctgagggcccggcccgcgccctgcgacccccgccccactaccagcccgagctgggt
ccgccctctgagacagtagggccggtgaccgagctgctcgcgccaattcctactgctagg
ttcccacaagatcccaccctatcctgtgactgcttcacctgcaaggagttcttgaacttc
tggacttga

>gi568815596f:98420049_98687605|GENSCAN_predicted_peptide_3|576_aa
MASDLVNRCEACDLPQGPISVHQFPRLSNEVTTTVGRNSESPNQRSVSGPWTPREVRRGV
AAAAHGFSGQNAKWAAHIPATAMSFALPYSVLHSPHALTTHPLNPGDAVTMPSWTSTVSA
ANASKGRSFSQCFQSTEEGQERRRPRTSQAKVQQGSTAGKVNELITLAGHIKSQAPHGKR
KPPPRDSHVICSMLRKRKPDETPSPSSDAQFSGMLAVAVSSLLCPRSSPSQQWHCHSPST
TILGSSDVSDTSFQPSTKLGPVHGFPPQSRDCILDFELIRKVTGAWSSLTAGSWETKIQE
FSEEDILAPHRITPGPHSSKSPVHFLASLWGRKTKLYHHTIQDKEPRSVPEAGSSGRRDG
GGAAHSMLCMAPEPDHPGSNPNCRSPATLHADADGNLQREAAKCQGDGTQVSTHVDHTSA
PRQAETGPATGENEKMESHGRSKRGLVGADQCPAETLDSYTLLVGHKKRDKDSAWLRRRK
APPGLLAGAEAQRAMPPCGGKGQTTKEQGLRAQKQRQEKKLHIKNTEVKSMEAEKIISTQ
FQGQRTLMMLGSLDLLSPHVDGVANIADSFTVLTAT

>gi568815596f:98420049_98687605|GENSCAN_predicted_CDS_3|1731_bp
atggcatcagacctggtgaaccgctgtgaagcatgtgaccttccacagggtcccatctct
gtgcatcagtttcctcgcctgtcaaatgaagttacgactactgttggaaggaattcagag
tctcccaatcagaggagtgtgagtggaccctggactcccagggaagtgaggagaggtgtg
gctgcagctgctcatggcttctcaggacagaatgcaaagtgggcggcccacatcccagcc
acagcaatgtcctttgccttgccgtacagtgttttgcattcaccccatgccctcaccacc
caccccctcaacccaggggatgcagtgacgatgccatcatggacttcaacagtgtctgct
gccaatgcaagcaagggacgtagtttctcacaatgtttccaatccacagaagagggtcaa
gagaggaggagaccaagaacttctcaggccaaggttcaacaaggaagcactgcagggaaa
gtgaacgagctcatcactctggctggacacataaaaagtcaggccccacatgggaaacgc
aagcctccaccaagagattcccatgtgatatgcagcatgctcagaaaaaggaaaccggat
gagacacccagcccttcaagtgatgcccaattttctgggatgctcgctgttgctgtgtcc
agcctgctctgcccccggtcttctccatctcagcaatggcactgccattctccaagtacc
accattcttgggagctcagatgtatctgacacctctttccaaccatcaaccaaactgggc
cctgtgcatggcttcccaccccagtcacgtgactgcattctggactttgagctcatcagg
aaagtaactggggcttggtcttccctgactgcaggatcatgggaaacaaagatacaggaa
ttttcagaggaagacatccttgctccacacaggatcacaccaggtccacattcctccaag
tctcctgttcacttcctggcctctctgtggggcaggaagaccaagctgtaccaccacacc
attcaggacaaagagccaagaagcgtgcctgaagcaggcagctcagggagacgggacggc
gggggagctgcacacagcatgctgtgcatggccccagagccagaccatccgggctccaac
cccaactgccgctcacccgccacgctacacgctgatgctgatggcaatctgcaaagagag
gctgccaaatgtcagggtgatggcacccaggtctccacccatgtggaccatacttctgct
ccacggcaggcagaaacaggcccagccacaggagagaacgagaaaatggagtcacatggg
agatcaaaaagaggccttgttggagcagaccaatgcccagctgaaactcttgattcctac
accctccttgtggggcacaaaaagagggacaaggacagtgcctggctgagaaggcgaaag
gccccaccaggcctccttgcaggggccgaagcccagcgggccatgcctccctgcggaggg
aagggacagaccactaaggagcaaggacttagagctcagaagcagcgccaagaaaagaag
cttcacatcaagaacacagaagttaaatccatggaggctgagaagataatttccactcag
ttccaagggcaaaggacactgatgatgctaggttcactggacctgttgtccccacatgtg
gacggtgtcgctaacattgctgacagcttcacagtcttaacagcaacataa

>gi568815596f:98420049_98687605|GENSCAN_predicted_peptide_4|118_aa
MTAREHSPRHGARARAMQRASTIDVAADMLGLSLAGEPHRACTGLQTTGYPGLASFWLNQ
EIGKGDVSPLPVNRFCWSLTTLVFQFGGRGTTRVVWECPSVLLAHAQAMVKALPVVDV

>gi568815596f:98420049_98687605|GENSCAN_predicted_CDS_4|357_bp
atgacagcaagagagcacagccctcgccatggtgccagggcccgtgcaatgcagcgggct
tccaccatcgacgtggcggccgacatgctgggcctctctctggcaggtgagcctcacagg
gcctgcaccgggctccagactacagggtaccctggtttggcatccttctggttgaaccag
gaaatagggaagggggatgtcagccctcttcctgtgaaccgcttctgctggagcctgact
acattagttttccagtttggtggcaggggcaccaccagggtagtgtgggaatgtccaagt
gttttgctggcccatgcccaggccatggtaaaagccctccctgttgtggatgtgtag

>gi568815596f:98420049_98687605|GENSCAN_predicted_peptide_5|878_aa
MSWLRPQQPTQVELSSQLDHLDLDLQREKGARGPGNGGCDNYLVPERVWSAESDRVGNIT
VIGWQMEEKSDQRPPVTRSVDTVNGRLHPPAPTVMHSLSHRHSKKNSNFRALALMVLPVD
ESLTEALGIRSKYASLRKDTLLKSVFGGAICRMYRFPTTDGNHLRILEQMAESVLSLHVP
RQFVKLLLEEDAARVCELEELGELSPCWESLRRQIVTQYQTIILTYQENLTDLHQYRGPS
FKASSLKADKKLEFVPTNLHIQRMRVQDDGGSDQNYDIVTIGAPAAHCQGFKSGGLRKKL
HKFEETKKQVTVAQQLPAVPGQGHKTPPLRSHGIFSLAHDPSSGTSSGCQSIIYIPQDVV
RAKEIIAQINTLKTQVSYYAERLSRAAKDRSATGLERTLAILADKTRQLVTVCDCKLLAN
SIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEK
VWLNVDKSLECIIQRVDKLLQKERLHGEGCEDVFPCAGSCTSKKGEWSEALYPLLTTLTD
CVAMMSDKAKKAMVFLLMQDSAPTIATYLSLQYRRDVVFCQTLTALICGFIIKLRNCLHD
DGFLRQLYTIGLLAQFESLLSTYGEELAMLEDMSLGIMDLRNVTFKVTQATSSASADMLP
VITGNRDGFNVRVPLPGPLFDALPREIQSGMLLRVQPVLFNVGINEQQTLAERFGDTSLQ
EVINVESLVRLNSYFEQFKEVLPEDCLPRSRSQTCLPELLRFLGQNVHARKNKNVDILWQ
AAEICRRLNGVRFTSCKSAKDRTAMSVTLEQCLILQHEHGMAPQVFTQALECMRSEGCRR
ENTMKNVGSRKYAFNSLQLKAFPKHYRPPEGTYGKVET

>gi568815596f:98420049_98687605|GENSCAN_predicted_CDS_5|2637_bp
atgagttggttgaggccccagcagcccacccaggtggagctgtccagtcagcttgaccac
ctggatctggatctccagagagaaaaaggagccagaggcccagggaatggtggctgtgat
aattacctggtgcctgagagagtatggtctgcagagagtgaccgtgtaggtaacatcacc
gtgattggctggcagatggaggagaagtcagaccaacggccccctgtgacccggtctgtg
gacactgtcaatgggaggctacaccctcctgcacccactgtcatgcatagtctctcacac
agacactcaaagaaaaattctaatttcagagccctagcgctgatggttcttcctgtcgat
gagagcttgacggaggcgttaggaatccgatccaaatacgcttcattgcgaaaggacact
ttgctgaaatcggtgttcggtggtgccatctgccgcatgtaccggtttccaaccactgat
ggtaaccatttgcggatcctggagcagatggcagagagcgtgctctccctgcacgtgccc
cggcagttcgtgaagctcctactagaggaagatgcagccagagtgtgtgagctggaggag
ctgggagagctgtccccttgctgggagagcctccggcgccaaattgtcacccagtaccag
accatcatcctcacataccaggagaacctgaccgacctccatcagtacagagggccctcg
tttaaagcaagcagtttgaaagcagataaaaagttagaatttgttcccacaaacttgcac
atacaaaggatgagagttcaagacgatggaggatcagatcagaactacgacatcgtcacc
attggggcgccagcagcacactgccaaggttttaagtcaggaggtctccgcaaaaagctg
cacaaatttgaagagaccaagaaacaagtgactgttgctcagcagttaccagcagttcct
ggccagggccacaagactccacccttacgctctcatggcatcttcagtttagcccatgac
cccagctctggtacatcatctggctgccagtccataatctacataccccaggatgttgtc
agagccaaggagatcatcgcccagatcaacaccctgaaaacccaagtgagttactacgca
gagcggctgtcaagggcagccaaggacaggtctgccactggccttgagaggacactcgcc
atcttggcagacaagacacggcagctggtcacggtctgcgactgcaagctcctggccaac
tccatccatgggctgaacgctgcacggcctgactacattgcctccaaggcctctcccact
tcgactgaggaggagcaggtgatgcttagaaatgaccaggacaccctcatggcccggtgg
acagggagaaacagccgatcttccctgcaggtggactggcacgaggaggagtgggagaaa
gtgtggctgaacgtggacaagagcctagagtgcatcattcagcgtgtggacaagctgctg
cagaaggagcggctgcatggcgagggctgtgaggatgtcttcccctgtgcaggcagctgc
accagcaagaaaggtgaatggagtgaggccctttacccgctgctgaccactctcaccgac
tgcgtggccatgatgagtgacaaggccaagaaggccatggtattcctgctcatgcaggac
agcgcgcccaccatagccacctacctgagcctgcagtaccgccgtgacgtggtcttctgc
cagacgctgaccgccctcatctgcggcttcatcattaagctgaggaactgcctgcatgac
gacggcttcctgcgccagctctacaccatcgggctgctggcccagttcgagagcctgctg
agcacctacggggaggagctggcaatgctggaggacatgagccttgggatcatggacttg
aggaacgtgaccttcaaagtcactcaggccacttccagcgcctccgcagacatgctgccc
gtcatcacaggaaatcgcgacgggtttaacgtgcgggtccctctgccgggcccgctgttt
gacgccttgccccgggagatccagagtggcatgctgctgcgagtgcagcccgtcctcttc
aacgtgggcatcaatgagcagcagacactggccgagaggtttggcgatacgtctttacaa
gaagtcatcaacgtggagagtttggtgcggttaaattcctactttgagcagtttaaggaa
gttttgcctgaggattgcctgcctcggtctcgcagtcagacgtgcctgccagagctgctg
cggtttctgggtcagaacgtgcatgcccggaagaataagaacgtcgacattctctggcaa
gctgctgagatctgccgccgccttaatggggtccggttcaccagctgcaagagcgctaag
gaccgtacagccatgtcggtgacactggagcagtgcctgatcctgcaacacgagcatggc
atggccccgcaggtcttcacccaggccctggagtgcatgcgcagtgagggttgtcgaaga
gaaaatacaatgaagaatgttggaagtcgcaaatatgcatttaattccctgcagctgaag
gctttccccaagcattacaggcctcccgaagggacttacggaaaagttgaaacgtga

>gi568815596f:98420049_98687605|GENSCAN_predicted_peptide_6|83_aa
MEYYAAIKNDEFMSFVGTWMKLEIIILSNAFSVMWDNSVKDLPGLACKAHRGLMQEEIMA
KTGFQWRESRVWVFSGYTLGTLS

>gi568815596f:98420049_98687605|GENSCAN_predicted_CDS_6|252_bp
atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg
aaattggaaatcatcattctcagtaatgctttttctgttatgtgggacaacagtgtgaaa
gacctgcctggactggcctgcaaggcccacaggggcctgatgcaggaagaaatcatggca
aagacagggttccagtggagggaatctagagtatgggtgttttcaggctacaccctggga
accctgtcgtag

>gi568815596f:98420049_98687605|GENSCAN_predicted_peptide_7|95_aa
MPKYYEDKPQGGACAGLKEDLGACLLQSDCVVQEGKSPRQCLKEGYCNSLKYAFFECKRS
VAMDWYRSMAWRLGTPNIEFLGSDIDYASLPYKLT

>gi568815596f:98420049_98687605|GENSCAN_predicted_CDS_7|288_bp
atgcctaagtattatgaggacaagccgcagggcggcgcgtgcgcgggcctgaaggaggac
ctgggcgcgtgtctgctgcagtcggactgtgtggtccaggaaggaaaatcacctcggcag
tgtttgaaggaaggatactgcaactctttgaagtacgcattttttgagtgtaaaagatca
gtggccatggactggtacaggtctatggcctggaggttagggacccccaatatagaattt
ttggggagtgacattgactatgcaagtttgccttacaaactgacttga

>gi568815596f:98420049_98687605|GENSCAN_predicted_peptide_8|317_aa
XKLIAYQREFLALKERLRIAEHRISQRSSELNTIVQQFKRVGAETNGSKDALNKFSVSIV
MGIPTVKREVKSYLIETLHSLIDNLYPEEKLDCVIVVFIGETDIDYVHGVVANLEKEFSK
EISSGLVEVISPPESYYPDLTNLKETFGDSKERVSVLHNANRGAGKMFQAPDLTLIVEFI
FMFYKEKPIDWLLDHILWVKVCNPEKDADKDYMKPLLLKIHVNPPAEVSTSLKVYQGHTL
EKTYMGEDFFWAITPIAGDYILFKFDKPVNVESYLFHSGNQEHPGDILLNTTVEVLPFKR
HICVEMSTDIALYLAAT

>gi568815596f:98420049_98687605|GENSCAN_predicted_CDS_8|954_bp
naaaaactgattgcttatcaacgagaattccttgctttgaaagaacgtcttcgaatagct
gaacacagaatctcacagcgctcttctgaattaaatacgattgtgcaacagttcaagcgt
gtaggagcagaaacaaatggaagtaaggatgcgttgaataagttttcagtttcaatagtc
atgggcattcccacagtgaagagagaagttaaatcttacctcatagaaactcttcattcc
cttattgataacctgtatcctgaagagaagttggactgtgttatagtagtcttcatagga
gagacagatattgattatgtacatggtgttgtagccaacctggagaaagaattttctaaa
gaaatcagttctggcttggtggaagtcatatcaccccctgaaagctattatcctgacttg
acaaacctaaaggagacatttggagactccaaagaaagagtaagcgttcttcacaatgcc
aaccgtggtgctggtaaaatgtttcaagcgccggatcttactctgattgtagaattcata
ttcatgttttacaaggagaaacccattgattggctcctggaccatattctctgggtgaaa
gtctgcaaccctgaaaaagatgcagataaagattatatgaaaccattacttcttaaaatc
catgtaaacccacctgcggaggtatctacttccttgaaggtctaccaagggcatacgctg
gagaaaacttacatgggagaggatttcttctgggctatcacaccgatagctggagactac
atcttgtttaaatttgataaaccagtcaatgtagaaagttatttgttccatagcggcaac
caagaacatcctggagatattctgctaaacacaactgtggaagttttgccttttaagcgt
cacatctgtgttgagatgtccactgacatagccttatacctggcagccacatag
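
The exon table and the sequence headers in this report can be cross-checked against each other. The following is a minimal Python sketch of such a check, not part of GENSCAN itself. The function names (parse_exon_row, span_matches_length, phase_matches_length, cds_matches_peptide) are hypothetical, and the sketch assumes whitespace-separated rows of 13 fields for exons and 7 fields for Prom/PlyA features, as in the table above. The three relations it tests (Len = |End - Begin| + 1, Ph = Len mod 3, and CDS length = 3 x (peptide length + 1), the extra codon being the stop) are observations that hold for the rows and headers in this report.

```python
def parse_exon_row(line):
    """Parse one row of the 'Predicted genes/exons' table into a dict.

    Assumes whitespace-separated fields: 13 for exon rows
    (Init/Intr/Term/Sngl), 7 for Prom/PlyA rows, which carry only a score.
    """
    fields = line.split()
    gene, exon = fields[0].split(".")
    row = {
        "gene": int(gene),
        "exon": int(exon),
        "type": fields[1],
        "strand": fields[2],
        "begin": int(fields[3]),
        "end": int(fields[4]),
        "length": int(fields[5]),
        "score": float(fields[-1]),  # Tscr.. column
    }
    if len(fields) == 13:
        row["frame"] = int(fields[6])         # Fr
        row["phase"] = int(fields[7])         # Ph
        row["probability"] = float(fields[11])  # P....
    return row


def span_matches_length(row):
    """Check Len == |End - Begin| + 1 (minus-strand rows list Begin > End)."""
    return abs(row["end"] - row["begin"]) + 1 == row["length"]


def phase_matches_length(row):
    """Check Ph == Len mod 3 for exon rows; features without Ph pass."""
    return "phase" not in row or row["phase"] == row["length"] % 3


def cds_matches_peptide(peptide_aa, cds_bp):
    """Check that the CDS is three bases per residue plus one stop codon."""
    return cds_bp == 3 * (peptide_aa + 1)


if __name__ == "__main__":
    term = parse_exon_row(
        " 3.13 Term -  47592  47413  180  0  0   -6   38   137 0.170  -3.09"
    )
    print(term["type"], span_matches_length(term), phase_matches_length(term))
    print(cds_matches_peptide(54, 165))
```

The sizes passed to cds_matches_peptide are the values that follow the last "|" in each FASTA header, for example 54_aa and 165_bp for gene 1.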