GENSCAN 1.0 Date run: 30-Jan-118 Time: 11:04:51 Sequence gi568815589f:126240508_126603260 : 362753 bp : 46.56% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5518 5530 13 0 1 61 116 9 0.217 1.29 1.02 Intr + 32482 32544 63 2 0 107 65 28 0.017 1.09 1.03 Intr + 54292 54474 183 2 0 71 86 48 0.043 2.66 1.04 Intr + 64518 64692 175 1 1 75 20 128 0.056 3.70 1.05 Term + 66174 66303 130 0 1 73 47 85 0.228 0.55 1.06 PlyA + 67930 67935 6 -0.45 2.00 Prom + 69301 69340 40 -2.96 2.01 Init + 72275 72332 58 1 1 97 73 36 0.549 4.47 2.02 Intr + 86207 86343 137 0 2 37 85 69 0.251 1.79 2.03 Intr + 86684 86789 106 1 1 29 94 121 0.481 6.59 2.04 Intr + 96348 96436 89 1 2 17 69 64 0.001 -2.91 2.05 Intr + 100001 100123 123 1 0 70 86 132 0.005 11.98 2.06 Intr + 140557 140664 108 0 0 85 101 54 0.205 6.88 2.07 Intr + 146055 146151 97 2 1 84 94 129 0.448 12.68 2.08 Intr + 151559 151688 130 0 1 99 92 103 0.981 11.55 2.09 Intr + 153043 153208 166 1 1 38 69 123 0.243 5.36 2.10 Intr + 159001 159145 145 0 1 75 80 14 0.035 -0.84 2.11 Intr + 161731 161794 64 2 1 84 99 26 0.149 1.08 2.12 Intr + 167913 168047 135 0 0 78 110 11 0.020 1.98 2.13 Intr + 181347 181441 95 0 2 128 91 50 0.263 8.91 2.14 Intr + 188356 188495 140 2 2 82 80 53 0.144 4.08 2.15 Intr + 208608 208895 288 2 0 64 41 165 0.438 6.84 2.16 Intr + 209525 209655 131 1 2 93 64 37 0.571 1.29 2.17 Intr + 211674 211818 145 0 1 63 98 27 0.500 1.48 2.18 Intr + 228262 228446 185 0 2 114 33 53 0.031 0.99 2.19 Intr + 229930 230020 91 1 1 92 59 33 0.010 0.80 2.20 Intr + 240486 240570 85 2 1 84 1 86 0.019 -1.11 2.21 Intr + 240862 240917 56 2 2 123 84 18 0.130 3.60 2.22 Intr + 243466 243589 124 0 1 89 27 94 0.110 3.56 2.23 Intr + 246792 246839 48 1 0 76 69 52 0.113 0.75 2.24 Term + 248030 248190 161 0 2 73 49 158 0.356 8.50 2.25 PlyA + 250782 250787 6 1.05 3.16 PlyA - 251076 251071 6 1.05 3.15 Term - 255364 255296 69 1 0 82 48 49 0.267 -1.76 3.14 Intr - 257411 257176 236 0 2 51 80 134 0.403 6.31 3.13 Intr - 258294 258251 44 2 2 118 64 14 0.137 -0.12 3.12 Intr - 264066 263867 200 0 2 83 42 137 0.190 6.75 3.11 Intr - 264504 264242 263 1 2 75 53 103 0.124 2.71 3.10 Intr - 273291 273195 97 0 1 90 47 49 0.243 0.58 3.09 Intr - 274217 274083 135 2 0 63 27 81 0.170 0.26 3.08 Intr - 281407 281350 58 0 1 81 81 51 0.319 2.59 3.07 Intr - 285201 284976 226 1 1 93 89 69 0.383 4.54 3.06 Intr - 316981 316907 75 2 0 45 86 63 0.043 1.29 3.05 Intr - 317820 317679 142 0 1 23 72 101 0.005 1.93 3.04 Intr - 328566 328486 81 0 0 59 60 83 0.021 2.43 3.03 Intr - 332073 331959 115 2 1 95 86 30 0.030 3.95 3.02 Intr - 343001 342916 86 2 2 113 81 14 0.072 1.82 3.01 Intr - 348495 348365 131 1 2 76 38 108 0.176 5.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100013 100123 111 1 0 94 86 136 0.963 14.21 S.002 Init - 126671 126543 129 2 0 79 91 110 0.993 10.55 S.003 Term - 130754 130583 172 1 1 62 50 186 0.986 9.50 S.004 Init - 168565 168460 106 1 1 71 86 74 0.833 5.88 S.005 Init + 207451 207518 68 0 2 45 94 137 0.894 8.55 S.006 Init - 323981 323912 70 2 1 100 71 70 0.917 7.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:126240508_126603260|GENSCAN_predicted_peptide_1|187_aa MTTEGNFQDLLFILGVLKFHNNMPKYEWAPIPSPVNDAAHICRGSSIKTLYQQNHGSPWP AIGPQLPCHATPLELTLDCLPETGRQVVSMPMSTEFQWTHNEWESNDCQSNQKASVLLLR LSGMLKAPRGHTPSANQNLLQTERRKLGQTQAEEGAVGNVALGFMSAGVHAASRALCKAV KAIVGLG >gi568815589f:126240508_126603260|GENSCAN_predicted_CDS_1|564_bp atgacaacagaaggaaatttccaggatcttctctttatacttggtgttctgaaatttcac aacaatatgcctaagtatgaatgggctcccatcccgagtcccgtgaatgatgcagcccac atctgcagagggagctcgataaagaccctttatcagcagaaccatggctccccatggcca gccatcggaccccagctcccctgtcacgctactcccctggagctgaccctggactgcctg ccagagacagggaggcaagtggtttccatgcccatgagcacggaattccagtggacccac aatgagtgggagtccaatgactgtcaaagcaatcagaaggcaagcgtgttacttctgcgg ctaagtgggatgctgaaagccccaagaggccacactccctctgcaaaccagaacttgctt caaacagaaaggaggaaattgggccaaactcaggctgaggaaggggctgtgggaaatgtg gccctgggcttcatgtctgcaggggtccatgctgcctccagggctctgtgcaaggctgtg aaggcgatagtgggtctggggtga >gi568815589f:126240508_126603260|GENSCAN_predicted_peptide_2|968_aa MKEHTNMLKGPRSLYHRWQAPAPGGPEPGPPPRPAEGAAAATAVVAAAARPRPAPPPPPP PPPLSSPERAAAAGARARPGGERREKLGDPSASSAPKARAALEIISQRLYARLGERSEWN HLDLMTCEVKDQSTMPEVKDLSEALPETSMDPITGVGVVASRNRAPTGYDVVAQTADGVD ADLWKDGLFKSKVTRYLCFTRSFSKENSHLGNVLVDMKLIDIKDTLPVGFIPIQETVDTQ EVAFRKKRLCIKFIPRDSTEAAICDIRIMGRTKQAPPQYTFIGPEQEVGSAPLELKDKPR LVLLLKYGGAVKSQKGGEWMPAVQKEIDVHCLQSTEIQGAKRGNTFLASIRAAPPPSVTC RASPVSAGKKRFRCCWDRGWEEPQEAVINCDMDFELHFHSGKKWFGGRPDGSGSGVCACS GPRQGAGAAPSFCLAIHFLLKVATANGAFSTRGHISLTLPATFRGRNSTRTDYEYQHSNL YAISGVPVALVFSGYVLVARGGIAFLTLASLRFFAPQRIFVITDNWLKRWWGHSFTGQLL RKLPCAQPWGLDWQRAPGSEWELRAHGAPTHQLDATCHRYKYNLQKSDVLVPLLRRLGGR KHVLQSTEPSWQMATAVEKTRRPFPLQAEHSMRDALGLLTSNIRVNRESSAVKFMSQSGL AYAQSLGGPIRHAQPGLPGRMCILNNRPSLIPGVHVEAIQPALSPNYLLVILLESGTQKL KDPLVAPQGPHDVLQSAENGHQDSWQAGSWLSLDPTSMQQLKRLRHRALRIDVETQSIEK NFQRTALDLHEVVQPSTAPWTLEGGTALKEEPVPASCVTFILLPAPGDSPVTDQRPDSRA MDGVPFMISEKFSCVPESMQPFDLLGITIKSLAEIEKEVSKPSGEGRAGLGHRLHTGVSP TTVTIVTVLLVHREGRSLCTTWEEGGSTHLRKSGLKKLSDWPDSHSSNPELNRGLSVTLA EYMTVFIY >gi568815589f:126240508_126603260|GENSCAN_predicted_CDS_2|2907_bp atgaaggagcacaccaatatgctcaagggccccagaagcctctaccacagatggcaagcg ccagcgccgggaggcccggagcccggcccgccgccgcggcccgcggagggagccgccgcc gccactgccgttgtcgccgccgccgccaggccgcgccccgcccctccgccgccgccgccg ccgccgccgctctcgagccccgagcgcgccgcggctgccggtgcccgggctcggcctggc ggcgagaggcgggagaagctgggggacccgagtgccagcagcgcccccaaggcccgggca gccttggaaattatctcacagaggctgtatgcccgcttaggggagaggtcagaatggaac catctggacctgatgacctgcgaggtgaaagaccagtccaccatgcctgaagtcaaagac ctctcagaagccttgccagaaacgtcaatggatcccatcacgggagtcggggtggtggct tctcggaaccgagccccgacaggctatgacgtagttgcacagacagcagatggtgtggat gctgacctctggaaagacggcttatttaaatccaaggttaccagatacctgtgtttcaca agatcattttccaaagaaaatagtcatctggggaacgtgttagtagatatgaagctcatt gacatcaaggacacactgcctgtgggcttcatcccaattcaggagacggtggacacacag gaagtggcttttaggaagaagaggctgtgcattaaatttattccacgggattcaacggaa gctgcgatttgtgacattcggatcatgggccggaccaagcaggccccgcctcagtacacg tttattgggcctgagcaggaagtaggatcagctccattggaactcaaggacaaaccacga ctggtgctgctcctaaaatatggaggggcagtcaaaagtcagaaagggggtgagtggatg ccagctgtccagaaggaaatagatgttcactgtcttcagtcaactgaaatccagggggcc aagagaggaaacaccttcctggcctctatcagagcagcccctccaccgtctgtcacatgc agggcgtcacctgtttctgctgggaagaagaggttccgctgctgctgggacagaggttgg gaagagccccaggaggcagtaataaattgtgacatggattttgagctccacttccatagt ggcaagaaatggtttgggggcaggcctgatggatccggatccggggtctgtgcctgcagt gggcccaggcagggggctggggcggctccctccttctgcctggcgatccacttcctgctg aaagtggccactgcaaatggagccttttccacaagagggcacatctccctaacacttcct gccaccttccgaggcaggaacagcacccggacggactacgagtaccagcactccaatttg tatgccatatcaggggtccctgtggcccttgtcttcagtggatatgttctcgtggctcgg ggtggcattgcatttctgaccctggcttccctgaggttcttcgctccacagaggattttc gtaatcaccgacaactggctcaagcggtggtgggggcattcgttcactgggcagttactg aggaaactcccatgtgcccagccctggggcctggactggcagagagccccgggcagtgag tgggagctcagagcacacggggcccccacacaccaactggatgcgacctgccaccggtac aaatacaacctgcagaagtcggatgtgctggtccctctgttgagaaggctcggtggccgc aagcacgtattgcagtccacagaaccatcctggcagatggcaacggctgtagagaagacc cgcaggccctttcccttgcaggctgagcactccatgagagatgcccttggcctcctcaca tctaacatcagagttaaccgtgagagctctgcagtcaagttcatgtcgcagtctggtctc gcctatgcccaaagcctgggagggcccatcaggcacgcgcaaccagggctcccagggaga atgtgcattttaaacaatagacccagcctaattcctggtgttcacgtggaggccattcag ccagccctgagccccaactacttgctggtcattctacttgaatctgggacacaaaagctc aaagaccctttggtggctccccagggacctcatgatgtccttcagtctgctgagaatggc catcaggactcctggcaggctggttcttggttgtctctggaccctacttctatgcaacag ttaaaaaggctgagacacagagctctaagaattgatgtggaaacacagtcaattgaaaaa aactttcagagaactgccctggacctgcatgaggttgtgcagcccagcactgcaccctgg accctggagggagggactgccctgaaggaggagcctgtccctgccagctgtgtcaccttc atcctgctgccagcccctggtgactctcccgttacagatcagagaccagactccagggca atggatggtgtgccttttatgatttcagagaagttttcttgtgttccagaaagtatgcag ccctttgatctcctgggaatcaccatcaaatctctagcagaaatcgaaaaagaggtaagc aagccatcaggggaggggagggcaggcctgggccatcgcctgcacaccggcgtctctccc accaccgtcaccatcgttaccgttctccttgtccacagagaagggcgatccctctgcacc acgtgggaggaaggggggtccacccatttacggaagtctggactcaagaagttaagtgac tggccagattcccacagcagcaacccagagctgaacaggggtctgagtgtgaccctggct gaatacatgaccgtcttcatctactga >gi568815589f:126240508_126603260|GENSCAN_predicted_peptide_3|652_aa XTSDAASVMMAATNPHGSTWPAVHVELLLPVDFLCSDDLGGNGKGDLAGQRIIADEDTQV PRAQAFCPRSHSQPHLTVGTTATQLENLMQWEQLDHGVAGAEWQHSTAKGKGKKRAMVLA GVIDPDHQEDIGRCSTMEAGPQKTLAELFDPLPRGQMKPGASCDAVPHTSQQMAAENKAP DRWSPGPMPSKPDLLDRRKKCEEWVSGSSQGMACGGEATLEQLHAPVRKSSFSSSRLSHT TPLLGGRKLHIPARVPGRTFFPRPTIDKSSWTSWSLHDDSVQRESRPGPVGLNTDTGQGK LTERLGRVELGGLLGPLQSTRIPDQSERGWARPRSLWEPPQVHEVRTPQGAKYVMGLLNS SSRKPWLLLAAVIPEPQGHIKGGHHPRSHGRHRCPLWVPQGSSGPQNHTGLCERSIGSQP GSPLPAQHQSHDKVLLLCHPGVLRRPPPPGASSGLREGEVAASLIHRNQELALLWAAQDL QQEAVPAEAEASTDAGSAAPNRSIKPPHPRANSWGPKDVRSPYSEESGHSPVPDTTESHT KHSRGLQTRKGSSTSVGVGVLTRTPGDSEQLFGLETLHVFFQQLQGQLKSLLPTWAMVLE TLERLHTQREAPAPESVRKCFTRRKAVHEQTVYKLQVHTVLACAVFHNSHKH >gi568815589f:126240508_126603260|GENSCAN_predicted_CDS_3|1959_bp nggacaagcgatgctgcttctgtcatgatggctgccactaatccacatggctccacttgg cctgctgtccatgtggagctgctcctccctgtggacttcctgtgctctgatgaccttgga ggaaacggcaagggagatctggctggccagaggataattgctgatgaggacactcaggtt ccgagagctcaggctttttgcccaaggtcacacagccagccacaccttacagtgggaacc acagccactcaattggaaaacctaatgcagtgggagcaactggatcatggggtggcaggg gccgaatggcagcactcaactgccaaaggcaagggaaagaagcgagctatggtgttggct ggggtgattgatccagatcaccaagaggacattgggcgctgctccacaatggaggcaggt cctcagaaaacacttgcagaattatttgatcctcttccaaggggacaaatgaagcctggt gccagctgtgatgcagtgccccataccagccagcagatggcagcagagaacaaggcacct gaccggtggagcccaggtccaatgccctccaagccagatcttttggatcggaggaagaag tgtgaggaatgggtctcagggtcatctcaagggatggcgtgtggtggggaagccaccctg gagcagcttcatgctcccgtccgaaagagctccttcagcagctcgcggctgagccacacg acgcctctgctgggaggcaggaagttgcatattcctgcccgtgtacccggcagaacgttc tttccacgtcctaccattgacaaatcctcttggacttcctggtcacttcatgatgacagc gtgcagagggagagcaggcctgggccagtgggcttgaacaccgatactggccagggcaag ctgacggagaggctggggcgggtggagctggggggcttgctcgggcctctccagtcgacc cgcattcccgaccaaagcgagcgcggctgggcccggccccgcagtctatgggagccaccc caggtccacgaggtccggacgccgcagggggccaagtatgtaatgggtttgctcaattcc tcctccagaaagccctggttgctgctggctgctgttatcccagagccccaggggcacatc aagggagggcaccacccgaggagccacggcagacataggtgccccctgtgggtgccacag ggctcctcaggtccccagaaccacactggcttatgtgaacggagcatcggcagccagcct ggcagccccctccctgctcagcatcagtcccatgacaaggtcctgctgctgtgccaccca ggagtcctgagaaggccacccccaccaggggcatcctcgggtctcagggagggagaagtg gccgcttcccttatccacagaaatcaagagctagcactactgtgggctgcccaggacctg cagcaggaggccgtccccgctgaggcagaggcttccacggacgcaggtagtgcagccccc aaccgcagcatcaagcctccacatccaagagccaactcttggggacccaaagatgtcagg tccccatactctgaggaatcaggacacagcccagtgcctgacaccacagagtctcatact aaacactctagagggctccagacgagaaaaggcagcagtacttcagtgggtgtgggcgtt ttgacaaggaccccaggtgactctgagcaactgttcggtttggaaaccctgcacgtcttc ttccagcagctgcagggccagctgaagtccttgcttcctacttgggccatggttctagag acattggagaggctgcatacccagcgggaagccccagctccagaaagcgtgagaaagtgc ttcactaggaggaaggctgttcacgagcagacagtgtacaagctgcaggttcacacagtg ctggcctgtgcggtgtttcacaatagtcataaacactga