GENSCAN 1.0 Date run: 3-Nov-116 Time: 09:09:05 Sequence gi568815597r:30833214_31159566 : 326353 bp : 45.70% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 1468 1463 6 1.05 1.04 Term - 3629 3425 205 1 1 94 32 91 0.284 0.94 1.03 Intr - 7576 7560 17 1 2 97 94 10 0.179 -3.16 1.02 Intr - 13143 12970 174 0 0 52 100 109 0.486 8.64 1.01 Init - 14123 13992 132 2 0 46 76 83 0.247 3.04 1.00 Prom - 17592 17553 40 -3.86 2.00 Prom + 19348 19387 40 -4.16 2.01 Init + 27713 27730 18 1 0 82 82 22 0.751 1.10 2.02 Term + 29148 29348 201 2 0 64 44 172 0.835 7.79 2.03 PlyA + 34093 34098 6 1.05 3.09 PlyA - 36273 36268 6 1.05 3.08 Term - 40164 39998 167 1 2 151 40 284 0.525 27.98 3.07 Intr - 41375 41084 292 2 1 82 66 365 0.996 30.41 3.06 Intr - 43952 43339 614 0 2 109 94 512 0.975 46.00 3.05 Intr - 45527 45410 118 2 1 100 87 196 0.995 20.74 3.04 Intr - 47066 46976 91 1 1 99 52 54 0.398 2.90 3.03 Intr - 48482 48305 178 0 1 37 15 142 0.343 0.78 3.02 Intr - 54095 53990 106 2 1 13 105 59 0.309 -0.11 3.01 Init - 56236 56183 54 1 0 113 36 65 0.475 5.08 3.00 Prom - 66680 66641 40 -2.76 4.03 PlyA - 66807 66802 6 -4.04 4.02 Term - 67221 67060 162 0 0 60 43 165 0.901 7.14 4.01 Init - 75373 75236 138 1 0 111 84 231 0.997 23.14 4.00 Prom - 80859 80820 40 -4.76 5.20 PlyA - 83543 83538 6 1.05 5.19 Term - 100129 99998 132 1 0 111 54 205 0.986 17.49 5.18 Intr - 103622 103430 193 1 1 66 102 346 0.999 33.29 5.17 Intr - 108059 107938 122 2 2 102 102 28 0.997 4.89 5.16 Intr - 108910 108785 126 1 0 103 72 66 0.871 7.38 5.15 Intr - 112270 112133 138 1 0 63 111 90 0.995 9.46 5.14 Intr - 117048 116920 129 0 0 9 91 157 0.592 8.99 5.13 Intr - 119150 119021 130 1 1 81 74 132 0.897 11.80 5.12 Intr - 120768 120501 268 1 1 72 109 220 0.991 19.09 5.11 Intr - 133065 132769 297 1 0 94 75 152 0.916 11.35 5.10 Intr - 134097 133954 144 1 0 56 99 74 0.898 5.55 5.09 Intr - 135279 135141 139 0 1 98 80 101 0.969 10.34 5.08 Intr - 141589 141438 152 2 2 97 99 103 0.965 12.18 5.07 Intr - 146965 146849 117 2 0 69 66 107 0.955 7.04 5.06 Intr - 148192 148099 94 1 1 104 123 122 0.997 16.84 5.05 Intr - 159447 159177 271 2 1 81 84 270 0.963 23.44 5.04 Intr - 162007 161841 167 1 2 54 58 274 0.963 19.76 5.03 Intr - 173889 173781 109 2 1 71 89 59 0.028 4.59 5.02 Intr - 226095 225991 105 2 0 58 98 88 0.522 6.13 5.01 Init - 232572 232403 170 0 2 66 86 170 0.886 11.65 5.00 Prom - 243779 243740 40 -4.16 6.04 PlyA - 244159 244154 6 1.05 6.03 Term - 261880 261774 107 2 2 17 35 166 0.147 2.77 6.02 Intr - 267547 267455 93 2 0 64 25 97 0.164 0.94 6.01 Init - 283196 283145 52 2 1 61 82 59 0.185 3.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 173840 173781 60 2 0 97 89 33 0.818 5.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:30833214_31159566|GENSCAN_predicted_peptide_1|175_aa MAFVCAFLSDSAERPTSNSPSSPQNPMRQLHDYPYFTDEETEPKGSCEAKTTDSNYTDVY AYLSIFQALQEDPISLLIKEKLCFSAAKKCLKATTSQTVMEKMRRLLSFQTECDKSHSVS ADKSTPKDDQVLGPQGQSENSLSDFTIVKTKVCNQVICMNPMHVMDFGASVVDQC >gi568815597r:30833214_31159566|GENSCAN_predicted_CDS_1|528_bp atggcttttgtctgtgccttcctgtcagactctgctgaacgccctacaagtaacagccca tccagtcctcagaaccctatgaggcagctccacgattacccctattttacagatgaggaa actgagccaaaggggagctgtgaagccaaaaccactgattcaaattacacagatgtgtat gcctacctctcaattttccaggcattacaggaggaccccatctccttgcttatcaaagag aagctctgtttttctgctgctaaaaaatgtctaaaagccacaaccagtcaaaccgttatg gagaagatgagaaggctgctcagcttccagacagagtgtgacaaatctcacagtgtctca gccgataaatcaacccccaaagatgatcaagttctaggcccccaaggtcagagtgaaaac tcattgagtgattttactattgtgaaaacaaaagtctgcaatcaagttatctgcatgaat cctatgcatgtaatggactttggggcctctgttgtcgaccagtgttaa >gi568815597r:30833214_31159566|GENSCAN_predicted_peptide_2|72_aa MTSIIQSVLLLLIKKVNRKTASGRFFRRYPEEGTVTTGDDSSVHAIAPEDLPVGQDLEVE DSDTDEPDTVEA >gi568815597r:30833214_31159566|GENSCAN_predicted_CDS_2|219_bp atgacttcgatcatccagagtgtactccttctacttattaaaaaagttaaccgtaagaca gcctcaggcaggttcttcaggaggtatccagaagaaggcactgttaccacaggagatgac agctccgtgcatgctattgcccctgaagaccttccagtgggacaagatctggaggtagaa gacagtgatactgacgaacctgacaccgtggaggcctag >gi568815597r:30833214_31159566|GENSCAN_predicted_peptide_3|539_aa MGPPGMEEKRMEFVRADHVVPRRLLGPRSGQPLPCLLAVTVMEPGDRLLEAPAGPQQQQA EVSRVPWERSAAPLGPDAQAFMGRLRSLCWVSPPEISGGRKQKVQCFHRARSLSPVISPS PVPPHLCGELQLLVVSMEKMVVMAQRWRSENFERPVDLEGSGDDDSFPDDELDDLYSGSG SGYFEQESGIETAMRFSPDVALAVSTTPAVLPTTNIQPVGTPFEELPSERPTLEPATSPL VVTEVPEEPSQRATTVSTTMATTAATSTGDPTVATVPATVATATPSTPAAPPFTATTAVI RTTGVRRLLPLPLTTVATARATTPEAPSPPTTAAVLDTEAPTPRLVSTATSRPRALPRPA TTQEPDIPERSTLPLGTTAPGPTEVAQTPTPETFLTTIRDEPEVPVSGGPSGDFELPEEE TTQPDTANEVVAVGGAAAKASSPPGTLPKGARPGPGLLDNAIDSGSSAAQLPQKSILERK EVLVAVIVGGVVGALFAAFLVTLLIYRMKKKDEGSYTLEEPKQASVTYQKPDKQEEFYA >gi568815597r:30833214_31159566|GENSCAN_predicted_CDS_3|1620_bp atggggcccccgggaatggaggaaaagaggatggaatttgtacgtgctgaccacgttgtt cctcggagactcctgggccccaggagcgggcaacccttgccgtgtttgctggcagtgacc gtcatggagcctggagacagactcctagaggcacctgctggccctcagcagcagcaggca gaggtctctagagtgccctgggaacgctcagcggctcccctgggacctgatgctcaggcc ttcatgggccgcttgcggagcctctgctgggtgtcccctcccgagatcagtggtggccgg aagcagaaggttcagtgcttccaccgggcacgctcgctttcccccgtgatctctccctct cctgtaccaccccatctctgtggggagctgcagctcctggtcgttagcatggagaagatg gtcgtcatggcccagcgctggcgcagtgagaacttcgagagacccgtggacctggagggc tctggggatgatgactcctttcccgatgatgaactggatgacctctactcggggtcgggc tcgggctacttcgagcaggagtcgggcattgagacagccatgcgcttcagcccagatgta gccctggcggtgtccaccacacctgcggtgctgcccaccacgaacatccagcctgtgggc acaccatttgaagagctcccctctgagcgccccaccctggagccagccaccagccccctg gtggtgacagaagtcccggaagagcccagccagagagccaccaccgtctccactaccatg gctaccactgctgccacaagcacaggggacccgactgtggccacagtgcctgccacagtg gccaccgccacccccagcacccctgcagcacccccttttacggccaccactgctgttata aggaccactggcgtacggaggcttctgcctctcccactgaccacagtggctacggcacgg gccactacccccgaggcgccctccccgcccaccacggcggctgtcttggacaccgaggcc ccaacacccaggctggtcagcacagctacctcccggccaagagcccttcccaggccggcc accacccaggagcctgacatccctgagaggagcaccctgcccctggggaccactgcccct ggacccacagaggtggctcagaccccaactccagagaccttcctgaccacaatccgggat gagccagaggttccggtgagtggggggcccagtggagacttcgagctgccagaagaagag accacacaaccagacacagccaatgaggtggtagctgtgggaggggctgcggccaaggca tcatctccacctgggacactgcccaagggtgcccgcccgggccctggcctcctggacaat gccatcgactcgggcagctcagctgctcagctgcctcagaagagtatcctggagcggaag gaggtgctcgtagctgtgattgtgggcggggtggtgggcgccctctttgctgccttcttg gtcacactgctcatctatcgtatgaagaaaaaggatgagggcagctacacgctggaggaa cccaagcaggcgagcgtcacataccagaagcctgacaagcaggaggagttctatgcctag >gi568815597r:30833214_31159566|GENSCAN_predicted_peptide_4|99_aa MKPGPPHRAGAAHGAGAGAGAAAGPGARGLLLPPLLLLLLAGRAAGAWMPLLGGVCSCIL PGSCLQDAPCVRETDTSTDDDEEDDEKDNGQYLLLHVMC >gi568815597r:30833214_31159566|GENSCAN_predicted_CDS_4|300_bp atgaagccggggccgccgcaccgtgccggggccgcccacggggccggcgccggggccggg gccgcggccgggcccggggcccgcgggctgctcctgccaccgctgctgctgctgctgctg gcggggcgcgccgcgggggcctggatgccattgttggggggtgtctgcagctgcatctta cctggttcctgccttcaagatgctccctgtgtcagggagacagacacaagcacagatgat gacgaggaagatgatgagaaagataacggccaatacttactgctgcatgtcatgtgctag >gi568815597r:30833214_31159566|GENSCAN_predicted_peptide_5|1000_aa MPLPPPGPGPEPIPGCTAPTQSPVGRHVVGVKGKGERERQREKIGGLKSIFILPLRPGSM VSSLGEEEVEEAAIIIANIDGLLGITFMQNISSSCQVKSFGKQMNPAKMDQKEYSWVING ETVPGEHQGPRDADSDENDKGEKKNKGTFDGDKLGDLKEEGDVMDKTNGLPVQNGIDADV KDFSRTPGNCQNSANEVDLLGPNQNGSEGLAQLTSTNGAKPVEDFSNMESQSVPLDPMEH VGMEPLQFDYSGTQVPVDSAAATVGLFDYNSQQQLFQRPNALAVQQLTAAQQQQYALAAA HQPHIGMFSAGLAPAAFVPNPYIISAAPPGTDPYTAGLAAAATLGPAVVPHQYYGVTPWG VYPASLFQQQAAAAAAATNSANQQTTPQAQQGQQQVLRGGASQRPLTPNQNQQGQQTDPL VAAAAVNSALAFGQGLAAGMPGYPVLAPAAYYDQTGALVVNAGARNGLGAPVRLVAPAPV IISSSAAQAAVAAAAASANGAAGGLAGTTNGPFRPLGTQQPQPQPQQQPNNNLASSSFYG NNSLNSNSQSSSLFSQGSAQPANTSLGFGSSSSLGATLGSALGGFGTAGGLTNGSGRYIS AAPGAEAKYRSASSASSLFSPSSTLFSSSRLRYGMSDVMPSGRSRLLEDFRNNRYPNLQL REIAGHIMEFSQDQHGSRFIQLKLERATPAERQLVFNEILQAAYQLMVDVFGNYVIQKFF EFGSLEQKLALAERIRGHVLSLALQMYGCRVIQKALEFIPSDQQNEMVRELDGHVLKCVK DQNGNHVVQKCIECVQPQSLQFIIDAFKGQVFALSTHPYGCRVIQRILEHCLPDQTLPIL EELHQHTEQLVQDQYGNYVIQHVLEHGRPEDKSKIVAEIRGNVLVLSQHKFASNVVEKCV THASRTERAVLIDEVCTMNDGPHSALYTMMKDQYANYVVQKMIDVAEPGQRKIVMHKIRP HIATLRKYTYGKHILAKLEKYYMKNGVDLGPICGPPNGII >gi568815597r:30833214_31159566|GENSCAN_predicted_CDS_5|3003_bp atgccgctgccaccgccggggccggggccggagcctatcccaggatgcaccgcgccaacc cagtccccagtgggccgccatgttgtcggagtgaaaggtaagggggagcgagagcgccag agagagaagatcggggggctgaaatccatcttcatcctaccgctccgcccaggcagcatg gtgagcagcttgggggaggaggaagtggaggaggcggctataataatagcaaacatcgat ggcctactggggataacattcatgcagaacatcagcagctcttgccaggtaaaaagtttt gggaaacagatgaatccagcaaagatggaccaaaaggaatattcctgggtgatcaatggc gagacagtgcctggggaacatcagggcccaagggatgcagacagtgatgaaaacgacaaa ggtgaaaagaagaacaagggtacgtttgatggagataagctaggagatttgaaggaggag ggtgatgtgatggacaagaccaatggtttaccagtgcagaatgggattgatgcagacgtc aaagattttagccgtacccctggtaattgccagaactctgctaatgaagtggatcttctg ggtccaaaccagaatggttctgagggcttagcccagctgaccagcaccaatggtgccaag cctgtggaggatttctccaacatggagtcccagagtgtccccttggaccccatggaacat gtgggcatggagcctcttcagtttgattattcaggcacgcaggtacctgtggactcagca gcagcaactgtgggactttttgactacaattctcaacaacagctgttccaaagacctaat gcgcttgctgtccagcagttgacagctgctcagcagcagcagtatgcactggcagctgct catcagccgcacatcggtatgttttcagcaggtttagctcccgctgcgtttgtccccaat ccatacatcatcagcgctgctcccccagggacggacccctacacagctggattggctgca gcagcgacactaggcccagctgtggtccctcaccagtattatggagttactccctgggga gtctaccctgccagtcttttccagcagcaagctgccgctgccgctgcagcaactaattca gctaatcaacagaccaccccacaggctcagcaaggacagcagcaggttctccgtggagga gccagccaacgtcctttgaccccaaaccagaaccagcagggacagcaaacggatcccctt gtggcagctgcagcagtgaattctgcccttgcatttggacaaggtctggcagcaggcatg ccaggttatccggtgttggctcctgctgcttactatgaccaaactggtgcccttgtagtg aatgcaggcgcgagaaatggtcttggagctcctgttcgacttgtagctcctgccccagtc atcattagttcctcagctgcacaagcagctgttgcagcagccgcagcttcagcaaatgga gcagctggtggtcttgctggaacaacaaatggaccatttcgccctttaggaacacagcag cctcagccccagccccagcagcagcccaataacaacctggcatccagttctttctacggc aacaactctctgaacagcaattcacagagcagctccctcttctcccagggctctgcccag cctgccaacacatccttgggattcggaagtagcagttctctcggcgccaccctgggatcc gcccttggagggtttggaacagcaggaggactcacgaatggcagtggaagatacatctct gctgctccaggcgctgaagccaagtaccgcagtgcaagcagcgcctccagcctcttcagc ccgagcagcactcttttctcttcctctcgtttgcgatatggaatgtctgatgtcatgcct tctggcaggagcaggcttttggaagattttcgaaacaaccggtaccccaatttacaactg cgggagattgctggacatataatggaattttcccaagaccagcatgggtccagattcatt cagctgaaactggagcgtgccacaccagctgagcgccagcttgtcttcaatgaaatcctc caggctgcctaccaactcatggtggatgtgtttggtaattacgtcattcagaagttcttt gaatttggcagtcttgaacagaagctggctttggcagaacggattcgaggccacgtcctg tcattggcactacagatgtatggctgccgtgttatccagaaagctcttgagtttattcct tcagaccagcagaatgagatggttcgggaactagatggccatgtcttgaagtgtgtgaaa gatcagaatggcaatcacgtggttcagaaatgcattgaatgtgtacagccccagtctttg caatttatcatcgatgcgtttaagggacaggtatttgccttatccacacatccttatggc tgccgagtgattcagagaatcctggagcactgtctccctgaccagacactccctatttta gaggagcttcaccagcacacagagcagcttgtacaggatcaatatggaaattatgtaatc caacatgtactggagcacggtcgtcctgaggataaaagcaaaattgtagcagaaatccga ggcaatgtacttgtattgagtcagcacaaatttgcaagcaatgttgtggagaagtgtgtt actcacgcctcacgtacggagcgcgctgtgctcatcgatgaggtgtgcaccatgaacgac ggtccccacagtgccttatacaccatgatgaaggaccagtatgccaactacgtggtccag aagatgattgacgtggcggagccaggccagcggaagatcgtcatgcataagatccggccc cacatcgcaactcttcgtaagtacacctatggcaagcacattctggccaagctggagaag tactacatgaagaacggtgttgacttagggcccatctgtggcccccctaatggtatcatc tga >gi568815597r:30833214_31159566|GENSCAN_predicted_peptide_6|83_aa MGLKAMELDDSASKGNIAAVGKDFVNNELTRESGFILLPREATLTQQKAGKLIHSKKKGD GYVDTENKFLKLVAAIKAALAQG >gi568815597r:30833214_31159566|GENSCAN_predicted_CDS_6|252_bp atgggattgaaagccatggaattggatgacagtgcctccaaagggaatatagctgctgtg ggaaaggacttcgtgaacaatgagttgaccagagagtcgggcttcatcctactgcccaga gaggccaccttgactcagcagaaagctgggaagttgattcactctaagaagaaaggtgat ggctatgtggacacagaaaacaagtttctgaagctggtggctgccatcaaagctgccttg gctcagggctaa