GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:00:48 Sequence gi568815581r:64914260_65156593 : 242334 bp : 44.10% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5053 5225 173 0 2 93 68 130 0.381 8.41 1.02 Intr + 13377 13550 174 0 0 49 79 144 0.240 8.65 1.03 Intr + 14248 14431 184 1 1 82 9 182 0.580 9.49 1.04 Intr + 22885 22988 104 0 2 125 105 54 0.367 9.77 1.05 Intr + 24872 24996 125 2 2 91 57 40 0.364 1.43 1.06 Intr + 26324 26466 143 0 2 83 119 165 0.996 19.17 1.07 Intr + 39418 39790 373 0 1 41 -60 358 0.110 10.94 1.08 Intr + 39932 40234 303 0 0 53 58 203 0.527 10.36 1.09 Intr + 42040 42171 132 2 0 99 67 74 0.030 7.02 1.10 Intr + 43741 43836 96 2 0 109 86 12 0.040 3.08 1.11 Term + 51626 51900 275 0 2 22 44 237 0.804 8.33 1.12 PlyA + 52383 52388 6 1.05 2.03 PlyA - 52525 52520 6 1.05 2.02 Term - 58403 58231 173 0 2 78 54 150 0.989 8.59 2.01 Init - 61926 61431 496 0 1 49 72 406 0.946 28.37 2.00 Prom - 72726 72687 40 -5.06 3.06 PlyA - 73410 73405 6 1.05 3.05 Term - 100570 99998 573 1 0 86 41 545 0.996 44.05 3.04 Intr - 101710 101654 57 1 0 88 89 12 0.549 0.38 3.03 Intr - 104044 103994 51 1 0 135 100 45 0.985 9.50 3.02 Intr - 139469 139243 227 0 2 55 99 169 0.873 12.40 3.01 Init - 142781 142052 730 2 1 63 50 804 0.836 67.06 3.00 Prom - 161571 161532 40 -5.16 4.06 PlyA - 162799 162794 6 1.05 4.05 Term - 164296 164221 76 0 1 128 53 82 0.933 6.01 4.04 Intr - 172889 172804 86 2 2 77 75 61 0.063 2.32 4.03 Intr - 186565 186422 144 1 0 39 94 129 0.865 9.08 4.02 Intr - 186976 186802 175 0 1 55 110 39 0.551 2.74 4.01 Init - 188935 188874 62 1 2 79 75 22 0.537 0.72 4.00 Prom - 192081 192042 40 -4.96 5.00 Prom + 196089 196128 40 -5.06 5.01 Init + 208892 209027 136 1 1 82 60 87 0.202 3.74 5.02 Intr + 210644 210741 98 0 2 47 54 101 0.052 2.33 5.03 Intr + 214919 215079 161 1 2 60 45 83 0.078 -0.01 5.04 Intr + 222932 223030 99 2 0 113 80 18 0.018 2.73 5.05 Intr + 223274 223338 65 2 2 52 86 47 0.022 -0.74 5.06 Intr + 224233 224333 101 2 2 63 48 90 0.034 2.33 5.07 Intr + 239163 239259 97 2 1 112 64 140 0.430 13.58 5.08 Term + 242196 242215 20 1 2 103 43 32 0.392 -1.32 5.09 PlyA + 242261 242266 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 49851 49962 112 2 1 48 55 100 0.819 3.08 S.002 Init - 106360 106331 30 1 0 33 87 24 0.893 -3.46 S.003 Init + 107935 108006 72 0 0 59 78 107 0.971 7.87 S.004 Term + 108751 108957 207 0 0 52 49 119 0.886 1.74 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:64914260_65156593|GENSCAN_predicted_peptide_1|693_aa MGAVGAAGAGWGGQRGRAGGARPAEAVGGRRLNPNGWGLTADRLQPPSCFFRYRSAVTFY IGCDLCTNWYYGECVGIAEKEAKKMDVYICNDCKQAQEGSSEELYCICRTPYDESQSCRF FIGRDRCQNWYHGCCIGILQSEAELIDEYVCPQCQSTEDAMTVLTPLTEKDDEELKRRQP SKFSTPDYTVELKSTTLCWAPKTNPVRRTCWSHLLLATAPTLLLPFWGPSGRVDLEEAGI AEPAAGTLDTGNKGCFQQIGFFVRPKLCAFSGLYYCDICHQDDASVILARIIHSWDLTKH LQFMALKERIGWRYSLLFVGLLQLNIVVFGALLRPIIIRGPASPKIVIQENRKEAQYMLE NEKTRTSIDSIDSGVELTTSPKNVPTHTNPELEPKADLQQVLVKTSPRPSKKKAPLLDFS ILKEKKVFRRIGAGFVLNREPIRVIYIELICVILLTVSLFAFTFAREFWGLMSCSVFFGF MVGTTGGTHIPLLAEDDVVGIEKMSSAAGVYIFIQSIAGLAGPPLAGLLVDQSKIYSRAF YSCAAGMALAAVCLALVRPCKMGLCQHHHSGDGQCSLLHLYSSLMLIYDNSGEKHNKPSL VTGRPARHAAVEEVTLGRRRQGKPSDRPHTSEPRRGSPRARKQVVRRRFAAAVPAGLGAA GDSDAFPAREGPERRAGYSGPAAACFDFSTAAP >gi568815581r:64914260_65156593|GENSCAN_predicted_CDS_1|2082_bp atgggcgcagtgggggcggccggggccgggtggggaggccagaggggcagggcaggaggg gcccggcctgcggaagcggttggagggagaaggctcaatccgaatggctggggcctcact gcggatcgccttcagccgccatcttgtttcttccgttaccggagcgcggtcacgttctat attggctgtgatctttgtactaactggtattatggagaatgtgttggcatcgcagaaaag gaggctaagaaaatggatgtgtacatctgtaatgattgtaaacaggcacaagagggcagc agtgaggaattgtactgtatctgcagaacaccttatgatgagtcacaatcttgcagattt tttattggccgtgatcggtgtcagaattggtaccatgggtgctgcattggcatcttgcaa agtgaggcagagctcattgatgagtatgtctgtccacagtgccagtcaacagaggatgcc atgacagtgctcacaccactaacagagaaggatgatgaggagttgaagaggaggcagcct tccaagttttctacccccgactacacagtggagctgaagagcaccactctctgctgggcc cctaaaacaaatccagtaagaagaacgtgctggagccacttattgctcgccacagcccct actctgctgcttccattctggggcccgagtggacgggtggacctagaggaggctggcatt gctgagccagcagcagggacactagacacaggcaacaagggctgcttccagcagatcggc ttctttgtacgacccaagctctgtgccttctctggcctctattactgtgacatctgccac caagatgatgcctcagtgattctggccaggatcatccacagctgggacctcaccaagcac ctgcaattcatggctctgaaggagcgcattggctggagatacagcctcctcttcgtgggc ctactacagttaaacattgtcgtcttcggagcactgctcagacccatcatcatcagagga ccagcgtcaccaaaaatagtcatccaggaaaatcggaaagaagcacagtatatgcttgaa aatgagaaaacacgaacctcaatagactccattgactcaggagtagaactaactacctca cctaaaaatgtgcctactcacactaacccagaactggagccgaaggcagacctgcagcag gtcctggtgaagaccagccccaggccaagcaaaaagaaagccccgctattagacttctcc attttgaaagagaaaaaagttttcaggaggatcggagctggttttgtcctcaacagagag cccattcgtgtgatttacattgagctcatctgcgtcatcttattgactgtgtctctgttt gcctttacttttgctagggaattctggggtctaatgtcatgtagcgtattttttgggttt atggttggaacaacaggagggacccacattccactgcttgctgaggatgatgttgtgggc attgagaagatgtcttctgcagctggggtctacatcttcattcagagcatagcaggactg gctggaccaccccttgcgggtttgttggtggaccaaagtaagatctacagcagggccttc tactcctgcgcagctggcatggccctggctgctgtgtgcctcgccctggtgagaccgtgt aagatgggactgtgccagcatcatcactcaggtgacggtcaatgctcgctgctccatctt tactcatcattaatgttaatttatgacaactcgggggaaaaacataacaagcctagtttg gttacaggcaggcctgcacggcacgcagcagtggaggaggttaccctgggtcgccgacgc caggggaagccaagcgaccgaccccacacttctgagccccgccggggaagtccccgggcg cggaaacaggtggtgcgccggcggttcgcagctgctgtgcccgctggcctgggcgcagcc ggggacagcgacgcgtttcctgcccgggaagggcccgagcgcagggccggctatagcggt cccgcagctgcctgcttcgattttagcactgctgctccctag >gi568815581r:64914260_65156593|GENSCAN_predicted_peptide_2|222_aa MLAPRCLSDLTASPGIRVSASERQGRHEDTPHTCASGFQSPPPPTNKEGKVLWVTAPGSA AAVRQPAAAADSFGGYAGCIRHAGTPLKPAACRARSHTGDPGDAGASVALENQAAEPRRG QTGNWRLRGRTAGACSPLVTLYSGKGEPRPPNSSSGALRGVARAGRDILKFLKKKKPEDA FCVVGITMIDLYPRDLWNFVFGQASLTDGIPFLALLLEASST >gi568815581r:64914260_65156593|GENSCAN_predicted_CDS_2|669_bp atgctggccccgcggtgcctttccgacctcactgcctctcctggcatccgggtctcggcc tcggagcgccagggacgccacgaggacacgccccacacttgcgcgtctgggttccagtcc ccgccgcccccaacaaataaagaagggaaagtcctttgggtgactgccccagggagcgcc gcggctgtacgacaacctgcggcggccgccgactcatttgggggctacgctggttgcatt cgtcacgccgggacgcctctcaaacccgcggcctgccgagcacgttcccacacgggcgac cctggtgacgcgggcgcgtctgtggctctggagaaccaggccgcggagccccggcgaggg cagacggggaactggcgactgcgggggcggacggctggggcttgtagtcccctcgtgacc ctctactcaggaaaaggcgagccgcggccgccgaactccagctctggcgcgctcaggggc gtggcgcgggcgggtcgggacatcctgaagttcttgaaaaagaagaaacctgaagatgcc ttctgtgttgtgggaataacaatgattgatctttacccaagagacttgtggaattttgtc tttggacaggcctctttgacagatggtattccgtttttggcattgttgttagaagcttct tcaacttga >gi568815581r:64914260_65156593|GENSCAN_predicted_peptide_3|545_aa MKGKQKVGLPSLAHVTAVPQPSPRARPRLRARAPERHIELVTRRPGREAEKRPRRASAPP GGSGSAHAPGPPPLRGRARPPSLPPSPAVLARPARPALRVSSLVPSLPGRARAAAALPAL EPRCRSRPPPEEEVEGAGGARRGGGGGGKMADFLPSRSVLSVCFPGCLLTSGEAEQQRKS KEIDKCLSREKTYVKRLVKILLLGAGESGKSTFLKQMRIIHGQDFDQRAREEFRPTIYSN VIKGMRVLVDAREKLHIPWGDNSNQQHGDKMMSFDTRAPMAAQGMVETRVFLQYLPAIRA LWADSGIQNAYDRRREFQLGESVKYFLDNLDKLGEPVYRPSVITSISQTEVLTLRDYIPS QQDILLARRPTKGIHEYDFEIKNVPFKMVDVGGQRSERKRWFECFDSVTSILFLVSSSEF DQVLMEDRLTNRLTESLNIFETIVNNRVFSNVSIILFLNKTDLLEEKVQIVSIKDYFLEF EGDPHCLRDVQKFLVECFRNKRRDQQQKPLYHHFTTAINTENIRLVFRDVKDTILHDNLK QLMLQ >gi568815581r:64914260_65156593|GENSCAN_predicted_CDS_3|1638_bp atgaaagggaaacagaaagtcgggctccccagcctggcccacgtgaccgcggttcctcag ccctccccgcgcgcccggccgcggctgcgggcgcgggcacccgagcgacacatcgagctc gtgactcgccgcccagggcgggaggcggagaagcggccgcggcgtgcgagcgcgcctccg ggaggcagcgggagcgcgcacgctccggggccaccgcccctccggggccgggcccgccct ccttccctccctccctcccccgctgtcctggcccgccctgcccggcccgccctgcgagtc agttcgctggttccctccctccctgggcgcgctcgggccgccgccgcgctccccgccctc gagcctcggtgccggagccgcccgccgccggaggaggaggtggagggagccggaggggcc cgccgaggcggcggcggcggcggcaagatggcggacttcctgccgtcgcggtccgtgctg tccgtgtgcttccccggctgcctgctgacgagtggcgaggccgagcagcaacgcaagtcc aaggagatcgacaaatgcctgtctcgggaaaagacctatgtgaagcggctggtgaagatc ctgctgctgggcgcgggcgagagcggcaagtccaccttcctgaagcagatgcggatcatc cacgggcaggacttcgaccagcgcgcgcgcgaggagttccgccccaccatctacagcaac gtgatcaaaggtatgagggtgctggttgatgctcgagagaagcttcatattccctgggga gacaactcaaaccaacaacatggagataagatgatgtcgtttgatacccgggcccccatg gcagcccaaggaatggtggaaacaagggttttcttacaatatcttcctgctataagagca ttatgggcagacagcggcatacagaatgcctatgaccggcgtcgagaatttcaactgggt gaatctgtaaaatatttcctggataacttggataaacttggagaaccagtatatagacca agtgtgatcacctccatttcacagacagaggtgctgacactcagagattatattccatca caacaagatattctgcttgccagaagacccaccaaaggcatccatgaatacgactttgaa ataaaaaatgttcctttcaaaatggttgatgtaggtggtcagagatcagaaaggaaacgt tggtttgaatgtttcgacagtgtgacatcaatacttttccttgtttcctcaagtgaattt gaccaggtgcttatggaagatcgactgaccaatcgccttacagagtctctgaacattttt gaaacaatcgtcaataaccgggttttcagcaatgtctccataattctgttcttaaacaag acagacttgcttgaggagaaggtgcaaattgtgagcatcaaagactatttcctagaattt gaaggggatccccactgcttaagagacgtccaaaaattcctggtggaatgtttccggaac aaacgccgggaccagcaacagaagcccttataccaccacttcaccactgctatcaacacg gagaacatccgccttgttttccgtgacgtgaaggatactattctgcatgacaacctcaag cagcttatgctacagtga >gi568815581r:64914260_65156593|GENSCAN_predicted_peptide_4|180_aa MPSDTPDSHTSLYFGRGCFCSLQAQDAAWKPKRNCLYHNSHQSLNEIELHKSTQRGALHS CPTCPRTLPRCGPRHSRAQAPPTWKAAGIRASQSASDDAGSLACAARASTCRQVALSGPR RDPAERQALSFLQAASILHYSTLGQPPPGASVAVTRKPLEQDVKAALSIQPDILLPDAEG >gi568815581r:64914260_65156593|GENSCAN_predicted_CDS_4|543_bp atgccctcagacacaccagactctcatacatccctgtacttcggcaggggctgcttctgc agtctccaggcccaggacgcagcttggaaacctaaacggaattgtctttatcacaactca caccagtcacttaatgaaatcgagctgcacaaatcgacccaaaggggtgccctccactcc tgccccacgtgtcctcggacgcttcccagatgtggtccccggcactccagggcccaggcg cctccaacttggaaagcagctggaatccgcgccagccaatccgcgagcgacgacgcgggc tcattagcatgcgctgcgcgggcgtccacctgccgccaggtggcgctgtccggcccccgc cgggaccccgccgagcggcaggcactgagcttcctgcaggcagcatctattctgcattac tccaccctaggccagcccccgcctggggccagcgtggctgtgaccaggaagcccttggaa caggatgtaaaagcagcactttccatccagcctgacatcctgcttccagatgcagaaggc tga >gi568815581r:64914260_65156593|GENSCAN_predicted_peptide_5|258_aa MGFKGAPPKSLPPRPPIPRRASSVEEAARGPRVGIRGSRPPGKASLPTPEMVLNKGFLKK RNSLEPRDEKEVGTTSEKLLTKSGGGCWKLKEEVGPPNLQHTAVVPEPSRHSQARGESGQ LAETPREPQASRSSSSFSSRGPALRVSPAAPRGPGGGRIAPPGGRGQDDNPTPRPAVQAE DGISPKGLSMLKPGQSWGPRDGRIWSTCFKMLVLIFRVAEIEALVKDMQNPETGVRMQNQ RVLVTSVPHAMTGEFQLL >gi568815581r:64914260_65156593|GENSCAN_predicted_CDS_5|777_bp atgggcttcaagggggcccctccgaaaagccttccaccccggcccccaatcccgcggcgc gcgagttccgtggaggaggccgcccgcggccccagagtcggcattaggggttcgcgtcca cccggcaaggcctcccttcccaccccagaaatggtgctcaataaaggcttcttgaagaaa cggaatagccttgagcccagagatgagaaagaagttggaacaactagtgaaaagctcctt acaaagagcggtggtggctgctggaagctgaaggaggaggtgggaccccccaacctccaa cacacagctgtggtgcctgagccatcccgtcacagtcaggcaaggggagaatctgggcag ttggcagaaacaccgagggagccacaagcctccaggtcctcctcctccttctcctcccgg ggccctgccctgcgggtctcacctgctgctccgcggggtcccggcgggggccggatagcg ccccctggcggcaggggccaggatgacaatccgacaccaaggccagcagtacaggccgag gatggcatttctccaaaaggactttcaatgctaaaacctggacagtcctggggaccccga gacggtcgcatttggtccacctgtttcaagatgctcgtgctcatcttccgggtagcagag attgaagcgctcgtgaaggacatgcagaacccagagacaggggtccgaatgcagaaccag agggtcctggtcaccagcgttcctcatgccatgacaggtgaattccagctgctgtag