GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:20:01 Sequence gi568815597r:235012067_235228715 : 216649 bp : 43.61% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6424 6497 74 0 2 111 48 32 0.842 2.04 1.02 Term + 10449 10587 139 0 1 80 53 117 0.651 4.84 1.03 PlyA + 11019 11024 6 1.05 2.00 Prom + 17326 17365 40 -4.16 2.01 Init + 35471 35700 230 1 2 87 80 70 0.648 3.96 2.02 Intr + 37479 37584 106 0 1 106 36 90 0.408 5.82 2.03 Intr + 57824 57949 126 1 0 66 86 52 0.567 3.68 2.04 Intr + 58489 58566 78 0 0 49 52 106 0.864 2.85 2.05 Intr + 61697 61840 144 1 0 56 93 124 0.965 10.18 2.06 Intr + 62279 62408 130 1 1 14 81 71 0.743 -0.73 2.07 Term + 62484 62515 32 1 2 115 47 43 0.535 0.92 2.08 PlyA + 67476 67481 6 1.05 3.09 PlyA - 72330 72325 6 1.05 3.08 Term - 80560 80513 48 1 0 98 51 34 0.207 -2.00 3.07 Intr - 85368 85161 208 2 1 62 -11 267 0.366 13.28 3.06 Intr - 93198 92996 203 0 2 104 3 80 0.079 -0.82 3.05 Intr - 94417 94325 93 1 0 51 61 64 0.078 0.16 3.04 Intr - 101844 101702 143 1 2 71 110 69 0.563 7.47 3.03 Intr - 107833 107752 82 1 1 49 85 82 0.425 3.21 3.02 Intr - 110306 110260 47 0 2 87 85 11 0.341 -1.17 3.01 Init - 116649 116529 121 0 1 80 92 235 0.920 23.45 3.00 Prom - 117294 117255 40 -5.96 4.22 PlyA - 118751 118746 6 1.05 4.21 Term - 119931 119647 285 0 0 104 42 80 0.483 0.40 4.20 Intr - 123704 123586 119 0 2 70 64 197 0.512 15.68 4.19 Intr - 126885 126854 32 2 2 73 76 28 0.378 -2.13 4.18 Intr - 128334 128026 309 2 0 42 75 164 0.571 5.93 4.17 Intr - 128599 128443 157 2 1 55 97 65 0.521 3.17 4.16 Intr - 129942 129768 175 0 1 68 68 24 0.079 -2.09 4.15 Intr - 140699 140640 60 2 0 75 84 48 0.617 2.03 4.14 Intr - 143046 142815 232 2 1 59 113 196 0.994 16.98 4.13 Intr - 148581 148445 137 0 2 -10 77 145 0.992 2.97 4.12 Intr - 149001 148827 175 2 1 75 106 140 0.997 14.44 4.11 Intr - 156586 156463 124 2 1 93 61 10 0.044 -1.56 4.10 Intr - 160698 160552 147 1 0 25 106 79 0.749 3.61 4.09 Intr - 163333 163118 216 2 0 120 75 74 0.787 7.78 4.08 Intr - 170727 169519 1209 1 0 105 100 957 0.435 86.11 4.07 Intr - 182145 181947 199 0 1 85 92 143 0.987 13.32 4.06 Intr - 185480 185337 144 2 0 52 52 148 0.603 8.08 4.05 Intr - 201960 201708 253 2 1 48 54 438 0.227 33.84 4.04 Intr - 207902 207727 176 2 2 93 86 179 0.995 16.94 4.03 Intr - 208479 208236 244 2 1 87 59 239 0.999 18.40 4.02 Intr - 209596 209499 98 1 2 54 106 31 0.983 0.31 4.01 Intr - 211194 211100 95 1 2 97 82 59 0.713 5.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 42847 43188 342 0 0 84 55 205 0.872 12.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:235012067_235228715|GENSCAN_predicted_peptide_1|70_aa MEESALPFKTQLQEPYSVTSPAIYRKREINSEKIQRKATKMINELEKQVTALRGNTLAEL DGYHTPDPCV >gi568815597r:235012067_235228715|GENSCAN_predicted_CDS_1|213_bp atggaggaatctgcattgcctttcaagacccagcttcaggagccatacagcgtcacttcc ccagcaatttacaggaaacgagagatcaactcagagaaaattcagagaaaagcaacaaag atgattaatgagctagaaaagcaagtgacagccttgcgggggaatacattggcagagctg gatggatatcacacaccagatccgtgcgtgtga >gi568815597r:235012067_235228715|GENSCAN_predicted_peptide_2|281_aa MRSARGLTYKTTWRGRNPSSKKLEVNPRDQSKESQMITNKRLQEVQTHMGAPAGERAAFN HSQTQRMHSKLPMHLLGIQDKALAVKPSSTRLLQHGAGSPPKVLRDVSFIKLMTLDGQRP LTLQSLSPRDPGQRQWPVSSCQRLLETAPLPHSWLLHLYIANNATSTGEKQMRKWLQKYF AKEGLCQLVIRSPPHPPEEQEAQQLLLCLQACVTKPVGRAVVPESTHKGHPGLPKVENGE EPRRRQAKGRWTHLPFSQQLNLSFRLDIRLKGDFSKAFSRQ >gi568815597r:235012067_235228715|GENSCAN_predicted_CDS_2|846_bp atgaggtctgctaggggcctgacctataagaccacctggaggggcagaaaccctagcagc aaaaagctcgaggtaaacccaagggaccaaagcaaagagtctcaaatgatcaccaataag agactccaggaagtgcagacccacatgggagccccggcaggagagagggcagctttcaac cacagccagacccaaaggatgcacagcaagctgcccatgcacctactcggcatccaggac aaggccctggcagtaaagcccagctccacacggctcctgcagcatggagctggttctcct ccaaaagtcctgagagatgtatcatttatcaagctgatgacgctggatggtcagaggccg ctgaccctccagtccctgtcccctagggaccccggccagcgtcagtggccggtcagctcc tgccaacgcctcctggaaactgcaccgttgccccactcgtggcttctgcatctatacatc gccaataacgctacgagcactggggaaaagcaaatgagaaagtggctgcagaagtatttt gccaaggaaggactgtgccaactagtcatccggtcaccgccgcaccccccggaggagcag gaggcacagcagcttctcctgtgtctccaagcctgtgtcacaaagcccgttggccgtgcc gtggtcccagaatctacccacaagggtcacccagggctgcccaaggtggagaatggagaa gaacccaggcgaaggcaggccaagggccgatggacacacctcccctttagtcaacagctc aatctaagcttcagactggacattcgactcaaaggtgacttcagcaaggcctttagcaga cagtga >gi568815597r:235012067_235228715|GENSCAN_predicted_peptide_3|314_aa MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDL KDAEAVQKFFLEEIQLGEELLAQGEYEKGVDHLTNAIAVCGQPQQLLQVLQQTLPPPVFQ MLLTKLPTISQLSLELDELQISSPGRQLIQNFDDLEFSLLSQRRNNSELVSFSERLEVCD ERQVQETVKPALQTSALEPMVLFPSANDNYSDLLIKTRKSISDQFSKEAGPLMEPKSIQK RIKNFIWHQLDQFLVHYIKKLEVLLMGSESDCAGTARRASSKNQEPQRCFGKSSPAAVRV VFLFHTDVAIGSEG >gi568815597r:235012067_235228715|GENSCAN_predicted_CDS_3|945_bp atggtgggtcggaacagcgccatcgccgccggtgtatgcggggcccttttcattgggtac tgcatctacttcgaccgcaaaagacgaagtgaccccaacttcaagaacaggcttcgagaa cgaagaaagaaacagaagcttgccaaggagagagctgggctttccaagttacctgacctt aaagatgctgaagctgttcagaagttcttccttgaagaaatacagcttggtgaagagtta ctagctcaaggtgaatatgagaagggcgtagaccatctgacaaatgcaattgctgtgtgt ggacagccacagcagttactgcaggtcttacagcaaactcttccaccaccagtgttccag atgcttctgactaagctcccaacaattagtcagctttcccttgagctagatgaactacag atcagcagtccaggtaggcagcttattcagaactttgacgacttggaatttagtctactt tcacagcggagaaacaatagtgaattagtttctttctcagagagacttgaggtatgtgat gaaagacaagtccaagaaacagtgaagccagctttgcagacatcagccctagaaccaatg gtattatttcccagtgcaaatgataattactccgacctactcatcaagacaagaaaaagc atctctgatcaattttcaaaagaggctggacccctcatggagcccaagagcatccaaaag aggatcaaaaacttcatctggcaccagttagaccagttcctggtccactacatcaagaaa ctggaagtgctgctgatgggcagtgaatctgactgtgcagggaccgctcgccgtgcctcc tccaagaaccaagaaccgcaacgctgtttcgggaagagcagcccagctgccgtcagagtg gtgttcttgtttcacacggatgtggcaattggctcagagggttga >gi568815597r:235012067_235228715|GENSCAN_predicted_peptide_4|1528_aa XTPINKRPVLGYRNLNLFKLFRLVHKLGGFDNIESGAVWKQVYQDLGIPVLNSAAGYNVK CAYKKYLYGFEEYCRSANIEFQMALPEKVVNKQCKECENVKEIKVKEENETEIKEIKMEE ERNIIPREEKPIEDEIERKENIKPSLGSKKNLLESIPTHSDQEKEVNIKKPEDNENLDDK DDDTTRVDESLNIKVEAEEEKAKSGDETNKEEDEDDEEAEEEEEEEEEEEDEDDDDNNEE EEFECYPPGMKVQVRYGRGKNQKMYEASIKDSDVEGGEVLYLVHYCGWNHLPAYKQNPLH GDESSAFLAFTGYKPVVCTSRVTAQEFATDTFQAYARNKLDKEKDKDEKYSPKNCKLRRL SKPPFQTNPSPEMVSKLDLTDAKNSDTAHIKSIEITSILNGLQASESSAEDSEQEDERGA QDMDNNGKEESKIDHLTNNRNDLISKEEQNSSSLLEENKVHADLVISKPVSKSPERLRKD IEVLSEDTDYEEDEVTKKRKDVKKDTTDKSSKPQIKRGKRRYCNTEECLKTGSPGKKEEK AKNKESLCMENSSNSSSDEDEEETKAKMTPTKKYNGLEEKRKSLRTTGFYSGFSEVAEKR IKLLNNSDERLQNSRAKDRKDVWSSIQGQWPKKTLKELFSDSDTEAAASPPHPAPEEGVA EESLQTVAEEESCSPSVELEKPPPVNVDSKPIEEKTVEVNDRKAEFPSSGSNSVLNTPPT TPESPSSVTVTEGSRQQSSVTVSEPLAPNQEEVRSIKSETDSTIEVDSVAGELQDLQSEG NSSPAGFDASVSSSSSNQPEPEHPEKANSSDSEELSAGESITKSQPVKSVSTGMKSHSTK SPARTQSPGKCGKNGDKDPDLKEPSNRLPKVYKWSFQMSDLENMTSAERITILQEKLQEI RKHYLSLKSEVASIDRRRKRLKKKERESAATSSSSSSPSSSSITAAVMLTLAEPSMSSAS QNGMSVECRENPDDGVRGSPPEDYRLGQVASSLFRGEHHSRGGTGRLASLFSSLEPQIQP VYVPVPKQTIKKTKRNEEEESTSQIERPLSQEPAKKVKAKKKHTNAEKKLADRESALASA DLEEEIHQKQGQKRKNSQPGVKVADRKILDDTEDTVVSQRKKIQINQEEERLKNERTVFV GNLPVTCNKKKLKSFFKEYGQIESVRFRSLQLSLLLHQENVSGIGGFLVSLTSRMKPRTL AVSVTALKVVRRESVPSDVRICSEFLPSALWWSMGLGVVEQGVALLGEARAAQEPMEGVG GSGIAGCRSRTLPCGKAAKARACRAGRLLRVRGPPSPRPPGTPAGPQAPRSPGSHSRLSL HTSLQAEGAGSGLGQPRNGLPQRSGGLKGSSSAVKVGAQAEEVRRASEGSEDCQHAVTSQ EESSCQFTRNIEDKEVEESAIEKHFLDCGSIMAVRIVRDKMTGIGKGFGYVLFENTDSVH LALKLNNSELMGRKLRVMRSVNKEKFKQQNSNPRLKNVSKPKQGLNFTSKTAEGHPKSLF IGEKAVLLKTKKKGQKKSGRPKKQRKQK >gi568815597r:235012067_235228715|GENSCAN_predicted_CDS_4|4587_bp ngtacacctattaacaaacgacctgtacttggatatcgaaatttgaatctctttaagtta ttcagacttgtacacaaacttggaggatttgataatattgaaagtggagctgtttggaaa caagtctaccaagatcttggaatccctgtcttaaattcagctgcaggatacaatgttaaa tgtgcttataaaaaatacttatatggttttgaggagtactgtagatcagccaacattgaa tttcagatggcattgccagagaaagttgttaacaagcaatgtaaggagtgtgaaaatgta aaagaaataaaagttaaggaggaaaatgaaacagagatcaaagaaataaagatggaggag gagaggaatataataccaagagaagaaaagcctattgaggatgaaattgaaagaaaagaa aatattaagccctctctgggaagtaaaaagaatttattagaatctatacctacacattct gatcaggaaaaagaagttaacattaaaaaaccagaagacaatgaaaatctggatgacaaa gatgatgacacaactagggtagatgaatccctcaacataaaggtagaagctgaggaagaa aaagcaaaatctggagatgaaacgaataaagaagaagatgaagatgatgaagaagcagaa gaggaggaggaggaggaagaagaagaagaggatgaagatgatgatgacaacaatgaggaa gaggagtttgagtgctatccaccaggcatgaaagtccaagtgcggtatggacgagggaaa aatcaaaaaatgtatgaagctagtattaaagattctgatgtcgaaggtggagaggtcctt tacttggtgcattactgcggatggaatcatcttccagcatataaacagaaccccttgcat ggggatgaatcctcagcttttctggcttttactggctacaagccagtagtttgtaccagc agagtaactgcccaggagtttgctacagatactttccaggcctatgctagaaataaatta gacaaagaaaaagacaaagatgaaaaatactctccaaaaaactgtaaacttcggcgcttg tccaaaccaccatttcagacaaatccatctcctgaaatggtatccaaactggatctcact gatgccaaaaactctgatactgctcatattaagtccatagaaattacttcgatccttaat ggacttcaagcttctgaaagttctgctgaagacagtgagcaggaagatgagagaggtgct caagacatggataataatggcaaagaggaatctaagattgatcatttgaccaacaacaga aatgatcttatttcaaaggaggaacagaacagttcatctttgctagaagaaaacaaagtt catgcagatttggtaatatccaaaccagtgtcaaaatctccagaaagattaaggaaagat atagaagtattatccgaagatactgattatgaagaagatgaagtcacaaaaaagagaaag gatgtcaagaaggacacaacagataaatcttcaaaaccacaaataaaacgtggtaaaaga aggtattgcaatacagaagagtgtctaaaaactggatcacctggcaaaaaggaagagaag gccaagaacaaagaatcactttgcatggaaaacagtagcaacagctcttcagatgaagat gaagaagaaacaaaagcaaagatgacaccaactaagaaatacaatggtttggaggaaaaa agaaaatctctacggacaactggtttctattcaggattttcagaagtggcagaaaaaagg attaaacttttaaataactctgatgaaagacttcaaaacagcagggccaaagatcgaaaa gatgtctggtcaagtattcagggacagtggcctaaaaaaacgctgaaagagcttttttca gactctgatactgaggctgcagcttccccaccgcatcctgccccagaggagggggtggca gaggagtcactgcagactgtggctgaagaggagagttgttcacccagtgtagaactagaa aaaccacctccagtcaatgtcgatagtaaacccattgaagaaaaaacagtagaggtcaat gacagaaaagcagaatttccaagtagtggcagtaattcagtgctaaatacccctcctact acacctgaatcgccttcatcagtcactgtaacagaaggcagccggcagcagtcttctgta acagtatcagaaccactggctccaaaccaagaagaggttcgaagtatcaagagtgaaact gatagcacaattgaggtggatagtgttgctggggagctccaagacctccagtctgaaggg aatagctcgccagcaggttttgatgccagtgtgagctcaagcagtagtaatcagccagaa ccagaacatcctgaaaaagcaaatagtagtgatagtgaagaactttcagctggtgaaagt ataactaagagtcagccagtcaaatcagtttccactggaatgaagtctcatagtaccaaa tctcccgcaaggacgcagtctccaggaaaatgtggaaagaatggtgataaggatcctgat ctcaaggaacccagtaatcgattacccaaagtttacaaatggagttttcagatgtcggac ctggaaaatatgacaagtgccgaacgcatcacaattcttcaagaaaaacttcaagaaatc agaaaacattatctgtcattaaaatctgaagtagcttccattgatcggaggagaaagcgt ttaaagaagaaagagagagaaagtgctgctacatcctcatcctcctcttcaccttcatcc agttccataacagctgctgttatgttaactttagctgaaccgtcaatgtccagcgcatca caaaatggaatgtcagttgagtgcagagagaatcctgacgacggcgttcgcgggagtccg ccggaagactacaggcttggacaggtcgccagtagcttatttcgcggcgaacaccattcc agaggtggcaccggtcggctggcgtccctcttcagttctctggagccccagattcaaccc gtgtacgtgcctgtgcctaaacaaaccatcaaaaaaacgaaacggaatgaggaggaagaa agtacatcccagattgaaagaccactttcgcaagaacctgccaaaaaagtgaaagcgaag aagaaacacactaacgcagaaaaaaagttggcagacagggaaagcgctctagcgagtgct gatttagaagaagaaattcaccagaaacaagggcagaaaaggaaaaattctcaacctggt gttaaagtagcagatagaaaaatacttgatgacacagaagacacagttgtcagtcaaaga aagaaaattcaaatcaaccaagaagaagagagattaaagaatgagagaactgtgtttgtt gggaatttgcctgttacatgtaataagaagaagctgaagtcgttttttaaagagtatgga caaatagaatctgtacgatttcgttctctgcagcttagccttctcctccatcaagagaat gtgtccggaattggtgggttcttggtctcactgacttcaagaatgaagccgcggaccctc gcggtgagtgttacagctcttaaggtggtgcgtcgggagtctgttccttctgatgttcgg atttgttcggagtttcttccttctgccctttggtggtcgatgggactgggcgtcgtggag cagggggtggcgctcctcggggaggctcgggccgcacaggagcccatggagggggtggga ggctcaggcatagcgggctgcaggtcccgaaccctgccctgcgggaaggcagctaaggcc cgggcctgcagggccggccggctgctccgagtgcggggcccgccaagcccacgcccacct ggaactccagctggcccgcaagcgccgcgcagccctggttcccactcgcgcctctccctc cacacctccctgcaagctgagggagccggctccggccttggccagcccagaaacgggctc ccacagcgcagcggtgggctgaagggctcctcaagtgccgtcaaagtgggagcccaggca gaggaggtgcggagagcaagcgagggctctgaggactgccagcacgctgtcacctctcaa gaagaatccagctgccagtttacaagaaatatagaggacaaagaagttgaagaatctgcc attgagaagcactttctggactgtggaagtatcatggccgtgaggattgtgagagacaaa atgacaggcatcggcaaagggtttggctatgtgctctttgagaatacagattctgttcat cttgctctgaaattaaataattctgaactcatggggagaaaactcagagtcatgcgttct gttaataaagaaaaatttaaacaacaaaattcaaatccacgattgaagaatgtcagtaaa cctaagcagggacttaattttacttccaaaactgcagaaggacatcctaaaagcttattt attggagaaaaagctgttctccttaaaacgaagaagaaaggacagaagaaaagtggacgc cctaagaaacagagaaaacagaaataa