GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:38:19 Sequence gi568815585r:31037321_31261582 : 224262 bp : 43.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2635 2689 55 0 1 68 54 51 0.264 1.15 1.02 Intr + 4595 4708 114 0 0 54 76 79 0.371 3.72 1.03 Term + 6743 6852 110 0 2 33 50 128 0.518 2.17 1.04 PlyA + 7414 7419 6 1.05 2.32 PlyA - 7633 7628 6 1.05 2.31 Term - 8696 8591 106 1 1 71 44 110 0.398 2.78 2.30 Intr - 9593 9449 145 0 1 47 54 84 0.111 0.24 2.29 Intr - 15027 14891 137 2 2 92 105 -4 0.332 1.91 2.28 Intr - 16010 15906 105 1 0 20 105 81 0.306 2.33 2.27 Intr - 32745 32642 104 0 2 101 96 70 0.907 8.07 2.26 Intr - 41733 41656 78 0 0 72 47 68 0.047 0.85 2.25 Intr - 48995 48882 114 2 0 61 38 94 0.022 2.34 2.24 Intr - 74860 74810 51 1 0 65 91 36 0.053 0.70 2.23 Intr - 76431 76358 74 1 2 68 111 17 0.049 1.13 2.22 Intr - 81836 81756 81 0 0 110 95 -17 0.061 0.81 2.21 Intr - 87048 86986 63 1 0 102 90 40 0.073 4.29 2.20 Intr - 100204 100048 157 1 1 108 23 137 0.006 8.78 2.19 Intr - 101248 101087 162 1 0 60 113 156 0.828 15.47 2.18 Intr - 101580 101461 120 0 0 65 113 131 0.999 13.99 2.17 Intr - 101787 101680 108 0 0 44 78 86 0.911 3.68 2.16 Intr - 102989 102864 126 2 0 76 91 93 0.988 9.28 2.15 Intr - 103939 103802 138 1 0 80 107 66 0.995 8.36 2.14 Intr - 106603 106472 132 1 0 68 87 56 0.902 4.34 2.13 Intr - 108448 108243 206 2 2 104 95 174 0.998 18.62 2.12 Intr - 110772 110639 134 2 2 127 82 37 0.999 7.29 2.11 Intr - 111160 111054 107 1 2 95 75 36 0.995 2.01 2.10 Intr - 112862 112634 229 1 1 25 80 150 0.977 5.77 2.09 Intr - 113871 113627 245 0 2 43 89 216 0.690 13.50 2.08 Intr - 114422 114289 134 0 2 13 98 74 0.684 1.26 2.07 Intr - 115631 115532 100 2 1 106 95 38 0.993 5.98 2.06 Intr - 117435 117313 123 0 0 81 89 101 0.994 10.28 2.05 Intr - 118334 118194 141 2 0 65 85 69 0.954 4.85 2.04 Intr - 121543 121486 58 0 1 95 91 2 0.972 0.09 2.03 Intr - 123807 123610 198 2 0 88 55 65 0.753 1.67 2.02 Intr - 124478 124156 323 2 2 61 105 252 0.788 18.86 2.01 Init - 124802 124662 141 2 0 46 117 109 0.780 9.74 2.00 Prom - 129001 128962 40 -5.06 3.00 Prom + 131811 131850 40 -4.56 3.01 Init + 162765 162834 70 2 1 86 106 211 0.999 21.91 3.02 Term + 163400 163614 215 0 2 93 38 118 0.984 4.69 3.03 PlyA + 164558 164563 6 1.05 4.06 PlyA - 165735 165730 6 1.05 4.05 Term - 166084 165936 149 2 2 42 43 165 0.523 5.46 4.04 Intr - 170673 170649 25 0 1 105 78 22 0.118 0.50 4.03 Intr - 176418 176326 93 0 0 125 73 -20 0.130 0.36 4.02 Intr - 186664 186558 107 2 2 81 83 35 0.049 2.23 4.01 Init - 190830 190578 253 0 1 53 45 164 0.113 6.17 4.00 Prom - 203633 203594 40 -3.46 5.00 Prom + 208466 208505 40 -6.06 5.01 Sngl + 214211 214591 381 1 0 58 38 327 0.647 20.87 5.02 PlyA + 214819 214824 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 23658 23824 167 1 2 128 44 87 0.867 6.38 S.002 Term - 100204 99998 207 1 0 108 37 190 0.993 13.24 S.003 Init - 145849 145687 163 1 1 97 81 108 0.841 9.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:31037321_31261582|GENSCAN_predicted_peptide_1|92_aa MAFKITKNYEQSSVEEKNEFGTGSTSCIHSAPSLAFDKRSNCTITNHFVIFIDISEGGIH TAYGSDGAPGCVLFGFQAWDSPKWTPAQGQDE >gi568815585r:31037321_31261582|GENSCAN_predicted_CDS_1|279_bp atggcttttaaaatcaccaagaattatgaacagagtagtgttgaggagaagaatgagttt ggcacgggctccacctcttgcatccattcagccccttcactggcttttgacaagaggtcc aactgcaccatcacaaatcattttgtgattttcatagacatcagtgaagggggcatccac acagcctacggctcggatggtgcccctggatgtgtgctgttcggcttccaggcctgggac tcccccaaatggaccccggcacagggacaggatgagtag >gi568815585r:31037321_31261582|GENSCAN_predicted_peptide_2|1379_aa MARERPCRCHGNGGRSPGTGCPLGRIFPEGSRRRKRKWHVEGPVEAPGFLSASRRCPRGS RRLLTGRGCLCVLLSVRGTARPRGPEQNAARAESGGRRSRQGAGGRRPRPEAEADREPAM SVVGLDVGSQSCYIAVARAGGIETIANEFSDRCTPLQPAPSAAAPVSPPGLSELAEPPPA CPPPFAHIFAHRGAGLPAFFFRIVNLEMNWKLPELVSLLVQSVISFGSKNRTIGVAAKNQ QITHANNTVSNFKRFHGRAFNDPFIQKEKENLSYDLVPLKNGGVGIKVMYMGEEHLFSVE QITAMLLTKLKETAENSLKKPVTDCVISVPSFFTDAERRSVLDAAQIVGLNCLRLMNDMT AVALNYGIYKQDLPSLDEKPRIVVFVDMGHSAFQVSACAFNKGKLKVLGTAFDPFLGGKN FDEKLVEHFCAEFKTKYKLDAKSKIRALLRLYQECEKLKKLMSSNSTDLPLNIECFMNDK DVSGKMNRSQFEELCAELLQKIEVPLYSLLEQTHLKVEDVSAVEIVGGATRIPAVKERIA KFFGKDISTTLNADEAVARGCALQCAILSPAFKVREFSVTDAVPFPISLIWNHDSEDTEG VHEVFSRNHAAPFSKVLTFLRRGPFELEAFYSDPQGVPYPEAKIGRFVVQNVSAQKDGEK SRVKVKVRVNTHGIFTISTASMVEKVPTEENEMSSEADMECLNQRPPENPDTDKNVQQDN SEAGTQPQVQTDAQQTSQSPPSPELTSEENKIPDADKANEKKVDQPPEAKKPKIKVVNVE LPIEANLVWQLGKDLLNMYIETEGKMIMQDKLEKERNDAKNAVEEYVYEFRDKLCGPYEK FICEQDHQNFLRLLTETEDWLYEEGEDQAKQAYVDKLEELMKIGTPVKVRFQEAEERPKM FEELGQRLQHYAKIAADFRNKDEKYNHIDESEMKKVEKSVNEVMEWMNNVMNAQAKKSLD QDPVVRAQEIKTKIKELNNTCEPVVTQPKPKIESPKLERTPNGPNIDKKEEDLEDKNNFG AEPPHQNGQTNWKLHLEVSGTLLGMIYRAKLPRESVDFISRESIAQRIAAADTVLGRGQI GMCIQQGERYWCGKLLVANLTILLDVIDMHKALLVSQLALVVKLKLEDEFRAVGIYVPYH LEGEKLSAGGAEEAKSLLLLRKVGPLITITDEEMGAVELGQWVGLAGQQAGPSVFGLFRD SCLLLLLLLLFLLLLRPTATTDDLLMPLLKSVATGQQLDKSVSSLPRRNQRKTPLVSKKH TGSHFFTASSCRICGILQVLVCLLSPVLRTLCTSPPDRDIEGQVIATAGVAAAAAAALKA LYKTCSVLPVRQLGHRSAGEGAAAWASHDSSPSTRSSSFRLLLTHKEHFNLMNHQQHFL >gi568815585r:31037321_31261582|GENSCAN_predicted_CDS_2|4140_bp atggcacgggagcggccctgtcgctgccatggcaacggcggccgttctccggggaccggc tgcccattgggtagaatctttccagaaggctcgagaagaaggaagcggaagtggcacgtg gaggggccggtggaggcgccgggtttcttatcagccagccgccgctgtccccgggggagt aggaggctcctgacaggccgcggctgtctgtgtgtccttctgagtgtcagaggaacggcc agaccccgcgggccggagcagaacgcggccagggcagaaagcggcggcaggagaagcagg cagggggccggaggacgcagaccgagacccgaggcggaggcggaccgcgagccggccatg tcggtggtggggttggacgtgggctcgcagagctgctacatcgcggtagcccgggccggg ggcatcgagaccatcgccaatgagttcagcgaccggtgcaccccgctgcagcctgcgcct tccgcggccgcccccgtatccccacccggcctctctgagctggctgagccgcctccagct tgtcctcctcccttcgcgcatatctttgcgcaccgcggagcggggctgccggcctttttc ttccgcattgtgaatctcgaaatgaactggaagcttccagagcttgtgtcgctgcttgtc cagtcagtcatatcatttggatcaaaaaatagaacaatcggagttgcagccaaaaatcag caaatcactcatgcaaacaatacggtgtctaacttcaaaagatttcatggccgagcattc aatgaccccttcattcaaaaggagaaggaaaacttgagttacgatttggttccattgaaa aatggtggagttggaataaaggtaatgtacatgggtgaagaacatctatttagtgtggag cagataacagccatgttgttgactaagctgaaggaaactgctgaaaacagcctcaagaaa ccagtaacagattgtgttatttcagtcccctccttctttacagatgctgagaggcgatct gtgttagatgctgcacagattgttggcctaaactgtttaagacttatgaatgacatgaca gctgttgctttgaattacggaatttataagcaggatctcccaagcctggatgagaaacct cggatagtggtttttgttgatatgggacattcagcttttcaagtgtctgcttgtgctttt aacaagggaaaattgaaggtactgggaacagcttttgatcctttcttaggaggaaaaaac ttcgatgaaaagttagtggaacatttttgtgcagaatttaaaactaagtacaagttggat gcaaaatccaaaatacgagcactcctacgtctgtatcaggaatgtgaaaaactgaaaaag ctaatgagctctaacagcacagaccttccactgaatatcgaatgctttatgaatgataaa gatgtttccggaaagatgaacaggtcacaatttgaagaactctgtgctgaacttctgcaa aagatagaagtacccctttattcactgttggaacaaactcatctcaaagtagaagatgtg agtgcagttgagattgttggaggcgctacacgaattccagctgtgaaggaaagaattgcc aaattctttggaaaagatattagcacaacactcaatgcagatgaagcagtagccagagga tgtgcattacagtgtgcaatactttccccggcatttaaagttagagaattttccgtcaca gatgcagttccttttccaatatctctgatctggaaccatgattcagaagatactgaaggt gttcatgaagtctttagtcgaaaccatgctgctcctttctccaaagttctcacctttctg agaagggggccttttgagctagaagctttctattctgatccccaaggagttccatatcca gaagcaaaaataggccgctttgtagttcagaatgtttctgcacagaaagatggagaaaaa tctagagtaaaagtcaaagtgcgagtcaacacccatggcattttcaccatctctacggca tctatggtggagaaagtcccaactgaggagaatgaaatgtcttctgaagctgacatggag tgtctgaatcagagaccaccagaaaacccagacactgataaaaatgtccagcaagacaac agtgaagctggaacacagccccaggtacaaactgatgctcaacaaacctcacagtctccc ccttcacctgaacttacctcagaagaaaacaaaatcccagatgctgacaaagcaaatgaa aaaaaagttgaccagcctccagaagctaaaaagcccaaaataaaggtggtgaatgttgag ctgcctattgaagccaacttggtctggcagttagggaaagaccttcttaacatgtatatt gagacagagggtaagatgataatgcaagataaattggaaaaagaaaggaatgatgctaaa aatgcagttgaggaatatgtgtatgagttcagagacaagctgtgtggaccatatgaaaaa tttatatgtgagcaggatcatcaaaattttttgagactcctcacagaaactgaagactgg ctgtatgaagaaggagaggaccaagctaaacaagcatatgttgacaagttggaagaatta atgaaaattggcactccagttaaagttcggtttcaggaagctgaagaacggccaaaaatg tttgaagaactaggacagaggctgcagcattatgccaagatagcagctgacttcagaaat aaggatgagaaatacaaccatattgatgagtctgaaatgaaaaaagtggagaagtctgtt aatgaagtgatggaatggatgaataatgtcatgaatgctcaggctaaaaagagtcttgat caggatccagttgtacgtgctcaggaaattaaaacaaaaatcaaggaattgaacaacaca tgtgaacccgttgtaacacaaccgaaaccaaaaattgaatcacccaaactggaaagaact ccaaatggcccaaatattgataaaaaggaagaagatttagaagacaaaaacaattttggt gctgaacctccacatcagaatggtcagaccaattggaagctgcacctggaggtatctgga accctgcttggcatgatttatagagccaagctgcccagggaaagtgttgattttatttct agggaatccattgctcaaaggatcgctgcagcagacactgtcttaggacggggtcagatc gggatgtgcatccagcagggagagaggtactggtgtggcaaattattggttgctaatttg accatactcctggatgtcatagacatgcacaaagccttgctggttagccagctggcgcta gtagtgaaactgaaattggaagatgagttcagggctgtgggcatctatgttccctaccat ttggagggagaaaagctgtctgcaggaggagctgaggaggccaagtctttattactgctc cggaaggtgggtcctcttatcaccattacagatgaggaaatgggggctgtagaacttggt cagtgggtgggcctggccgggcagcaggcaggaccatctgtgtttgggctcttcagggat tcctgtcttctgttgctgctgctgctgctgtttctgcttttgctcaggcccacagccacc acagatgacctgctgatgcctctgctgaaatccgtggccactggccaacagcttgacaag agtgtcagctccctcccgagacgaaatcagaggaaaacgcctctagtttctaagaagcac acagggagccacttctttaccgctagctcctgcaggatctgtgggatcctgcaggtgctg gtttgcctcctgagccctgtcctaaggacactctgcacatcgcctcctgacagagacatt gaaggccaggttatagccactgcaggagtggcagcagcagcagcagcagctttgaaagcc ttatataaaacatgctcggtattgcccgtgaggcagcttggccacagatcagctggagag ggtgcggcggcatgggccagccacgattcttcgccctccacacggagcagcagcttccgt ttacttctgactcacaaggagcactttaacctgatgaatcaccagcagcacttcctgtga >gi568815585r:31037321_31261582|GENSCAN_predicted_peptide_3|94_aa MRPPACWWLLAPPALLALLTCSLGPSGRALAPEDKRPARRQPFPDSDPEEDRGARWASAG AGAAGRLAPVSGLRWGHQGTRLRAEQCLPGLLLL >gi568815585r:31037321_31261582|GENSCAN_predicted_CDS_3|285_bp atgcggccgcccgcctgctggtggctgctcgcgccgccggcgctgctcgcgctcctcacc tgctccctggggccttcgggacgtgccctggcccctgaggataagcggccagctcggcgc cagcccttcccggattccgacccggaggaggacaggggcgcccgctgggcctccgcggga gctggcgcggcggggaggcttgctcctgtctcgggtctccgctggggacatcaggggaca cgtctgcgagcagagcagtgcttgcccggtctcctgctgctgtaa >gi568815585r:31037321_31261582|GENSCAN_predicted_peptide_4|208_aa MLAVFTSLLEMHLKAASFLQRRTGRKPEYVLPTKTIRGLANGSSKAASQRNTLGLPRFWA YNKEMDKTVIVSAPCHQTGMSAEEELAFQVSMENSGEEARRAYCSLLVSWNPGLSIMKLE STYHQLTYIFVYHLSYMSADYKLHKGRNTVIREEMEQELTVGYWQTPLGQVHDLGSHANS EVLDYLDDKESIQKIKICASNICILLKK >gi568815585r:31037321_31261582|GENSCAN_predicted_CDS_4|627_bp atgctggcagtattcaccagcctgctagagatgcacctgaaggcagccagctttttacaa cggcgaacaggaaggaaacccgagtatgtcctgcccacaaagaccatcagagggcttgcc aatggcagtagcaaggctgcctcccagagaaacacgttaggcctccccaggttctgggcc tataacaaagaaatggataaaacagtgattgtgtcagcaccctgccaccagacaggaatg tcggcagaggaagagctcgccttccaggtctccatggaaaatagtggagaggaagcaaga agagcatattgttcccttcttgtttcctggaacccaggcttatccatcatgaaactggag agcacttatcaccaactgacatatatatttgtttatcacttatcttacatgtcagctgat tataagctccacaagggcaggaatacagttataagggaagaaatggaacaagaattaact gttggctactggcaaactcccctgggtcaagtccatgacctggggagccatgccaactcg gaggtcctggactacctggatgacaaagaatccatccagaagattaaaatctgtgccagt aatatctgcatactgcttaagaaataa >gi568815585r:31037321_31261582|GENSCAN_predicted_peptide_5|126_aa MELKNTAQALCEAYPSTNSRIDQVEERISEIEDQLNEIKQEDKIRERRVKRNEQSLQEIW DCVKRPNLHLIAVPESDGENGTKLENTLQDIIRENFPNVAKQVNIQTQKYGEHYKDTPRE EQPQDT >gi568815585r:31037321_31261582|GENSCAN_predicted_CDS_5|381_bp atggagctgaaaaacacagcacaagcactttgtgaagcatacccaagtaccaatagccga atcgatcaagtggaagaaaggatatcagagattgaagatcaactcaatgaaataaagcaa gaagacaagattagagaaagaagagtgaaaagaaatgaacaaagcctccaagaaatatgg gactgtgtgaaaagaccaaatctacatttgattgctgtacctgaaagtgatggcgagaat ggaaccaagttggaaaacactcttcaggatattatccgggagaacttccccaacgtagca aagcaggtcaacattcaaactcagaaatatggagaacactacaaagatactcctcgagaa gagcaaccccaagacacatag