GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:05:36 Sequence gi568815597r:77465643_77733354 : 267712 bp : 39.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1498 1493 6 1.05 1.05 Term - 6930 6820 111 0 0 157 48 29 0.841 3.38 1.04 Intr - 8926 8761 166 0 1 78 50 110 0.749 5.24 1.03 Intr - 23063 22938 126 1 0 -4 113 126 0.038 4.77 1.02 Intr - 31280 31149 132 1 0 77 70 74 0.298 3.34 1.01 Init - 31830 31727 104 0 2 60 80 122 0.550 8.36 1.00 Prom - 38868 38829 40 -1.85 2.00 Prom + 47050 47089 40 -3.65 2.01 Init + 48166 48297 132 0 0 62 86 57 0.660 3.09 2.02 Intr + 52321 52414 94 0 1 77 84 38 0.686 0.92 2.03 Intr + 52922 53085 164 0 2 53 63 216 0.832 14.37 2.04 Intr + 56185 56301 117 0 0 88 96 140 0.757 14.54 2.05 Intr + 61787 61912 126 1 0 68 92 52 0.615 3.56 2.06 Intr + 63148 63223 76 0 1 90 107 20 0.599 2.27 2.07 Intr + 66484 66615 132 2 0 39 40 117 0.393 1.70 2.08 Term + 70205 70422 218 0 2 87 44 204 0.947 12.22 2.09 PlyA + 71399 71404 6 1.05 3.06 PlyA - 73306 73301 6 1.05 3.05 Term - 73563 73490 74 1 2 46 44 92 0.480 -2.31 3.04 Intr - 74026 73892 135 2 0 75 75 65 0.387 3.52 3.03 Intr - 76257 76144 114 1 0 78 80 50 0.736 2.70 3.02 Intr - 79160 79004 157 2 1 51 88 60 0.631 0.96 3.01 Init - 79336 79250 87 1 0 80 55 68 0.681 3.39 3.00 Prom - 83611 83572 40 -4.95 4.11 PlyA - 83651 83646 6 1.05 4.10 Term - 100142 99998 145 1 1 115 44 59 0.843 0.60 4.09 Intr - 100539 100439 101 0 2 56 119 68 0.990 4.69 4.08 Intr - 102824 102690 135 2 0 85 115 94 0.995 11.64 4.07 Intr - 110578 110426 153 1 0 56 89 191 0.934 15.35 4.06 Intr - 113986 113885 102 1 0 23 91 109 0.942 4.15 4.05 Intr - 115427 115356 72 2 0 74 90 93 0.991 6.78 4.04 Intr - 116249 116134 116 0 2 71 115 79 0.796 8.15 4.03 Intr - 116484 116337 148 0 1 97 116 23 0.541 4.89 4.02 Intr - 119013 118875 139 2 1 69 69 176 0.669 13.35 4.01 Init - 167712 166208 1505 0 2 35 36 1381 0.006 119.21 4.00 Prom - 176109 176070 40 -5.95 5.19 PlyA - 177888 177883 6 1.05 5.18 Term - 179850 179784 67 2 1 96 38 85 0.032 0.73 5.17 Intr - 218047 217620 428 1 2 63 105 359 0.080 26.76 5.16 Intr - 218356 218219 138 1 0 19 51 201 0.072 9.24 5.15 Intr - 231832 231696 137 2 2 38 -1 104 0.015 -4.13 5.14 Intr - 232289 232221 69 0 0 71 115 39 0.858 3.14 5.13 Intr - 235829 235727 103 2 1 96 107 45 0.930 6.03 5.12 Intr - 246213 246105 109 2 1 73 65 30 0.098 -1.43 5.11 Intr - 250226 250100 127 0 1 103 63 68 0.726 4.62 5.10 Intr - 252405 252225 181 0 1 85 86 50 0.575 3.02 5.09 Intr - 252999 252954 46 2 1 134 64 9 0.548 0.59 5.08 Intr - 255563 255530 34 0 1 99 106 -8 0.559 -1.54 5.07 Intr - 256259 256189 71 1 2 77 116 61 0.901 5.71 5.06 Intr - 256554 256382 173 0 2 2 87 109 0.728 0.02 5.05 Intr - 257801 257689 113 0 2 67 96 51 0.965 2.98 5.04 Intr - 260120 259980 141 0 0 52 93 130 0.979 9.30 5.03 Intr - 263070 262653 418 0 1 15 61 441 0.874 26.77 5.02 Intr - 264296 264218 79 1 1 64 92 54 0.862 2.03 5.01 Intr - 265089 264976 114 2 0 58 91 72 0.689 3.14 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 167712 166198 1515 0 0 35 36 1387 0.993 123.33 S.002 Init - 218398 218219 180 1 0 65 51 177 0.847 10.93 S.003 Term - 231832 231675 158 2 2 38 28 141 0.866 0.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:77465643_77733354|GENSCAN_predicted_peptide_1|212_aa MKPRTLVVTVTVHEDGVSGVCSFTCSDVSGVSSFRVPIGPFYSVPISPFYRVLIGPFYRV LIGPFYRMPIGAFLQSADCLHVSPMLDASYPQTPSSSVLRLGLALLAPELADSLLWDLVI TLKHFLIITVIEATVFHQPLRSLQSDLDPTSPAFIIPQHECLCSIPKALEGKPEWKVEPT SLFGCRKCQILAFPSSLALWVWVCDQIRAKRM >gi568815597r:77465643_77733354|GENSCAN_predicted_CDS_1|639_bp atgaagccacggaccctcgtggtgactgttacagttcatgaagatggtgtgtccggagtt tgttccttcacatgttcggatgtgtccggagtttcttccttccgagtgccgattggtcca ttttacagcgtgccaattagtccattttacagagtgctgattggtccattttacagagtg ctgattggtccattttacagaatgccgattggtgcatttttacagagtgctgattgcctg catgtttcccccatgctggatgcttcctaccctcaaactccaagttcttcagttttgcga ctcggactggctctccttgctcctgagcttgcagacagcctgttgtgggaccttgtgatc accctgaagcacttcttgatcatcactgtcatagaagcaacagttttccatcaaccactc cgaagcctacagtctgacctggaccctacttctcctgctttcattattcctcaacatgaa tgtctgtgttcaatccccaaggctctagagggtaagcctgagtggaaagtagagcccacc tctctttttggatgcagaaaatgccagatccttgcatttccatcctcccttgcattatgg gtatgggtctgtgaccaaatcagggccaagaggatgtga >gi568815597r:77465643_77733354|GENSCAN_predicted_peptide_2|352_aa MLNPGAISQERIAPVNTKQTGKRDPRGHAEVMQMVSVLLLKEHQEMTLKGSGVKRHPISN LFPSVSESSLSSQPFCGPGSGKGTQCEKLVEKYGFTHLSTGELLREELASESERSKLIRD IMERGDLVPSGIVLELLKEAMVASLGDTRGFLIDGYPREVKQGEEFGRRTGFRMWQRRSL TRTKEQESHQHGDWTVTLNAKIHDSISNNTQLRTPGYSDSGVNRPVFGAWLCNSLAAKEA GSGLGQPRKGLPQCSGGLKGSSSAARVGAKAEEALTSSGGYWRPTVGDLYGLLGRHHDQP PSPKEPEQPACGRHHQDHRQAPRSLLPSVHPRDRLLRDKNTATQGESLHFLL >gi568815597r:77465643_77733354|GENSCAN_predicted_CDS_2|1059_bp atgctgaatccaggagctatctcacaggagaggattgctcctgtcaatacaaagcaaact ggaaaacgtgacccacgtgggcatgccgaggtgatgcagatggtttcagttctacttctc aaagaacaccaggaaatgacactaaaaggttctggagtaaagaggcatccaattagcaac ttattcccaagtgtttctgaaagttctttgtcctcacaacccttctgtggtcctggctct ggcaaaggcacacagtgtgaaaagctggtggaaaaatatggatttacacatctctcaact ggcgagctcctgcgtgaggaactggcatcagaatctgaaagaagcaaattgatcagagac attatggaacgtggagacctggtgccctcaggcatcgttttggagctcctgaaggaggcc atggtggccagcctcggggacaccaggggcttcctgattgacggctatcctcgggaggtg aagcaaggggaagagttcggacgcaggaccggttttagaatgtggcagagaagaagccta actagaactaaggaacaagaaagtcatcaacatggtgactggacagttacactcaatgct aaaattcatgattccatttcaaacaacacacagctgagaacccctggttatagtgactct ggtgtgaacaggcctgtatttggagcctggctttgtaattcattggctgctaaggaagcc ggctccggccttgggcagcccagaaaggggctcccacagtgcagcggcgggctgaagggc tcctcaagcgcggccagagtgggcgccaaggcggaggaggcgctgacatcgagcgggggc tattggagacccacagttggtgatctgtatggactgctcggcagacaccatgaccaaccg ccttctccaaaggagccggagcagcctgcctgtggacgacaccaccaagaccatcgccaa gcgcctagaagcctactaccgagcgtccatccccgtgatcgcctactacgagacaaaaac acagctacacaaggcgagtcacttcactttctcctctga >gi568815597r:77465643_77733354|GENSCAN_predicted_peptide_3|188_aa MDYSHVCGDAGVKKSTVLPVLEKYSTYNHSAFFLLVKKELTVKQPQAGPSGMFQKKALLS KQTTAPCMLLPLKTFQWARYGAKRAKETLSKARPTTSFLSGIWSWESECRGNVQMRKLSA ANNPVPSPASPAVPALVTLATGKPHGYLQTNLSFLGPAHTVSMLGRTNVESSVLNETNNL DVHTVTDG >gi568815597r:77465643_77733354|GENSCAN_predicted_CDS_3|567_bp atggattactcacatgtttgtggtgatgctggtgtaaagaaatccactgtgctgccggtg ttggaaaagtatagcacgtacaatcacagtgcattttttctacttgttaaaaaagagcta actgtcaaacagcctcaggcaggtccttcaggaatgttccagaaaaaggccttgttatca aagcagacgacagctccatgcatgttactgcccctgaagaccttccagtgggcaagatac ggagctaagagagccaaggagaccctgagcaaagcaaggccaaccacatcatttctctca ggaatctggagctgggaatcagagtgccgtggtaatgtgcaaatgcgaaagctgtcagcg gcaaacaatcctgtcccctcccctgcttctcctgcagtccccgctcttgtcaccctagct actggcaaaccccacgggtacctgcagacaaatctcagcttcctgggcccagcacacact gtttctatgctaggccgaaccaacgtggagtcatccgtgctaaatgaaacaaacaacttg gatgtacacactgtaacagatggctga >gi568815597r:77465643_77733354|GENSCAN_predicted_peptide_4|871_aa MAASRSTRVTRSTVGLNGLDESFCGRTLRNRSIAHPEEISSNSQVRSRSPKKRPEPVPIQ KGNNNGRTTDLKQQSTRESWVSPRKRGLSSSEKDNIERQAIENCERRQTEPVSPVLKRIK RCLRSEAPNSSEEDSPIKSDKESVEQRSTVVDNDADFQGTKRACRCLILDDCEKREIKKV NVSEEGPLNSAVVEEITGYLAVNGVDDSDSAVINCDDCQPDGNTKQNSIGSYVLQEKSVA ENGDTDTQTSMFLDSRKEDSYIDHKVPCTDSQVQVKLEDHKIVTACLPVEHVNQLTTEPA TGPFSETQSSLRDSEEEVDVVGDSSASKEQCKENTNNELDTSLESMPASGEPEPSPVLDC VSAQMMSLSEPQEHRYTLRTSPRRAAPTRGSPTKNSSPYRENGQFEENNLSPNETNATVS DNVSQSPTNPGEISQNEKGICCDSQNNGSEGVSKPPSEARLNIGHLPSAKESASQHITEE EDDDPDVYYFESDHVALKHNKDYQRLLQTIAVLEAQRSQAVQDLESLGRHQREALKNPIG FVEKLQKKADIGLPYPQRVVQLPEIVWDQYTHSLGNFEREFKNRKRHTRRVKLVFDKVGL PARPKSPLDPKKDGESLSYSMLPLSDGPEGSSSRPQMIRGRLCDDTKPETFNQLWTVEEQ KKLEQLLIKYPPEEVESRRWQKIADELGNRTAKQSSTSRRQHPLNKHLFKPSTFMTSHEP PVYMDEDDDRSCFHSHMNTAVEDASDDESIPIMYRNLPEYKELLQFKKLKKQKLQQMQAE SGFVQHVGFKCDNCGIEPIQGVRWHCQDCPPEMSLDFCDSCSDCLHETDIHKEDHQLEPI YRSETFLDRDYCVSQGTSYNYLDPNYFPANR >gi568815597r:77465643_77733354|GENSCAN_predicted_CDS_4|2616_bp atggctgcttcccgatctactcgtgttacaagatcaacagtggggttaaacggcttggat gaatctttttgtggtagaactttaaggaatcgtagcattgcgcatcctgaagaaatctct tctaattctcaagtacgatcaagatcaccaaagaagagaccagagcctgtgccaattcag aaaggaaataataatgggagaaccactgatttaaaacagcagagtacccgagaatcatgg gtaagccctaggaaaagaggactttcttcttcagaaaaggataacatagaaaggcaggct atagaaaattgtgagagaaggcaaacagaacctgtttcaccagttttaaaaagaattaag cgttgtcttagatctgaagcaccaaacagttcagaagaagattctcctataaaatcagac aaggagtcagtagaacagaggagtacagtagtggacaatgatgcagattttcaagggact aaacgagcttgtcgatgtcttatactggatgattgtgagaaaagggaaattaaaaaggtg aatgtcagtgaggaagggccacttaattctgcagtagttgaagaaatcacaggctatttg gctgtcaatggtgttgatgacagtgattcagctgttataaactgtgatgactgtcagcct gatgggaacactaaacaaaatagcattggttcctatgtgttacaggaaaaatcagtagct gaaaatggggatacggatacccaaacttcaatgttccttgatagtaggaaggaggacagt tatatagaccataaggtgccttgcacagattcacaagtgcaggtcaagttggaggaccac aaaatagtaactgcctgcttgcctgtggaacatgttaatcagctgactactgagccagct acagggcccttttctgaaactcagtcatctttaagggattctgaggaggaagtagatgtg gtgggagatagcagtgcctcaaaagagcagtgtaaagaaaacaccaataacgaactggac acaagtcttgagagtatgccagcctccggagaacctgaaccatctcctgttctagactgt gtttcagctcaaatgatgtctttatcagaacctcaagaacatcgttatactctgagaacc tcaccacgaagggcagcccctaccagaggtagtcccactaaaaacagttctccttacaga gaaaatggacaatttgaggagaataatcttagtcctaatgaaacaaatgcaactgttagt gataatgtaagtcaatctcctacaaatcctggtgaaatttctcaaaatgaaaaagggata tgttgtgactctcaaaataatggaagtgaaggagtaagtaaaccaccctcagaggcaaga ctcaatattggacatttgccatctgccaaagagagtgccagtcagcacattacagaagag gaagatgatgatcctgatgtttattactttgaatcagatcatgtggcactgaaacacaac aaagattatcagagactattacagacgattgctgtactcgaggctcagcgttctcaagca gtccaagaccttgaaagtttaggcaggcaccagagagaagcactgaaaaatcccattgga tttgtggaaaaactccagaagaaggctgatattgggcttccatatccacagagagttgtt caattgcctgagatcgtatgggaccaatatacccatagccttgggaattttgaaagagaa tttaaaaatcgtaaaagacatactagaagagttaagctagtttttgataaagtaggttta cctgctagaccaaaaagtcctttagatcctaagaaggatggagagtccctttcatattct atgttgcctttgagtgatggtccagaaggctcaagcagtcgtcctcagatgataagagga cgcttgtgtgatgataccaaacctgaaacatttaaccagttgtggactgttgaagaacag aaaaagctggaacagctactcatcaaataccctcctgaagaagtagaatctcgacgctgg cagaagatagcagatgaattgggcaacaggacagcaaaacagtcttcaacaagcagacga cagcaccctcttaataagcatctctttaagccttccactttcatgacttcacatgaaccg ccagtgtatatggatgaagatgatgaccgatcttgttttcatagccacatgaacactgct gttgaagatgcatcagatgacgaaagtattcctatcatgtataggaatttacctgaatat aaagaactattacagtttaaaaagttaaagaagcagaaacttcagcaaatgcaagctgaa agtggatttgtgcaacatgtgggctttaagtgtgataactgtggcatagaacccatccag ggtgttcggtggcattgccaggattgtcctccagaaatgtctttggatttctgtgattct tgttcagactgtctacatgaaacagatattcacaaggaagatcaccaattagaacctatt tataggtcagagacattcttagacagagactactgtgtgtctcagggcaccagttacaat taccttgacccaaactactttccagcaaacagatga >gi568815597r:77465643_77733354|GENSCAN_predicted_peptide_5|849_aa XPPLTQFFLDCGGLARTDKKPAICKSYLKLMTELWHKSRPGSVVPTTLFQGIKTVNPTFR GYSQQDAQEFLRCLMDLLHEELKEQVMEVEEDPQTITTEETMEEDKSQSDVDFQSCESCS NSDRAENENGSRCFSEDNNETTMLIQDDENNSEMSKDWQKEKMCNKINKVNSEGEFDKDR DSISETVDLNNQETVKVQIHSRASEYITDVHSNDLSTPQILPSNEGVNPRLSASPPKSGN LWPGLAPPHKKAQSASPKRKKQHKKYRSVISDIFDGTIISSVQCLTCDRVSVTLETFQDL SLPIPGKEDLAKLHSSSHPTSIVKAGSCGEAYAPQGWIAFFMEYVKSWFWGPVVTLQDCL AAFFARDELKGDNMYSCEKCKKLRNGVKFCKVQNFPEILCIHLKRFRHELMFSTKISTHV SFPLEGLDLQPFLAKDSPAQIVTYDLLSVICHHGTASSGHYIAYCRNNLNNLWYEFDDQS VTEVSESTVQNAEAYVLFYRYGGGPAVNHLYICHTCQIEAEKIEKRRKTELEIFIRLNRA FQKEDSPATFYCISMQWFREWESFVKGKDGDPPGPIDNTKIAVTKCGNVMLRQGADSGQI SEETWNFLQSIYGGGPEVILRPPVVHVDPDILQAEEKIEPSEKGPEICIEHRGVKKNLEE MFCQRNRQAQRPEVVRACVLWYKPEGQVGRLQGMDCNPPSEDQTSPRGRDWPGRASAAPD PRALHGPRLPAGGVVSAPAAAATAAAAVAAAAAARQSQRGERRGPEGAAGAAGSGRSGRR GPTWKRRGAAIELPAAVATRASAVAGAESGHGGGASPSGPLRPTPRERRPGHTLLPGDHH IDAELNADT >gi568815597r:77465643_77733354|GENSCAN_predicted_CDS_5|2550_bp nncccacctttgacacagttttttcttgattgtggaggactagctcgaacagataagaaa cctgccatttgtaaaagttatctcaaactaatgacagagctgtggcataaaagcaggcca ggatctgttgtgcctactactctgtttcaaggaattaaaactgtaaatccaacatttcgg gggtattctcagcaggatgctcaagaattccttcgatgtttaatggatttgcttcatgaa gaattgaaagagcaagtcatggaagtagaagaagatccgcaaaccataaccactgaggag acaatggaagaagacaagagccagtcggatgtagattttcagtcttgtgaatcttgtagc aacagtgatagagcagaaaatgaaaatggctctagatgcttttctgaagataataatgaa acaacaatgttaattcaggatgatgaaaacaattcagaaatgtcaaaggattggcaaaaa gagaagatgtgcaataagattaataaagtaaattctgaaggcgaatttgataaagataga gactctatatctgaaacagtcgacttaaacaaccaggaaactgtcaaagtgcaaatacac agcagagcttcagaatatatcactgatgtccattcgaatgacctgtctacaccacagatc cttccatcaaatgaaggtgttaatccacgtttatcggcaagccctcctaaatcaggcaat ttgtggccaggattggcaccaccacacaaaaaagctcagtctgcatctccaaagagaaaa aaacagcacaagaaatacagaagtgttatttcagacatatttgatggaacaatcattagt tcagtgcagtgtctgacttgtgacagggtgtctgtaaccctcgagacctttcaagatctg tccttgccaattcctggcaaggaagaccttgctaagctgcattcatcaagtcatccaact tctatagtcaaagcaggatcatgtggcgaagcatatgctccacaagggtggatagctttt ttcatggaatatgtgaagagctggttttggggtccagtagtaaccttgcaagattgtctt gctgccttctttgccagagatgaactaaaaggtgacaatatgtacagttgtgaaaaatgc aaaaagttgagaaatggagtgaagttttgtaaagtacaaaactttcctgagattttgtgc atccaccttaaaagattcagacatgaactaatgttttccaccaaaatcagtacccatgtt tcatttccgctagaaggcttggatcttcagccatttcttgctaaggatagtccagctcaa attgtgacatatgatcttctgtcagtcatttgccatcatggaactgcaagtagtggacac tatatagcctactgccgaaacaatctaaataatctctggtatgaatttgatgatcagagt gtcactgaagtttcagaatctactgtacaaaatgcagaagcttacgttcttttctatagg tatggtggaggaccagctgtcaaccatctgtacatttgtcatacttgccaaattgaggcg gagaaaattgaaaaaagaagaaaaactgaattggaaatttttattcggcttaacagagcg ttccaaaaagaggactctccagctactttttattgcatcagtatgcagtggtttagagaa tgggaaagttttgtgaagggtaaagatggagatcctccaggtcctattgacaatactaag attgcagtcactaaatgtggtaatgtgatgcttaggcaaggagcagattctggccagatt tctgaagaaacatggaattttctgcagtctatttatggtggagggcctgaagttatcctg cgacctccggttgttcatgttgatccagatatacttcaagcagaagaaaaaattgaaccc tctgagaagggacctgaaatatgcatcgaacatcgaggagttaaaaaaaatctggaggaa atgttctgccagagaaatcggcaagcacaaaggcccgaagtggttcgagcctgcgtgctg tggtacaagccagagggccaggttggacggttgcaagggatggactgcaacccgccctcg gaagaccaaacttcgcccaggggcagggactggccggggagggcctcggcggcgcccgac ccgcgggcgctgcacgggccgcggttaccagcaggcggtgtagtgagcgcgcctgcagca gcagcaacagcagcagcagcggtcgccgccgccgccgccgcccgccagtcccaacgagga gaaaggaggggaccggaaggagccgctggtgctgcgggaagtggcaggagcgggaggcgg ggacccacctggaagcgccgcggcgccgctatcgagcttcctgcagcggtggccacccga gcaagtgccgtggcgggggcggagagcggccacggcggcggcgcctccccaagtggcccg ttgcgtccgaccccgcgtgaaaggcgacctggtcataccctgctccctggagatcaccat attgatgccgaacttaatgcagacacctga