GENSCAN 1.0	Date run: 8-Nov-116	Time: 06:22:14

Sequence gi568815588r:73398632_73617689 : 219058 bp : 41.23% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +  15426  15569  144  2  0   94   55   176 0.606   6.56
 1.02 PlyA +  16868  16873    6                               1.05

 2.18 PlyA -  19725  19720    6                               1.05
 2.17 Term -  23773  23628  146  2  2   21   39   215 0.866   6.99
 2.16 Intr -  26710  26654   57  2  0   31  100    87 0.809   2.14
 2.15 Intr -  27746  27180  567  0  0  107  110   433 0.856  39.22
 2.14 Intr -  28100  27976  125  1  2   73   49   107 0.781   4.61
 2.13 Intr -  29137  28982  156  0  0  112   47    95 0.806   6.00
 2.12 Intr -  29670  29433  238  1  1   68   78   251 0.328  17.85
 2.11 Intr -  36062  35955  108  0  0   69  111    38 0.301   3.54
 2.10 Intr -  46191  46094   98  2  2   70   40   104 0.854   2.53
 2.09 Intr -  47945  47861   85  0  1   15   95    90 0.213   0.36
 2.08 Intr -  55858  55781   78  2  0   73   60    77 0.211   2.00
 2.07 Intr -  69047  68922  126  0  0   73   70    79 0.923   4.33
 2.06 Intr -  72150  72056   95  2  2   68  116    25 0.898   1.99
 2.05 Intr -  72333  72256   78  2  0  125   90    38 0.973   5.45
 2.04 Intr -  72578  72439  140  2  2  116   69    55 0.998   4.74
 2.03 Intr -  72982  72837  146  2  2  119   89    62 0.997   8.48
 2.02 Intr -  80886  80686  201  1  0   34  115   258 0.981  21.44
 2.01 Init -  97258  97174   85  1  1  101  100   109 0.999  12.62
 2.00 Prom -  98031  97992   40                             -11.64

 3.25 PlyA -  98934  98929    6                               1.05
 3.24 Term - 100557  99998  560  1  2  110   43   192 0.986  10.42
 3.23 Intr - 102207 102024  184  0  1   55   93    92 0.950   4.84
 3.22 Intr - 106359 106219  141  0  0   44   78    65 0.198   0.73
 3.21 Intr - 106795 106677  119  2  2  117   69    40 0.153   4.26
 3.20 Intr - 119116 117744 1373  0  2   72  110   588 0.988  46.88
 3.19 Intr - 121361 121166  196  0  1   89   67   153 0.993  10.85
 3.18 Intr - 122396 122277  120  0  0   56   84   141 0.999  10.05
 3.17 Intr - 123205 123107   99  2  0   68   73    50 0.527   0.66
 3.16 Intr - 125119 124952  168  2  0  100   53   247 0.852  21.40
 3.15 Intr - 131280 131005  276  1  0   16   56   231 0.596   9.27
 3.14 Intr - 131892 131512  381  1  0   90   94   199 0.975  14.66
 3.13 Intr - 132204 132079  126  1  0  113  -37   110 0.537   0.73
 3.12 Intr - 136139 135969  171  0  0  114   91   162 0.999  18.09
 3.11 Intr - 137806 137638  169  1  1   67   99   183 0.999  15.90
 3.10 Intr - 140962 140813  150  1  0  132   75   114 0.999  13.94
 3.09 Intr - 143107 143002  106  0  1   33   81   175 0.772  10.60
 3.08 Intr - 143338 143273   66  0  0  109   80    29 0.541   1.30
 3.07 Intr - 144203 144172   32  2  2   75   97   -18 0.503  -6.19
 3.06 Intr - 144500 144387  114  2  0   86   66    60 0.893   3.32
 3.05 Intr - 147041 146907  135  2  0  102   78   164 0.996  16.64
 3.04 Intr - 155016 154898  119  1  2   89   81    30 0.136   1.66
 3.03 Intr - 157189 157105   85  1  1   69   98    25 0.166   0.07
 3.02 Intr - 172882 172790   93  1  0   87  115    14 0.534   3.24
 3.01 Init - 177027 176881  147  0  0   51  110   106 0.653   9.24
 3.00 Prom - 183017 182978   40                              -0.85

 4.00 Prom + 184493 184532   40                              -6.15
 4.01 Init + 198396 198411   16  2  1   77  115    11 0.210   3.18
 4.02 Term + 206079 206188  110  1  2   86   43   122 0.753   5.19
 4.03 PlyA + 206900 206905    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  39789  39611  179  1  2   27   46   199 0.868   6.47

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:73398632_73617689|GENSCAN_predicted_peptide_1|47_aa
MEPGAGPGWAGRLASPLGSRLQPGTPPEPERALRGAGCPSVPPLGAT
>gi568815588r:73398632_73617689|GENSCAN_predicted_CDS_1|144_bp
atggagccgggggcgggcccagggtgggcggggcgtctcgcgagcccgttaggatcgcgg
ctgcagccgggaacacccccggagccggagagggcgctgcggggagcgggctgtccctca
gtcccgcccctaggcgccacctga
>gi568815588r:73398632_73617689|GENSCAN_predicted_peptide_2|842_aa
MAAPEPARAAPPPPPPPPPPPGADRVVKAVPFPPTHRLTSEEVFDLDGIPRVDVLKNHLV
KEGRVDEEIALRIINEGAAILRREKTMIEVEAPITGKIKYSERVYEACMEAFDSLPLAAL
LNQQFLCVHGGLSPEIHTLDDIRRLDRFKEPPAFGPMCDLLWSDPSEDFGNEKSQEHFSH
NTVRGCSYFYNYPAVCEFLQNNNLLSIIRAHEAQDAGYRMYRKSQTTGFPSLITIFSAPN
YLDVYNNKAAVLKYENNVMNIRQFNCSPHPYWLPNFMDVFTWSLPFVGEKVTEMLVNVLS
ICSDDELMTEGEDQFDVGSAAARKEIIRNKIRAIGKMARVFSVLREESESVLTLKGLTPT
GMLPSGVLAGGRQTLQSVCTFYSLGFLCGFDLPHSGTSLRVILFPSLMMFKEAGITGQAM
APRSRRRRHKKPPSSVAPIIMAPTTIVTPVPLTPSKPGPSIDTLGFFSLDDNVPGLSQLI
LQKLNMKSYEEYKLVVDGGTPVSGFGFRCPQEMFQRMEDTFRFCAHCRALPSGLSDSKVL
RHCKRCRNVYYCGPECQKSDWPAHRRVCQELRLVAVDRLMEWLLVTGDFVLPSGPWPWPP
EAVQDWDSWFSMKGLHLDATLDAVLVSHAVTTLWASVGRPRPDPDVLQGSLKRLLTDVLS
RPLTLGLGLRALGIDVRRTGGSTVHVVGASHVETFLTRPGDYDELGYMFPGHLGLRVVMV
GVDVATGFSQSTSTSPLEPGTIQLSAHRGLYHDFWEEQVETGQTHHPDLVAAFHPEFFIE
EWMMMMMIMTGDKQGIHPSKVVITRLKLDKDRKKILERKAKSRQVGKEKGKYKEETIVKM
QE
>gi568815588r:73398632_73617689|GENSCAN_predicted_CDS_2|2529_bp
atggccgccccggagccggcccgggctgcaccgcccccacccccgcccccgccgccccct
cccggggctgaccgcgtcgtcaaagctgtccctttccccccaacacatcgcttgacatct
gaagaagtatttgatttggatgggatacccagggttgatgttctgaagaaccacttggtg
aaagaaggtcgagtagatgaagaaattgcgcttagaattatcaatgagggtgctgccatc
cttcggagagagaaaaccatgatagaagtagaagctccaatcacaggtaaaattaagtat
tcggaaagagtctatgaagcttgtatggaagcttttgatagtttgcctcttgctgcactt
ttaaaccaacagtttctttgtgttcatggtggactttcaccagaaatacacacactggat
gatattaggagattagatagattcaaagagccacctgcatttggaccaatgtgtgacttg
ttatggtccgatccttctgaagattttggaaatgaaaaatcacaggaacattttagtcac
aatacagttcgaggatgttcttatttttataactatccagcagtgtgtgaatttttgcaa
aacaataatttgttatcgattattagagctcatgaagctcaagatgcaggctatagaatg
tacagaaaaagtcaaactacagggttcccttcattaataacaattttttcggcacctaat
tacttagatgtctacaataataaagctgctgtattaaagtatgaaaataatgtgatgaat
attcgacagtttaactgttctccacatccttactggttgcctaattttatggatgtcttc
acgtggtctttaccgtttgttggagaaaaagtgacagaaatgttggtaaatgttctgagt
atttgctctgatgatgaactaatgactgaaggtgaagaccagtttgatgtaggttcagct
gcagcccggaaagaaatcataagaaacaaaattcgagcaattggcaagatggcaagagtc
ttctctgttctcagggaggagagtgaaagtgtgctgacactcaagggcctgactcccaca
gggatgttgcctagtggagtgttagctggaggacggcagaccctgcaaagtgtttgtact
ttttacagtttggggttcctgtgtggatttgatttgccacattccgggacctcattgcga
gtgattctgttccccagtttgatgatgttcaaagaagcaggtatcactggacaggccatg
gctccacggtcccggcgacgaaggcacaagaaacctccctcatcagtggctcccatcatc
atggccccaaccacaattgtgacccctgtgcctctgaccccctcaaaacctggccctagc
attgacacacttggcttcttctccttggatgataatgttcctggcctatcgcagctgatc
cttcaaaagctgaacatgaaaagctatgaagaatataagttggtggtagatgggggtacc
cccgtatcaggctttggatttcgatgtcctcaagaaatgttccagaggatggaagacaca
tttcgattctgtgctcactgtagagcactccctagtgggctttcagactccaaggttctc
cggcactgtaagaggtgcagaaatgtctattactgtggtccagagtgccagaagtcagac
tggcccgcacacaggagggtttgtcaagagcttcgtcttgtggctgtggaccgtctcatg
gaatggcttctggtcacaggtgattttgttctaccctcaggaccttggccatggccacct
gaagctgtacaggactgggactcctggttttctatgaaggggttacacctagatgctaca
ttggatgctgtgctagttagtcatgctgtgaccaccttatgggccagtgtaggacggcca
aggccagacccggatgtcctgcagggatctttgaagcggctgctgacagatgtcctgtca
cggcccttgactctaggcctaggacttagggccttggggatagatgttaggaggactggg
ggaagcacagtgcatgtggttggtgcttcccatgtggagacatttcttactcgcccaggg
gactatgatgagcttggttacatgtttcctgggcaccttggactccgtgtggtcatggtg
ggtgtagatgtagctactggcttttcacagagcacctcaacttcacccctggaacctggc
acaattcagcttagtgcccacaggggcctctaccatgacttctgggaggagcaagtagag
accgggcagacacaccatccagatttggtggcggcattccatccagaatttttcattgaa
gagtggatgatgatgatgatgataatgactggggataagcaaggcattcaccctagcaag
gtggtcatcactaggctaaaactggacaaagaccgcaaaaagatccttgaacggaaagcc
aaatctcgccaagtaggaaaggaaaagggcaaatacaaggaagaaacaattgtgaagatg
caggaataa
>gi568815588r:73398632_73617689|GENSCAN_predicted_peptide_3|1709_aa
MSWKRNYFSGGRGSVQGMFAPRSSTSIAPSKGLSNEPGQNSCFLNSALQVLWHLDIFRRS
FRQLTTHKCMGDSCIFCALKNLWQILQFRQSQNLVHIQKGVPFQFFLWELSNWKNTSFCS
LLVLPPVLTAELNVLGFLWCEQDLPLGKGIFNQFQCSSEKVLPSDTLRSALAKTFQDEQR
FQLGIMDDAAECFENLLMRIHFHIADETKEDICTAQHCISHQKFAMTLFEQMVHYISTTS
LWLSTGQQSGRRSGDSMSFGSGDSNQAICMLERREKPSPSMFGELLQNASTMGDLRNCPL
FFRVTDDRAKQSELYLVGMICYYGKHYSTFFFQTKIRKWMYFDDAHVKEIGPKWKDVVTK
CIKGHYQPLLLLYADPQGTPVSTQDLPPQAEFQSYSRTCYDSEDSGREPSISSDTRTDSS
TESYPYKHSHHESVVSHFSSDSQGTVIYNVENDSMSQSSRDTGHLTDSECNQKHTSKKGS
LIERKRSSGRVRRKGDEPQASGYHRETLKEKQAPRNASKPSSSTNRLRDFKETVSNMIHN
RPSLASQTNVGSHCRGRGGDQPDKKPPRTLPLHSRDWEIESTSSESKSSSSSKYRPTWRP
KRESLNIDSIFSKDKRKHCGYTQLSPFSEDSAKEFIPDEPSKPPSYDIKFGGPSPQYKRW
GPARPGSHLLEQHPRLIQRMESGYESSERNSSSPVSLDAALPESSNVYRYCLAFEQQHCL
LQQCEEISSKSELDELQEEVARRAQEQELRRKREKELEAAKGFNPHPSRFMDLDELQNQD
SSAPALAHIHFERRMEYSLLAVWEGDFFGGSFGRSDGFERSLQEAESVFEESLHLEQKGD
CAAALALCNEAISKLRLALHGASCSTHSRALVDKKLQISIRKARSLQDRMQQQQSPQQPS
QPSACLPTQAGTLSQPTSEQPIPLQVLLSQEAQLESGMDTEFGASSFFHSPASCHESHSS
LSPESSAPQHSSPSRSALKLLTSVEVDNIEPSAFHRQGLPKAPGWTEKNSHHSWEPLDAP
EGKLQGSRCDNSSCSKLPPQEGRGIAQEQLFQEKKDPANPSPVMPGIATSERGDEHSLGC
SPSNSSAQPSLPLYRTCHPIMPVASSFVLHCPDPVQKTNQCLQGQSLKTSLTLKVDRGSE
ETYRPEFPSTKGLVRSLAEQFQRMQGVSMRDSTGFKDRSLSGSLRKNSSPSDSKPPFSQG
QEKGHWPWAKQQSSLEGGDRPLSWEESTEHSSLALNSGLPNGETSSGGQPRLAEPDIYQE
KLSQVRDVRSKDLGSSTDLGTSLPLDSWVNITRFCDSQLKHGAPRPGMKSSPHDSHTCVT
YPERNHILLHPHWNQDTEQETSELESLYQASLQASQAGCSGWGQQDTAWHPLSQTGSADG
MGRRLHSAHDPGLSKTSTAEMEHGLHEARTVRTSQATPCRGLSRECGEDEQYSAENLRRI
SRSLSGTVVSEREEAPVSSHSFDSSNVRKPLETGHRCSSSSSLPVIHDPSVFLLGPQLYL
PQPQFLSPDVLMPTMAGEPNRLPGTSRSVQQFLAMCDRGETSQGAKYTGRTLNYQSLPHR
SRTDNSWAPWSETNQHIGTRFLTTPGCNPQLTYTATLPERSKGLQVPHTQSWSDLFHSPS
HPPIVHPVYPPSSSLHVPLRSAWNSDPVPGSRTPGPRRVDMPPDDDWRQSSYASHSGHRR
TVGEGFLFVLSDAPRREQIRARVLQHSQW
>gi568815588r:73398632_73617689|GENSCAN_predicted_CDS_3|5130_bp
atgtcttggaagagaaattatttttcagggggtcgtggtagtgtacaagggatgtttgca
cctcgaagctcaacctccatagcccccagcaaaggcctcagcaatgagccagggcaaaac
agctgcttcctcaacagtgccctgcaggttttgtggcacttggatatcttccgacgtagc
tttaggcagcttacaactcacaagtgcatgggagattcctgcatcttttgcgctctcaag
aatttatggcagatcctacagtttagacaatctcaaaaccttgtgcatatccagaagggt
gtcccctttcagtttttcctgtgggaactcagcaattggaagaataccagtttttgttcc
ttattggtgctgccacctgttcttactgctgaactaaatgttcttggctttctttggtgt
gaacaggatctccctctgggaaagggaatctttaaccagtttcagtgtagtagtgaaaaa
gtgcttccatctgacactctccgcagtgctctggcaaagactttccaggatgaacaacgt
ttccagctgggaattatggatgatgctgcagagtgctttgaaaacctcctgatgagaatt
cacttccacattgctgatgaaaccaaagaggatatatgtactgcccaacactgcatttcc
catcagaaatttgcaatgacattgtttgagcagatggtacattatatctccaccacttcc
ctttggctcagtactggccagcaaagtgggaggaggagtggggattctatgagctttgga
tctggggacagcaatcaggctatttgtatgctggaaagacgagagaaaccttcaccaagc
atgtttggtgagctgctgcagaatgccagcaccatgggggatctgcggaactgtccactg
tttttcagagtgacggatgaccgggccaagcaatctgaactgtacttagttggaatgatc
tgttactatggcaaacattattctacattcttttttcaaacaaagattcgcaaatggatg
tattttgatgatgctcatgtcaaggagattgggcccaaatggaaggatgtggtgaccaaa
tgcatcaaggggcattatcagcccctgctgctgctttatgcagatccccagggtacccca
gtttccacccaggacctgcctccccaagctgagttccagtcatacagcaggacatgctac
gacagtgaagattcagggagggagccctccatctcaagtgacactcgaacagattcctca
acggagagctatccctacaaacattcccaccatgagtctgtggtcagtcacttctcttct
gattctcaggggacagtcatctataatgtggaaaatgattccatgtctcagagcagtcgg
gacacaggacacctgactgatagtgaatgtaatcagaaacacacatccaagaaagggtca
ctgatagagcgcaagaggagctctggtcgggttaggaggaaaggcgatgagccccaggcc
tcgggataccacagagaaacactgaaagagaagcaggctcctagaaatgcctccaaacca
tccagcagcaccaacaggctgagagattttaaagagacagtcagcaatatgatccataac
agaccatccctggcttctcagaccaatgtaggctctcactgcaggggcagaggaggagac
cagcctgacaaaaaacctcctaggaccctgcctttacactctcgtgactgggaaatagag
agtaccagcagtgagtcaaaatccagttcttccagcaagtatcgtcccacatggagaccc
aaacgagaatctctgaatattgacagtatctttagtaaggacaaaaggaagcactgtggc
tatacccagcttagccccttttctgaggattcagctaaagaatttataccagatgaacca
agcaagccaccttcttacgacattaaatttggtggaccaagcccccagtacaagcgctgg
ggcccagcacggccaggctctcaccttttagagcagcacccccgactaatccagcgaatg
gaatctggctatgaaagcagtgagaggaacagcagcagccctgtcagcctggatgcagcc
ctgcctgagagctcaaatgtctacaggtattgtttggcctttgaacagcaacattgtctc
ttgcagcaatgtgaggagatatcttctaaaagtgaactggatgaattgcaggaagaggtg
gccaggagggcgcaggaacaggaacttcgaagaaaacgggagaaggagttagaggcagcg
aaagggtttaaccctcatcctagccgcttcatggacttggatgaactgcagaatcaggac
tcttctgcacctgctttggcacacattcactttgagaggaggatggaatattccctgctt
gctgtatgggaaggtgacttctttgggggtagttttgggaggagtgacggctttgagagg
tccctgcaagaggcagagtcagtgtttgaagagtcactacatctggaacagaaaggagac
tgtgctgcagctttggctctctgtaatgaagctatctctaaactaagacttgccctgcat
ggtgccagctgtagcacgcacagcagagccctagtcgataagaagttgcaaatcagtatt
cgaaaagcacggagcctgcaggatcgcatgcagcagcagcaatcaccacagcagccgtcg
cagccctcagcctgcctcccaacacaggcggggactctctctcagccaacaagtgaacag
cctatcccgctccaagtattgttaagccaagaggcccaactggaatccggcatggataca
gagtttggggccagttctttcttccattcacctgcttcctgccatgagtcacactcatca
ctatctccagagtcatctgccccacagcacagctcccccagtagatctgccttgaagctt
ctgacttcggttgaagtagacaacattgaaccctctgcattccacaggcaaggtttacct
aaagcaccagggtggactgagaagaattctcatcatagttgggagccattggatgcccca
gagggtaagctgcaaggctctaggtgtgacaacagcagttgcagcaagctccctccacaa
gaaggaagaggcattgctcaagaacagctgttccaagaaaagaaggatcctgctaacccc
tccccggtgatgcctggaatagccacctctgagaggggtgatgaacacagcctaggctgt
agtccttcaaattcatcagctcagcccagccttcccctgtatagaacctgccaccccata
atgcctgttgcttcttcatttgtgcttcactgtcctgatcctgtgcagaaaactaaccaa
tgcctccaaggccaaagcctcaaaacttcattgactttaaaagtggacagaggcagtgag
gagacctataggccagagtttcccagcacaaaggggcttgtccgttctctggctgagcag
ttccagaggatgcagggtgtctccatgagggatagtacaggtttcaaggatagaagtttg
tcaggtagtctaaggaagaactcttccccttctgattctaagcctcctttctcacagggt
caagagaaaggccactggccatgggcaaagcaacaatcctctctggagggtggggataga
ccactttcctgggaagagtccactgaacattcttctcttgccttaaactctgggctgcct
aatggtgaaacttctagcggaggacagcccaggttggcagagccagacatataccaagag
aagctgtcccaagtgagagatgttaggtctaaggatctgggcagcagtactgacttgggg
acttccttgcctttggattcctgggtgaatatcacaaggttctgtgattctcagcttaag
catggggcacctaggccaggaatgaagtcctcccctcatgattcccatacgtgtgtaacc
tatccagagagaaatcacatccttttgcatccacattggaaccaagacacagagcaggag
acctcagaattggagtctctgtatcaggccagtcttcaggcttctcaagctggctgttct
ggatgggggcagcaggataccgcctggcacccacttagccaaacaggctctgcagatggc
atggggaggaggttgcactcagcccatgatcctggtctctcaaagacttcaacagcagaa
atggagcatggtctccatgaagccagaacagtgcgtacttctcaggctacaccttgccga
ggcctcagcagggagtgtggggaggatgagcagtacagtgcagagaatttacgtcgcatc
tcacgcagtctcagtggcaccgttgtctcagagagggaggaagctccggtttcttcccac
agttttgattcatcaaacgtgaggaagcctttggaaaccgggcaccgttgttccagctcc
tcttccctccctgtcatccatgacccttctgtgtttctcctcggtccccaactctacctt
ccccaaccacagttcctgtccccagatgtcctgatgcccaccatggcaggggagcccaat
agactcccaggaacttcaaggagtgtccagcagtttctggctatgtgtgacaggggtgaa
acttcccaaggggccaagtacacaggaaggactttgaactaccagagcctcccccatcgc
tccagaacagacaactcctgggcaccctggtcagagaccaaccagcatattgggaccaga
ttcctgactactccagggtgcaatcctcaactaacctacactgccacactaccagaaaga
agcaagggccttcaggttcctcacactcagtcctggagtgatcttttccattcaccctcc
caccctcccattgttcatcctgtgtacccaccatctagcagtcttcatgtacccctgagg
tcagcttggaattcagatcctgttccagggtcccgaacccctggtcctcgaagagtagat
atgcccccagatgatgactggaggcaaagcagttatgcctcccactctggacacaggaga
acagtgggagaggggtttctgtttgttctatcagatgctcccagaagagagcagatcagg
gctagagtcctgcagcacagtcaatggtaa
>gi568815588r:73398632_73617689|GENSCAN_predicted_peptide_4|41_aa
MAIVHASQAAGVSRVGSFQWVLGLADFKNEAADLHDECYSS
>gi568815588r:73398632_73617689|GENSCAN_predicted_CDS_4|126_bp
atggccatagttcatgcctcccaagcagctggtgtgtccagagttggttccttccagtgg
gttcttggtctcgctgacttcaagaacgaagccgcagatcttcacgatgagtgttacagc
tcttaa