GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:13:54 Sequence gi568815597r:85167630_85376352 : 208723 bp : 40.15% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 Intr - 1596 1415 182 1 2 70 121 142 0.605 14.37 1.11 Intr - 10643 10517 127 2 1 69 90 151 0.996 12.73 1.10 Intr - 15571 14469 1103 2 2 104 67 581 0.319 46.28 1.09 Intr - 23123 22428 696 0 0 56 98 564 0.656 44.73 1.08 Intr - 34014 32623 1392 1 0 72 91 919 0.022 78.75 1.07 Intr - 40685 40611 75 0 0 61 80 59 0.021 0.97 1.06 Intr - 66162 65788 375 1 0 19 46 214 0.004 4.36 1.05 Intr - 81545 81441 105 0 0 24 86 78 0.091 0.47 1.04 Intr - 81958 81675 284 0 2 33 66 182 0.226 6.64 1.03 Intr - 85073 85004 70 0 1 85 9 98 0.131 -1.08 1.02 Intr - 91093 90895 199 1 1 54 110 93 0.788 6.10 1.01 Init - 92004 91729 276 0 0 99 78 313 0.487 28.43 1.00 Prom - 98232 98193 40 -3.85 2.04 PlyA - 98638 98633 6 1.05 2.03 Term - 100353 99998 356 1 2 43 47 231 0.692 8.17 2.02 Intr - 103277 102989 289 2 1 86 116 172 0.994 15.70 2.01 Init - 108723 108502 222 0 0 107 -34 272 0.099 15.50 2.00 Prom - 109830 109791 40 -5.75 3.04 PlyA - 109956 109951 6 1.05 3.03 Term - 116490 116015 476 1 2 93 44 150 0.890 5.16 3.02 Intr - 117181 117102 80 0 2 41 84 68 0.673 -0.02 3.01 Init - 117378 117221 158 0 2 65 91 105 0.731 7.93 3.00 Prom - 120101 120062 40 -5.05 4.00 Prom + 126113 126152 40 -3.25 4.01 Init + 127928 128150 223 1 1 70 36 117 0.010 3.47 4.02 Intr + 131274 131418 145 1 1 86 36 87 0.009 1.72 4.03 Intr + 135830 136099 270 2 0 70 47 129 0.013 2.73 4.04 Intr + 140723 140906 184 2 1 63 75 205 0.137 15.57 4.05 Term + 148365 148463 99 2 0 29 43 123 0.263 -0.95 4.06 PlyA + 148979 148984 6 1.05 5.04 PlyA - 150880 150875 6 1.05 5.03 Term - 153939 153823 117 0 0 95 49 114 0.895 5.76 5.02 Intr - 157254 157111 144 0 0 97 93 132 0.991 14.16 5.01 Init - 163592 163311 282 2 0 59 59 208 0.980 12.09 5.00 Prom - 171521 171482 40 -8.05 6.06 PlyA - 172115 172110 6 1.05 6.05 Term - 178990 178874 117 1 0 65 41 77 0.590 -1.74 6.04 Intr - 182905 182786 120 1 0 89 94 90 0.951 9.47 6.03 Intr - 183950 183877 74 0 2 68 100 45 0.626 1.91 6.02 Intr - 191218 191119 100 1 1 61 80 90 0.627 4.26 6.01 Init - 200169 199978 192 0 0 58 67 115 0.426 5.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:85167630_85376352|GENSCAN_predicted_peptide_1|1628_aa MAAEEKDPLSYFAAYGSSSSGSSDEEDNIEPEETSRRTPDPAKSAGGCRNKAEKRLPGPD ELFRSVTRPAFLYNPLNKQIDWERHVVKAPEEPPKEFKIWKSNYVPPPETYTTEKKPPPP ELDMAIKWSNIYEDNGDDAPQNAKKARLLPEGEETLESDDEKDEHTSKKRKVEPGEPAKK KKDLWNFELERDNLGYLVEEIFKQQSIQEVTWMPLKAFSFIREAELESSENLQSDNAIEK KIPFSEKKFNLAAEICLSNEEPNVNPQDNGENVSRAATPAMAERSQRRAQAMASEHASPK PWQLPCGVEPADAEKAFSKILHLFMIKTINELGLKGTYLKIIRAIIDKPIANIIQNGQKL ERFPVRTGTRQKYPFSPFLFNIVPEVLARAVKQEKERKGIHTGKEVKLSLFADDMIPYLE NLKHSTKRLLELINDFTTLSLPSANAEYEGKSVSTRAEVLPTGLRPLPLAGVSLAAKAEG MADPLRRTLSRLRGRRGPRGTGGLGLRAAAAAAVAASSAAAGDAWGAADTLPREHAGGHG RSLQQPSPSPEAWGPGARVPGGHPEQLGALGPRPRGGQEAAPQSHGLAHAPPHSPEGSED SGEEEEDDKDEDDYDADYYENLPGGSQSAPEPEGAEAERRPPPPPAAGSSLGAEGGRLET GRLRTQLREAYYLLIQAMHDLPPDSGARRGGRGLADHSFPAGARAPGQPPSRGAAYRRAC PRDGERGGGGRPRQQVSPPRSPQREPRGGQLRTPRMRPSCSRSLESLRVGAKPPPFQRWP SDSWIRCGAHRDWDEPPPRGGRMDGWSGDRARAAAPTGLQPPGCKDHGCSSGSPFRDPAG SSVIRSGKGDRQEGPSFLRPPAVTVKKLQKWMYKGRLLSLGMKGRARGTAPKVTGTQAAS PNVGALKVRENRVLSVPPDQRITLTDLFENAYGSSMKGRELEELKDNIEFRGHKPLNSIT VSKKRNWLYQSTLRPLNLEEENKKCQDRSHLSISPVSLPKHQLSQSFLKSSKEYCTYVVC NATNSSLSKNCALDFNEENDADDEGEIWYNPIPEDDDLGISSALSFGEADSAVLKLPAVN LSMLSGSDLMKAERHTEDSLCSSEHAGDIQTTRSNGMNPIHPAHSTEFVQQYKQKLGHKT QEGIMVEDSPMLKSPFAGSGILAATNSTELGIMEPSSPNPSPVKKGSSINWSLPDKIKSP RTVRKLSMKMKKLPEFSRKLSVKGTLNYINSPDNTPSLSKYNCREVHHTDILPSGNTTTA AKRNVISRYHLDTSVSSQQSYQKKNSMSSKYSCKGGYLSDGDSPELTTKASKHGSENKFG KGKEIISNSCSKNEIDIDAFRHYSFSDQPKCSQYISGLMSVHFYGAEDLKPPRIDSKDVF CAIQVDSVNKARTALLTCRTTFLDMDHTFNIEIENAQHLKLVVFSWEPTPRKNRVCCHGT VVLPTLFRVTKTHQLAVKLEPRGLIYVKVTLMEQWENSLHGLDINQEPIIFGVDIQKVVE KENIGLMVPLLIQKCIMEIEKRGCQVVGLYRLCGSAAVKKELREAFERDSKAVGLCENQY PDINVITGVLKDYLRELPSPLITKQLYEAVLDAMAKSPLKMSSNGCENDPGDSKYTVDLL DCLPEIEK >gi568815597r:85167630_85376352|GENSCAN_predicted_CDS_1|4884_bp atggcagcggaggagaaggaccctctgagctattttgcggcatacgggagcagcagctca ggctcctcggacgaggaggataacatcgagccggaggagacgagtcgcagaaccccggat ccggcgaagtcggcgggcggctgtaggaacaaggcggagaagcggctcccgggacctgac gagctgtttaggagcgtgactcgcccggcctttctctacaatccgctcaacaaacagata gactgggagaggcacgtcgtcaaggcgcctgaggagcctccaaaggaattcaaaatatgg aagtcaaattatgtaccacctcctgagacctacaccactgagaagaagcctccgcctcca gagcttgacatggcaataaaatggtctaacatatatgaggacaatggtgatgatgctcca cagaatgctaagaaagctaggcttctaccagaaggggaggagacgttggaatcagatgat gaaaaagatgagcatacttctaaaaagcgcaaagtagagccaggagaaccagcaaagaag aaaaaagatttgtggaactttgaacttgagagagataatttagggtatttggtggaagaa atttttaagcagcaaagcattcaagaggttacttggatgccattaaaggcattcagtttt ataagggaagcagagcttgaaagttcagaaaatttgcagtctgacaatgccatagaaaag aaaatcccattttctgagaagaaatttaacctggctgcagaaatttgcttaagtaatgaa gagccaaatgttaatccccaagacaacggggaaaatgtctccagggcagccactccagcc atggctgaaaggagccaacgtagagctcaggccatggcttcggagcatgcaagccccaag ccttggcagcttccatgtggtgttgagcctgcagatgcagaaaaagctttcagtaaaatt ctacatctcttcatgataaaaaccatcaacgaactaggcctcaaaggaacatacctcaaa ataataagagccatcattgacaaacccatagccaacatcatacaaaatgggcaaaagctg gaacgcttccctgtgagaactggaacaagacaaaaatacccattctcaccattcctattc aacatagtaccggaagtcctagcaagagcagtcaagcaagagaaagaaagaaaaggcatc cacacaggaaaagaagtcaaactatctctctttgctgatgatatgattccatacctagaa aaccttaagcactccaccaaaaggctactagaactgataaatgattttaccacattaagc cttccaagtgccaatgcagagtatgaaggaaaatcagtttcaaccagggctgaggttttg ccaacggggctccgacctctccccctggcgggggtgagcctcgcggctaaggcagagggg atggcggaccctctgaggaggacgctgtccaggctccggggaaggcggggtccccgcggc accggggggctagggctccgggcggcagccgcagctgcggtggcggcctcttcggcagcc gcgggagacgcctggggtgccgcagacaccctcccacgtgagcacgccgggggacacggg cggagcctacagcagccctcgccgtcacctgaggcgtggggtcccggggcgcgggtcccc ggagggcacccggagcagttgggggcgctcgggcctcggccgcgcggcgggcaggaggcc gccccccagagccatgggctcgctcacgcccctccgcactccccggagggctccgaggac agcggagaggaggaggaagacgacaaggacgaagacgactacgacgccgactactacgaa aacctgcccggcggctcgcagtctgcgcccgagcctgagggggcggaggcggaacggcgt cccccgcctcccccagcggcgggctcctccctgggggcggagggcggccgcctggagaca ggcaggctgcggacccagttgcgagaggcctattatctgctgatccaggccatgcacgac ctgccccctgactcgggcgcgcggcggggcggcaggggcttggcggatcacagcttcccc gcgggagcccgggctccgggccagccgccttcccgcggcgccgcgtaccgccgagcctgc ccccgggacggggagcggggaggcggcggacgccctcggcagcaggtgtccccgccccgg tcgcctcagagggagccgcggggaggccagctgcggactcctcggatgcggccgtcctgc agcagaagcctcgagagcctccgggtgggtgccaagccgcctcccttccagcggtggccg agcgacagctggatcaggtgcggcgcgcaccgggactgggacgagcccccgccacgtgga ggcaggatggacggctggagtggggaccgcgcccgggcggctgcacccaccggcctccag cctccaggctgcaaggaccacggctgctcctcgggaagccctttcagggatccagcgggg tcctctgtgatacgcagtggcaaaggagaccgccaggaaggcccctccttcctcaggccg ccggcagtgacagtcaagaagctgcagaagtggatgtacaaagggcgtctgctgtccctg ggaatgaagggtcgtgcccgtgggacggctcccaaagtcacaggaacgcaggcagcctcc ccaaatgtgggcgctttgaaagtgcgtgaaaaccgtgtcctgtcggtgcctccagaccaa agaattacgctgacagatttatttgaaaatgcctatgggtcttcaatgaagggaagagaa cttgaagagctgaaggataatattgaattcagaggtcataagccacttaacagcatcact gtttcaaagaaacgcaattggctatatcagagtactctgaggcctcttaatctggaagaa gaaaataagaaatgccaagatagaagtcatttatccatctcacctgtgtctctacctaaa catcagctatcacagtctttcctcaaatcatctaaagagtactgtacatatgtggtatgt aacgctacaaactcttcattatcgaaaaactgtgctttagattttaatgaggaaaatgat gcagatgatgaaggagaaatatggtacaatcccattcctgaggatgatgaccttggtata tcaagtgccttgagttttggtgaggccgactctgctgttctgaagctccctgctgtcaat ttgagcatgttgtctggcagtgacctgatgaaagcagagcggcatactgaagactcactg tgctcttccgaacatgcaggtgatattcagaccacacggtcaaatggaatgaatcctata catcctgcccattccacagaatttgtgcagcagtacaagcaaaagctaggacacaagaca caagaaggtataatggtggaggacagtcccatgttgaaatctccttttgcaggttctggg atcctggctgctacaaatagtactgaattgggaattatggaaccatcttctccaaatcct agccctgtgaaaaaaggcagttcaattaattggtcattgccagataaaataaaatctcca cgaactgtgaggaaactttccatgaaaatgaaaaagttgccagaatttagccgaaagcta agcgttaagggaacattgaattatataaacagtccagataatactccttctttgtctaaa tataactgccgagaagttcatcatactgatattctgccctctgggaacacaaccaccgct gctaagaggaatgttataagccgataccatcttgataccagtgtatcctcccagcagagc taccagaagaaaaactctatgagttctaagtattcctgcaaaggtggttaccttagtgat ggagactcacctgaacttacaactaaagctagcaaacatggatctgaaaacaaatttgga aaaggaaaagaaataatttcaaatagttgtagcaagaatgaaatagacattgatgctttt aggcattatagcttttctgatcaacctaagtgttcacagtacatatctgggctcatgagt gtacatttctatggtgctgaggatttaaaaccacctcggatagattcaaaagacgtcttt tgtgcaattcaggtagattcagtaaacaaagcaagaacagctttgctcacatgccgaaca acatttttagacatggatcacactttcaacatagaaattgaaaatgcacaacatttgaaa ctagtagtattcagttgggaacccactccaagaaaaaatcgagtttgttgtcatggaact gttgttcttcccaccttatttagagtgacaaagactcatcagttggctgtcaaacttgaa cctagaggtcttatttatgtgaaagtgactcttatggaacagtgggagaattctcttcat ggactagatataaaccaagaaccaataatatttggagttgatattcaaaaagttgtagag aaagaaaatataggactgatggtgccccttctgatacagaaatgtattatggaaattgaa aagagaggctgtcaggtagtaggcctgtatcgattatgtggttcggcagcagtcaagaaa gaactgcgagaggcttttgagagagatagcaaagctgttggtctgtgtgaaaaccagtac ccagatataaatgtaataacaggtgttcttaaggattatttaagagaactcccttctcct ctgataacaaagcagctttatgaggctgtattagatgcaatggcaaaaagtcctttgaaa atgtcatcaaatggttgtgagaatgacccaggtgactctaagtacactgttgacctgctg gattgtctgccagagattgagaag >gi568815597r:85167630_85376352|GENSCAN_predicted_peptide_2|288_aa MEPTAPSLTEEDLTEVKKDVSNAAVPRAGGGGLQPSGRRKRKPGVRGQGGSSRSEERKVL AQEDCGEPCIRGTLALENLRVYLCEKIIAERHFDHLRAKKILSREDTEEISCRTSSRKRA GKLLDYLQENPKGLDTLVESIRREKTQNFLIQKITDEVLKLRNIKLEHLKGLKCSSCEPF PDGATNNLSRSNSDESNFSEKLRASTVMYHPEGESSTTPFFSTNSSLNLPVLEVGRTENT IFSSTTLPRPGDPGAPPLPPDLQLEEEGTCANSSEMFLPLRSRTVSRQ >gi568815597r:85167630_85376352|GENSCAN_predicted_CDS_2|867_bp atggagcccaccgcaccgtccctcaccgaggaggacctcactgaagtgaagaaggacgtg agtaacgcagctgtgcccagggcgggcgggggcgggctgcagcccagcgggagacgaaag cggaagcctggagtccgaggacaaggaggatcctccaggtcggaggagcggaaagtccta gcacaggaggactgtggcgagccctgcatccgagggaccttggccttagaaaatttacgt gtatacctgtgtgagaaaatcatagctgagagacattttgatcatctacgtgcaaaaaaa atactcagtagagaagacactgaagaaatttcttgtcgaacatcaagtagaaaaagggct ggaaaattgttagactacttacaggaaaacccaaaaggtctggacacccttgttgaatct attcggcgagaaaaaacacagaacttcctgatacagaagattacagatgaagtgctgaaa cttagaaatataaaactagaacatctgaaaggactaaaatgtagcagttgtgaacctttt ccagatggagccacgaacaacctctccagatcaaattcagatgagagtaatttctctgaa aaactgagggcatccactgtcatgtaccatccagaaggagaatccagcacgacgcccttt ttttctactaattcttctctgaatttgcctgttctagaagtaggcagaactgaaaatacc atcttctcttcaactacacttcccagacctggggacccaggggctcctcctttgccacca gatctacagttagaagaagaaggaacttgtgcaaactctagtgagatgtttcttccctta agatcacgtactgtttcacgacaatga >gi568815597r:85167630_85376352|GENSCAN_predicted_peptide_3|237_aa MAGTEERSGTWKLGDAGNHRAPERVLQPWLGELLGLGSQKGCSSLSFLLPTTCSFSPGIQ RIPSSCPVSRKNEVRRQVEVPKAWRGAKAAGVWHVSAALSMHTPGWVARGPGLGLNFAPK SESAPGVGRDQATGAGTSEPAGAGRLPWPPRDARVHSHGWMAAAVPERAGLLPDSGPQEH RDVQVWSCAWAAAAAPREHKAPALPTWKWAGLPQFPAPPSPVELAALATPPPLQPVS >gi568815597r:85167630_85376352|GENSCAN_predicted_CDS_3|714_bp atggctggcactgaggaacgcagtggaacttggaagcttggagatgccgggaaccacaga gccccagagagggtgttacagccctggcttggggagctcctaggtctgggctcccagaag ggctgcagctctctctccttcttgttgcccacaacgtgctcttttagcccaggcattcag cgcatcccaagttcttgtcctgtgtctaggaagaatgaggtacgcagacaagtggaggtg cccaaagcctggaggggagccaaggcggcaggggtctggcatgtcagtgctgctctgagc atgcatacacctggctgggttgcaagagggcctgggcttggcctcaactttgctccaaaa tcagagtcggcaccgggagtggggagagaccaggcaacaggagcaggtacttctgagcct gcaggggcagggaggcttccctggcccccaagagatgcccgggtccacagccatggctgg atggctgcagctgtgcctgagagggcagggctcctgcctgattctggcccccaagagcac agggatgtccaggtctggagctgtgcctgggcagctgcagctgcacccagggagcacaag gctcccgccctgccaacttggaagtgggcagggcttccacagttcccggctcctcccagc cctgtagagctcgcagccctggccacgcctcccccactgcagccagtgtcatga >gi568815597r:85167630_85376352|GENSCAN_predicted_peptide_4|306_aa MVNVFGVIPNYIPLFLNPVSAATPIQCDPLPFIVEWTRCRHWTQAGLTRFSLVELGNGPL RFLSSMFGSCAQICSSEIRVGSEILLLRLQQRPGEHLKGSSRVVEIGVQPEACVPANLER QVFPEIASEPQNFMFFSPETLKTSPMSSQRSPQDFSGFGNMLTGILARGFSFLDLDFRLI LDNWQMGTCYAALMQRANNIDVYKILKEKLQDMPAEKTRVTTRGRCYSTIFLQKLRSANM DIRISIVFEAKSKDASGEKGQHALSDRHSYGKTEPCQLPGKPVNSPRKPSGQLRGIRDEN PELFLE >gi568815597r:85167630_85376352|GENSCAN_predicted_CDS_4|921_bp atggtaaatgtttttggggtgatccccaactacatacccctttttctgaaccctgtatca gctgccacacctatacagtgtgaccctcttcccttcatagttgagtggacaagatgtaga cactggacccaagctggactcaccaggttttcccttgtggaattgggaaatgggcctctg aggttcttgagcagcatgtttggctcctgtgcacagatctgcagttcagaaatcagggtg ggaagtgaaatcctattactgagactccaacagcgtcctggggagcacctcaaagggtct tccagagttgttgaaattggggttcagccagaggcctgtgtacctgccaatctggagaga caagtcttcccagaaattgcatctgagcctcagaattttatgtttttttcccctgaaacc ctgaaaaccagtcccatgtcctctcagagaagccctcaagatttctctggatttggtaac atgctgactgggattctagccagaggattctcattcttggacctggattttagactcatt ttggacaattggcaaatgggcacgtgttatgctgctttaatgcaaagagcaaacaatatc gatgtttataaaatcctaaaggaaaaactccaagatatgccggctgagaaaacgcgagtg actacccgagggcgctgttactccacaatttttctgcaaaagctaaggtcggcaaacatg gatattcggatttccattgtttttgaggctaaatcaaaagatgcctctggggaaaagggg caacatgcactgtcggatcggcattcatatggaaaaacagagccatgtcagctaccaggg aaacctgtgaattcaccacggaagccgtcaggccagcttagaggcatccgtgatgaaaat cccgagttatttctggagtag >gi568815597r:85167630_85376352|GENSCAN_predicted_peptide_5|180_aa MEWAFSRLDEKIPRNVGNCRKCLVVNATCHADSLGLKARQFPLYFVTAPPWGTRPIPPTS SCITDHDSVATMILRAISSLQLRAFFVKELPMIKIMQQMSDHRYDKLTVPDDIAANCIYL NIPNKGHVLLHRTPEEYPESAKVYEKLKDHMLIPVSMSELEKVDGLLTCCSVLINKKVDS >gi568815597r:85167630_85376352|GENSCAN_predicted_CDS_5|543_bp atggagtgggctttcagtcgtttggatgagaaaattcccagaaatgttgggaactgccgg aaatgcctggttgttaatgcaacttgccatgcagattctcttgggctgaaagccagacag ttcccactctattttgtgacagctccaccctggggaacaaggccaattcctccaaccagc agctgcataacagaccatgacagtgtggccacaatgattctcagagctatttcctcccta cagctgcgtgccttcttcgttaaggagttgcctatgataaaaatcatgcaacagatgagt gaccaccgctacgacaaactcactgtgcctgatgacatagcagcaaactgtatatatcta aatatccccaacaaagggcacgtcttgctgcaccgaaccccggaagagtatccagaaagt gcaaaggtttatgagaaactgaaggaccatatgctgatccccgtgagcatgtctgaactg gaaaaggtggatgggctgctcacctgctgctcagttttaattaacaagaaagtagactcc tga >gi568815597r:85167630_85376352|GENSCAN_predicted_peptide_6|200_aa MDVFSPVPGTTSIQNNVQGKKRNYVSFYGSRNLSTDTPSVANIGSLAPDPGIRDRTAMPD LDWEVDMMKEALEKLQLNIVEMKDENATLDGGDVLFTGREFFVGLSKRTNQRGAEILADT FKDYAVSTVPVADGLHLKSFCSMAGPNLIAIGSSESAQKALKFKILRPDEIIILSGILQF QKQVSWALIPLAPVAQPTPY >gi568815597r:85167630_85376352|GENSCAN_predicted_CDS_6|603_bp atggatgtgttctcaccagttccaggtacaacatctatacagaacaatgtccaagggaag aagaggaattatgtttctttctatggctccaggaacctctcaacagacaccccctcagtg gccaatattgggtcacttgcaccagaccctggcataagagacagaactgccatgcctgat ttagactgggaggttgacatgatgaaagaagcattagaaaaacttcagctcaatatagta gagatgaaagatgaaaatgcaactttagatggcggagatgttttattcacaggcagagaa ttttttgtgggcctttccaaaaggacaaatcaacgaggtgctgaaatcttggctgatact tttaaggactatgcagtctccacagtgccagtggcagatgggttgcatttgaagagtttc tgcagcatggctgggcctaacctgatcgcaattgggtctagtgaatctgcacagaaggcc cttaagttcaaaatactgagacctgatgaaataatcattttgagtgggatcttacagttc cagaaacaagtttcctgggccctgattcccctggccccagttgcccaacctactccatac tga