GENSCAN 1.0    Date run: 3-Nov-116    Time: 22:54:07

Sequence gi568815591r:80644897_81016781 : 371885 bp : 36.51% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1845   1964  120  2  0   73   82   218 0.987  19.99
 1.02 Intr +  11644  11804  161  0  2   82  109   121 0.905  11.46
 1.03 Intr +  16167  16314  148  0  1   75   73   156 0.838  12.12
 1.04 Intr +  19510  19601   92  0  2   26  115    64 0.106   0.77
 1.05 Intr +  26081  26268  188  2  2   92  113   124 0.998  13.71
 1.06 Intr +  27026  27144  119  0  2   74   58    94 0.987   4.26
 1.07 Intr +  27874  27947   74  0  2   88   70    62 0.334   1.69
 1.08 Intr +  30924  31051  128  0  2   25   70    97 0.300   1.00
 1.09 Intr +  32572  32731  160  2  1   32   55   108 0.069   0.02
 1.10 Intr +  41052  41170  119  0  2   93   49    71 0.001   2.89
 1.11 Intr +  43797  43913  117  1  0   14   98    73 0.009   0.42
 1.12 Intr +  47428  47659  232  2  1   37   72   178 0.055   7.01
 1.13 Term +  49930  50104  175  1  1   74   39   103 0.405   0.25
 1.14 PlyA +  50237  50242    6                               1.05

 2.00 Prom +  58356  58395   40                              -3.95
 2.01 Init +  63133  63210   78  0  0   73  116    67 0.936   9.11
 2.02 Term +  69285  70133  849  2  0  -17   43   577 0.627  34.56
 2.03 PlyA +  70361  70366    6                               1.05

 3.00 Prom +  70530  70569   40                              -6.15
 3.01 Sngl +  71481  72452  972  2  0   70   35   364 0.444  25.78
 3.02 PlyA +  72491  72496    6                               1.05

 4.16 PlyA -  73796  73791    6                               1.05
 4.15 Term - 100411  99998  414  1  0  113   34   361 0.986  27.48
 4.14 Intr - 104132 104002  131  0  2   76  105   127 0.996  12.69
 4.13 Intr - 106440 106364   77  2  2  111   61    42 0.418   2.04
 4.12 Intr - 113592 113435  158  0  2   55  101    39 0.008  -0.21
 4.11 Intr - 120347 120259   89  0  2   79   42    59 0.007  -0.83
 4.10 Intr - 144632 144410  223  2  1   61   99   276 0.973  22.78
 4.09 Intr - 150061 149958  104  2  2   92   94    30 0.447   2.97
 4.08 Intr - 157883 157769  115  2  1   17   56   122 0.749   1.00
 4.07 Intr - 159352 159210  143  2  2   83   95   118 0.940  11.15
 4.06 Intr - 160862 160743  120  0  0   89   83    74 0.942   6.55
 4.05 Intr - 165805 165715   91  1  1    5   89   102 0.022   0.65
 4.04 Intr - 173522 173403  120  2  0   64  107    57 0.037   4.97
 4.03 Intr - 182591 182529   63  2  0   63   98    69 0.904   3.40
 4.02 Intr - 183849 183689  161  1  2   80   84   121 0.940   9.69
 4.01 Init - 199216 199165   52  1  1   70   75    34 0.149   1.67
 4.00 Prom - 201893 201854   40                              -9.05

 5.00 Prom + 204295 204334   40                              -5.25
 5.01 Init + 213252 213296   45  2  0   91  109    14 0.787   4.53
 5.02 Intr + 214472 214574  103  1  1   62   58    50 0.421  -1.77
 5.03 Term + 216965 217446  482  0  2   48   37   234 0.976   8.27
 5.04 PlyA + 217459 217464    6                               1.05

 6.00 Prom + 219159 219198   40                              -6.05
 6.01 Init + 236755 236813   59  0  2   76   72    55 0.139   3.53
 6.02 Intr + 281744 281925  182  2  2   67   70   131 0.232   7.79
 6.03 Term + 305725 305885  161  2  2   37   44   130 0.048   0.62
 6.04 PlyA + 308159 308164    6                               1.05

 7.00 Prom + 309994 310033   40                              -5.45
 7.01 Init + 329511 329725  215  2  2   60   89    99 0.507   5.46
 7.02 Intr + 333532 333580   49  1  1   31  110    44 0.058  -1.34
 7.03 Intr + 351025 351214  190  0  1   64   75   133 0.145   7.84
 7.04 Intr + 354868 355002  135  2  0   65  111    17 0.589   1.32
 7.05 Intr + 360842 360929   88  0  1   87   89    37 0.972   1.81
 7.06 Term + 361786 361897  112  1  1   57   41   126 0.959   1.85
 7.07 PlyA + 362282 362287    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  17644  17432  213  1  0   87   38   302 0.857  20.03
S.002 Init - 165793 165715   79  1  1   22   89   103 0.922   5.07

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:80644897_81016781|GENSCAN_predicted_peptide_1|610_aa
MGCDRNCGLIAGAVIGAVLAVFGGILMPVGDLLIQKTIKKQVVLEEGTIAFKNWVKTGTE
VYRQFWIFDVQNPQEVMMNSSNIQVKQRGPYTYRVRFLAKENVTQDAEDNTVSFLQPNGA
IFEPSLSVGTEADNFTVLNLAVAYNNTADGVYKVFNGKDNISKVAIIDTYKGKRSIYAVF
ESDVNLKGIPVYRFVLPSKAFASPVENPDNYCFCTEKIISKNCTSYGVLDISKCKEGRPV
YISLPHFLYASPDVSEPIDGLNPNEEEHRTYLDIEPITGFTLQFAKRLQVNLLVKPSEKI
QGSQENNPMVLLKVVHTPLNGTWSEGCGNAGPGQMHFSYRSLPAARVNVEHYYRKATNQE
STCLERSFASLKVLKAYLVPFLLTGKIKEVSVQDINMDMDETGNHYSEQTTARTENQTLH
VLTHRWELNNENTWTEVNYKENSKSSNRKVSNRALWQDGQVGTALSAALSKINTEEIQTT
IREYYKHLYANKLENLEEMDKFLNTYTLRSLNQEEIESLNRTITRSEIEAIINSLPTKKS
PEPDGFTAKFYQRDMDEAGNHHSRQTNTRTENQTPHVLTHKWELNNENNGLRVGNITHQD
LSGDGGVGDG

>gi568815591r:80644897_81016781|GENSCAN_predicted_CDS_1|1833_bp
atgggctgtgaccggaactgtgggctcatcgctggggctgtcattggtgctgtcctggct
gtgtttggaggtattctaatgccagttggagacctgcttatccagaagacaattaaaaag
caagttgtcctcgaagaaggtacaattgcttttaaaaattgggttaaaacaggcacagaa
gtttacagacagttttggatctttgatgtgcaaaatccacaggaagtgatgatgaacagc
agcaacattcaagttaagcaaagaggtccttatacgtacagagttcgttttctagccaag
gaaaatgtaacccaggacgctgaggacaacacagtctctttcctgcagcccaatggtgcc
atcttcgaaccttcactatcagttggaacagaggctgacaacttcacagttctcaatctg
gctgtggcatacaacaatactgcagatggagtttataaagttttcaatggaaaagataac
ataagtaaagttgccataatcgacacatataaaggtaaaaggtcaatctatgctgtattt
gaatccgacgttaatctgaaaggaatccctgtgtatagatttgttcttccatccaaggcc
tttgcctctccagttgaaaacccagacaactattgtttctgcacagaaaaaattatctca
aaaaattgtacatcatatggtgtgctagacatcagcaaatgcaaagaagggagacctgtg
tacatttcacttcctcattttctgtatgcaagtcctgatgtttcagaacctattgatgga
ttaaacccaaatgaagaagaacataggacatacttggatattgaacctataactggattc
actttacaatttgcaaaacggctgcaggtcaacctattggtcaagccatcagaaaaaatt
caggggtcacaagaaaataacccaatggtccttttgaaagtagtacatacacctttaaat
ggaacttggtctgaagggtgtggaaatgctggtccagggcagatgcacttcagctaccgt
tccttgcctgccgccagagtaaatgttgagcattactacagaaaagccacaaaccaagaa
tctacctgtttggaaagatcttttgcatctctgaaggtgcttaaagcatacttagtgcct
ttccttttaactgggaagataaaagaagtatctgtccaagatattaatatggacatggat
gaaactggaaaccattattctgagcaaactaccgcaaggacagaaaaccagacactgcat
gttctcactcataggtgggaactgaataatgagaacacttggacagaggtgaattacaaa
gagaattctaaaagcagcaatagaaaagtatcaaatcgggcactttggcaagatggccaa
gtaggaacagctctgtctgcggctctcagcaagatcaacacagaagaaatacaaactacc
atcagagaatactataaacacctctatgcaaataaactagaaaatctagaagaaatggat
aaattcctgaacacatacaccctccgaagtctaaaccaggaagaaattgaatctctaaat
agaacaataacacgttcggaaatcgaggcaataattaatagcctaccaaccaaaaaaagt
ccagaaccagatggattcacagccaaattctaccagagggacatggatgaagctggaaac
catcattctcggcaaactaacacaagaacagaaaaccaaacaccacatgttctcactcac
aagtgggagttgaacaatgagaacaatggactcagggtgggaaacatcacacaccaggac
ctgtcaggggatggtggggtaggggatggatag

>gi568815591r:80644897_81016781|GENSCAN_predicted_peptide_2|308_aa
MGLEEAAFPKDLERMPGRGRGEQSQKEDIQTKGKEVENFEKNLEECITRITNTEKCLKEL
MELKTKARELCEECRSLRSRCDQLEERVSAMEDEMNERKREGKFREKRIKRNEQSLQEIW
DYVKRPNLCLIGVPESDGENGTKLENTLQDIIQENFPNLARQANVQIQEIQRTPQRYSSR
SATPRHIIVRFTEVEMKEKMLRAAREKGRVTLKGKPIRLTADLSAETLQARRPWGPIFNI
LKEKNFQPRISYPAKLSFISEGEIKYFTDKQMLRDFVTTRPALKELLKEALNMERNNRYQ
LLQNHAKM

>gi568815591r:80644897_81016781|GENSCAN_predicted_CDS_2|927_bp
atggggctggaagaagcagcatttccaaaggacttagaaagaatgccaggaagaggaaga
ggagaacagtcacaaaaggaggacattcaaaccaaaggcaaagaagttgaaaacttcgaa
aaaaacttagaagaatgtataactagaataaccaatacagagaagtgcttaaaggagctg
atggagctgaaaaccaaggctcgagaactatgtgaagaatgcagaagcctcaggagccga
tgcgatcaactggaagaaagggtatcagcaatggaagatgaaatgaatgaaaggaagcga
gaagggaagtttagagaaaaaagaataaaaagaaatgagcaaagcctccaagaaatatgg
gactatgtgaaaagaccaaatctatgtctgatcggtgtacctgaaagtgacggggagaat
ggaaccaagttggaaaacactctgcaggatattatccaggagaacttccccaatctagca
aggcaggccaacgttcagattcaggaaatacagagaacgccacaaagatactcctcgaga
agtgcaactccaagacacataattgtcagattcaccgaagttgaaatgaaggaaaaaatg
ttaagggcagccagagagaaaggtcgggttaccctcaaagggaagcccatcagactaaca
gcagatctctcagcagaaaccctacaagccagaagaccgtgggggccaatattcaacatt
cttaaagaaaagaattttcaacccagaatttcatatccggccaaactaagcttcataagt
gaaggagaaataaaatactttacagacaagcaaatgctaagagattttgtcaccaccagg
cctgccctaaaagagctcctgaaggaagcgctaaacatggaaaggaacaaccggtaccag
ctgctgcaaaatcatgccaaaatgtaa

>gi568815591r:80644897_81016781|GENSCAN_predicted_peptide_3|323_aa
MDKFLGTYTLPRLNQEEVESLNRPITGSEIVAIIKSLPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA
NRIQQHIKKLIHHDQVGFIPGMQGLFNIHKSINVIQCINRAKDKNHMIISIDAEKAFDKI
QQRFMLKTLNKLGIDGTYFKIIRVIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLIF
NIVLEVLSRAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPTVSAQNLLKLISNFSKV
SGYKINVHKSQAFLYTNNRQTES

>gi568815591r:80644897_81016781|GENSCAN_predicted_CDS_3|972_bp
atggataaattcctcggcacatacactctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggatctgaaattgtggcaataatcaaaagcttaccaaccaaa
aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactct
ttttatgaggccagcatcattctgataccaaagcctggcagagacacaaccaaaaaagag
aattttagaccaatatctttgatgaacattgatgcaaaaatcctcaataaaatactggca
aacagaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggcttgttcaatatacacaaatcaataaatgtaatccagtgtataaacaga
gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt
caacaacgcttcatgctaaagactctcaataaattaggtattgatgggacatatttcaaa
ataataagagttatttatgacaaacccacagccaatatcatactgaatgggcaaaaactg
gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcatattc
aacatagttttggaagttctgtccagggcaatcaggcaggagaaggaaataaagggtatt
caattaggaaaagaggaagtcaaattgtccctctttgcagacgacatgattgtatatcta
gaaaaccccactgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcaatgtacacaaatcacaagcattcttatacaccaataacagacaa
acagagagctaa

>gi568815591r:80644897_81016781|GENSCAN_predicted_peptide_4|686_aa
MGTGECGHWHLNSKCGKELRETKTSEYFSLSHHPLDYRILLMDEDQDRIYVGSKDHILSL
NINNISQEALSVFWPASTIKVEECKMAGKDPTHGCGNFVRVIQTFNRTHLYVCGSGAFSP
VCTYLNRGRRSEDQVFMIDSKCESGKGRCSFNPNVNTVSVMINEELFSGMYIDFMGTDAA
IFRSLTKRNAVRTDQHNSKWLSEPMFVDAHVIPDGTDPNDAKVYFFFKEKLTDNNRSTKQ
IHSMIARICPNDTGGLRSLVNKWTTFLKARLVCSVTDEDGPETHFDELGASFKTRLVALL
ANDPTSKPKSVGRSLPIVSLYFSCPGGAFTPNMRTTKEFPDDVVTFIRNHPLMYNSIYPI
HKRPLIVRIGTDYKYTKIAVDRVNAADGRYHVLFLGTDRGTVQKVVVLPTNNSVSGELIL
EELEVFKQQLYVSSNEGVSQVSLHRCHIYGTACADCCLARDPYCAWDGHSCSRFYPTGKR
RSRRQDVRHGNPLTQCRGFNLKGIKTYRNAAEIVQYGVKNNTTFLECAPKSPQASIKWLL
QKDKDRRKEVKLNERIIATSQGLLIRSVQGSDQGLYHCIATENSFKQTIAKINFKVLDSE
MVAVVTDKWSPWTWASSVRALPFHPKDIMGAFSHSEMQMINQYCKDTRQQHQQGDESQKM
RGDYGKLKALINSRKSRNRRNQLPES

>gi568815591r:80644897_81016781|GENSCAN_predicted_CDS_4|2061_bp
atggggacaggagagtgtggacactggcacttaaattcaaaatgtgggaaagaacttcga
gaaaccaagacctctgaatacttcagcctttcccaccatcctttagactacaggatttta
ttaatggatgaagatcaggaccggatatatgtgggaagcaaagatcacattctttccctg
aatattaacaatataagtcaagaagctttgagtgttttctggccagcatctacaatcaaa
gttgaagaatgcaaaatggctggcaaagatcccacacacggctgtgggaactttgtccgt
gtaattcagactttcaatcgcacacatttgtatgtctgtgggagtggcgctttcagtcct
gtctgtacttacttgaacagagggaggagatcagaggaccaagttttcatgattgactcc
aagtgtgaatctggaaaaggacgctgctctttcaaccccaacgtgaacacggtgtctgtt
atgatcaatgaggagcttttctctggaatgtatatagatttcatggggacagatgctgct
atttttcgaagtttaaccaagaggaatgcggtcagaactgatcaacataattccaaatgg
ctaagtgaacctatgtttgtagatgcacatgtcatcccagatggtactgatccaaatgat
gctaaggtgtacttcttcttcaaagaaaaactgactgacaataacaggagcacgaaacag
attcattccatgattgctcgaatatgtcctaatgacactggtggactgcgtagccttgtc
aacaagtggaccactttcttaaaggcgaggctggtgtgctcggtaacagatgaagacggc
ccagaaacacactttgatgaattaggtgcttcttttaaaacacggttggtagctctgtta
gcaaacgaccccaccagcaagcccaagtcagtaggaagaagcctgccaattgtgtcatta
tatttttcctgtccaggaggagcatttacacccaatatgcgaaccaccaaggagttccca
gatgatgttgtcacttttattcggaaccatcctctcatgtacaattccatctacccaatc
cacaaaaggcctttgattgttcgtattggcactgactacaagtatacaaagatagctgtg
gatcgagtgaacgctgctgatgggagataccatgtcctgtttctcggaacagatcggggt
actgtgcaaaaagtggttgttcttcctactaacaactctgtcagtggcgagctcattctg
gaggagctggaagtctttaagcaacagttgtatgtgagttccaatgaaggggtttcccag
gtatctctgcaccgctgccacatctatggtacagcctgtgctgactgctgcctggcgcgg
gacccttattgcgcctgggatggccattcctgttccagattctacccaactgggaaacgg
aggagccgaagacaagatgtgagacatggaaacccactgactcaatgcagaggatttaat
ctaaaaggtattaaaacatacagaaatgcagctgaaattgtccagtatggagtaaaaaat
aacaccacttttctggagtgtgcccccaagtctccgcaggcatctatcaagtggctgtta
cagaaagacaaagacaggaggaaagaggttaagctgaatgaacgaataatagccacttca
cagggactcctgatccgctctgttcagggttctgaccaaggactttatcactgcattgct
acagaaaatagtttcaagcagaccatagccaagatcaacttcaaagttttagattcagaa
atggtggctgttgtgacggacaaatggtccccatggacctgggccagctctgtgagggct
ttacccttccacccgaaggacatcatgggggcattcagccactcagaaatgcagatgatt
aaccaatattgcaaagacactcggcagcaacatcagcagggagatgaatcacagaaaatg
agaggggactatggcaagttaaaggccctcatcaatagtcggaaaagtagaaacaggagg
aatcagttgccagagtcataa

>gi568815591r:80644897_81016781|GENSCAN_predicted_peptide_5|209_aa
MAYTPSYLYKEEGFQLVVVLKKYPMSSLERIPVLLYHFQVHPKDFYQVSDAEKAFDKIQH
PFMIKTLSKIDIQGTYLNIIKVIYDKPTANITLNGEKLKPCHLRNGKRQECTLLPLLFNI
VLEVLARAIRQEKEIKGIQIGKEEVILSLFADDRTIYLENPKDSRKLLELIKEFSKVSRY
KTKIHKSVVACVPTATMQRIKPRTQPLFQ

>gi568815591r:80644897_81016781|GENSCAN_predicted_CDS_5|630_bp
atggcttatactcctagttatttgtataaggaggaaggctttcagcttgtggttgtattg
aaaaagtacccgatgagctccctagagaggatacctgttctgctctatcattttcaagtt
catcctaaagatttctaccaagtgtcagatgcagaaaaagcatttgacaaaatccagcat
ccctttatgattaaaactctcagcaaaatcgacatacaaggtacctacctcaatataata
aaagtcatttatgacaaacccacagccaacataacactgaatggggaaaagctgaaacca
tgccatctaagaaatggaaagagacaagaatgcacactcttaccactcctcttcaacata
gtactggaagtcctagccagagcaatcagacaagagaaagaaataaagggcatccaaata
ggtaaagaggaagtcatactgtcactgtttgctgatgataggaccatttaccttgaaaac
cctaaagactccagaaagctcctagaactgataaaagaattcagcaaagtttccagatac
aagactaagatacacaagtcagtagtagcttgtgtaccaacagcgaccatgcagagaatc
aaaccaagaactcaaccactttttcaatag

>gi568815591r:80644897_81016781|GENSCAN_predicted_peptide_6|133_aa
MGTINCAYNWLHSSSSNELNGFSVSLTSRMKLRTLTMSVTVLKDGVPGVCSFRCSDVSRV
SSFWWVCGLADFRSEAADLLRMKKQERAECIWMCLVLACSPFLKSSITTCGFKLDDISGP
KMIDDHNFVAAVV

>gi568815591r:80644897_81016781|GENSCAN_predicted_CDS_6|402_bp
atgggaacaattaactgtgcttacaattggcttcatagcagcagcagcaatgagttgaat
gggttctcggtctcactgacttcaagaatgaagctgcggaccctcacaatgagtgttaca
gttcttaaagatggtgtgcccggagtttgttccttcagatgttcagatgtgtccagagtt
tcttccttctggtgggtttgtggtcttgctgactttaggagtgaagctgcagaccttctc
agaatgaagaaacaagaacgagctgaatgtatctggatgtgtctggtcctggcttgcagc
ccttttcttaaatccagtatcactacttgtggttttaaacttgatgacatctctgggcct
aaaatgatagatgatcacaactttgttgctgctgtggtttag

>gi568815591r:80644897_81016781|GENSCAN_predicted_peptide_7|262_aa
MMCPEFVPSDVQMCPEFLASGGFVVSLTSGMKLQTLTVGVTALIRSASGVVRSFQWVCGL
NWPQEQSCRPSCNLRKECERWRLGDGNRQRKSDIPSGMALMSIQFIWILFCEKNGKETIN
PISRFPIKHDSSEKQEEQTFEGNGNKEEDGPEHCMSCISQYSLEGLPYCTHPVGTIGMQF
SDTEQQNPHSATSLKLDRFYFCGVQIAWRKKGSSKRKLNQRGGSEWRKKVEVDIRDLPIT
NALLVMTSEKHSICGSAGGLGQ

>gi568815591r:80644897_81016781|GENSCAN_predicted_CDS_7|789_bp
atgatgtgtccagagtttgttccttcagatgttcagatgtgtccagagtttcttgcttct
ggtgggtttgtggtctcgctgacttcaggaatgaagctgcagaccctcactgtgggtgtt
acagctcttatacgcagtgcttctggagttgttcgctccttccagtgggtttgtggtctt
aactggcctcaagagcaaagctgcagaccttcgtgtaacctgcgcaaggaatgcgagcgc
tggagacttggggatggcaataggcaaagaaagtcagacatacccagtgggatggcttta
atgagtatccaattcatttggatcctcttctgtgagaaaaatgggaaagaaacaataaac
cctataagcagatttcccatcaagcatgattcatcagaaaaacaagaggaacaaaccttt
gaaggtaatggcaacaaggaggaggatggcccagagcattgtatgtcttgtatatcacag
tattccctagaagggctaccatattgcacacatccagtgggcactattggcatgcaattc
tctgacactgagcagcaaaatccccattccgcaacatcattaaaacttgaccggttttat
ttctgtggagtccagattgcctggagaaagaaaggttcttccaagagaaaacttaaccaa
agaggaggatcagaatggagaaagaaggtagaagttgatatccgagatttaccaataacc
aatgctctcctggtaatgacctcagagaaacacagtatctgtggatctgcaggaggcctg
ggccaatga