GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:39:32 Sequence gi568815589f:2522190_2752949 : 230760 bp : 40.46% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1594 1753 160 0 1 98 55 84 0.472 6.14 1.02 Intr + 7668 7825 158 1 2 46 100 64 0.181 2.21 1.03 Intr + 27236 27461 226 1 1 48 77 261 0.009 17.64 1.04 Term + 74606 74814 209 0 2 90 49 121 0.315 4.92 1.05 PlyA + 75089 75094 6 1.05 2.00 Prom + 85187 85226 40 -4.95 2.01 Init + 100001 100082 82 1 1 112 117 243 0.802 28.78 2.02 Intr + 100347 100391 45 1 0 88 92 28 0.463 0.66 2.03 Intr + 113264 113383 120 0 0 98 103 180 0.962 20.05 2.04 Intr + 117670 117792 123 2 0 36 78 204 0.998 13.84 2.05 Intr + 119188 119310 123 2 0 49 83 99 0.947 5.14 2.06 Intr + 120971 121342 372 0 0 68 91 423 0.999 34.71 2.07 Intr + 121439 121561 123 0 0 83 91 194 0.999 18.84 2.08 Intr + 121648 121770 123 2 0 94 91 82 0.989 8.74 2.09 Intr + 122545 122664 120 2 0 74 105 85 0.738 8.35 2.10 Intr + 122768 122893 126 0 0 59 67 170 0.995 11.73 2.11 Intr + 123385 123556 172 2 1 56 93 124 0.900 7.78 2.12 Intr + 124145 124363 219 2 0 103 5 254 0.605 15.10 2.13 Intr + 125285 125403 119 2 2 74 76 114 0.632 7.99 2.14 Intr + 126019 126158 140 2 2 95 94 40 0.998 4.56 2.15 Intr + 126480 126621 142 2 1 64 86 140 0.997 10.41 2.16 Intr + 128181 128327 147 1 0 97 78 87 0.966 7.89 2.17 Intr + 129226 129309 84 2 0 73 57 82 0.846 2.47 2.18 Intr + 129685 129765 81 2 0 122 84 57 0.994 7.39 2.19 Intr + 130591 130760 170 2 2 81 92 161 0.973 14.54 2.20 Term + 131644 131679 36 0 0 104 38 59 0.823 -1.14 2.21 PlyA + 133890 133895 6 1.05 3.04 PlyA - 135073 135068 6 1.05 3.03 Term - 135557 135413 145 1 1 62 42 145 0.298 3.70 3.02 Intr - 142886 142780 107 2 2 60 114 101 0.608 7.99 3.01 Init - 145838 145749 90 2 0 34 116 24 0.218 0.34 3.00 Prom - 148894 148855 40 -6.25 4.00 Prom + 149267 149306 40 -2.35 4.01 Init + 149829 149924 96 2 0 51 39 114 0.334 3.16 4.02 Term + 187592 187717 126 1 0 80 33 126 0.470 3.60 4.03 PlyA + 188670 188675 6 1.05 5.00 Prom + 190695 190734 40 -2.55 5.01 Init + 195551 197099 1549 1 1 91 97 1964 0.463 190.22 5.02 Intr + 200676 200833 158 1 2 61 71 67 0.018 1.11 5.03 Term + 207257 207538 282 1 0 122 35 203 0.796 12.84 5.04 PlyA + 207828 207833 6 1.05 6.03 PlyA - 210153 210148 6 1.05 6.02 Term - 212647 212476 172 0 1 46 49 136 0.866 1.82 6.01 Init - 213285 213200 86 0 2 75 65 101 0.879 6.84 6.00 Prom - 216759 216720 40 -7.05 7.00 Prom + 218656 218695 40 -2.65 7.01 Init + 220085 220466 382 1 1 77 3 205 0.444 7.98 7.02 Intr + 223451 223573 123 0 0 79 32 84 0.489 1.54 7.03 Intr + 224170 224329 160 2 1 68 71 88 0.770 3.22 7.04 Term + 224537 224777 241 2 1 17 49 185 0.448 2.01 7.05 PlyA + 224994 224999 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 204743 204940 198 1 0 75 59 132 0.943 7.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:2522190_2752949|GENSCAN_predicted_peptide_1|250_aa MAIMRLSRSVRDAGFSQDRGTRPTCASPSATSIKPRQIPHNKTEDKTQMKISGVQVNDRE ALNTFYVPGTDVSTSLSITSFKLHRIPILQIKAPRPKEVKHLFKVPVWKCDPFTSIGMWA EGKKLLLDLAHKTYQLKKEDHGHLVEGRAMEWKKPESVKLCQDRNCPTGNTYTGPFRHKK IAPILRSCFCFQTIVCLIPTNSKTAKGLAGNTPSGDAPRVYSHCGARGEGKVEKSRKKNL SAALQDQRLL >gi568815589f:2522190_2752949|GENSCAN_predicted_CDS_1|753_bp atggccataatgagactcagccgcagcgtaagagatgctggcttctcacaggaccgtggg accaggcccacctgtgcctctccctcagccacaagcattaagcccagacaaattcctcac aacaaaactgaagacaaaacgcaaatgaaaatatcaggagtacaagtcaatgaccgtgaa gcactgaatactttctatgtgcctggcactgatgtgagcacttcgttatctattacctca tttaaactacacagaatacccattttacagataaaggcaccaaggcccaaagaggttaaa cacctttttaaggtgccagtgtggaaatgtgatccgtttacatccataggaatgtgggca gaaggcaagaaactacttctcgacctggcccataaaacctaccagctgaagaaagaagac catgggcacctggtggaaggtagagccatggaatggaagaagcccgagtccgtgaaactc tgccaagatagaaactgcccgactgggaacacctacactggaccgttcagacacaagaaa atagcaccaatcctgagatcttgtttttgctttcaaacaatcgtctgcctaattcccact aactctaaaacagcaaaggggctggctggcaacaccccaagtggtgatgcccctcgggtc tattcacattgtggagcaagaggagaggggaaggtggagaagtccagaaaaaagaacctg tctgcggccttgcaggatcagcggcttctgtga >gi568815589f:2522190_2752949|GENSCAN_predicted_peptide_2|888_aa MGTSALWALWLLLALCWAPRESGATGTEPLLRLVSRPLDGRRGRKAKCEPSQFQCTNGRC ITLLWKCDGDEDCVDGSDEKNCVKKTCAESDFVCNNGQCVPSRWKCDGDPDCEDGSDESP EQCHMRTCRIHEISCGAHSTQCIPVSWRCDGENDCDSGEDEENCGNITCSPDEFTCSSGR CISRNFVCNGQDDCSDGSDELDCAPPTCGAHEFQCSTSSCIPISWVCDDDADCSDQSDES LEQCGRQPVIHTKCPASEIQCGSGECIHKKWRCDGDPDCKDGSDEVNCPSRTCRPDQFEC EDGSCIHGSRQCNGIRDCVDGSDEVNCKNVNQCLGPGKFKCRSGECIDISKVCNQEQDCR DWSDEPLKECHINECLVNNGGCSHICKDLVIGYECDCAAGFELIDRKTCGDIDECQNPGI CSQICINLKGGYKCECSRGYQMDLATGVCKAVGKEPSLIFTNRRDIRKIGLERKEYIQLV EQLRNTVALDADIAAQKLFWADLSQKAIFSASIDDKVGRHVKMIDNVYNPAAIAVDWVYK TIYWTDAASKTISVATLDGTKRKFLFNSDLREPASIAVDPLSGFVYWSDWGEPAKIEKAG MNGFDRRPLVTADIQWPNGITLDLIKSRLYWLDSKLHMLSSVDLNGQDRRIVLKSLEFLA HPLALTIFEDRVYWIDGENEAVYGANKFTGSELATLVNNLNDAQDIIVYHELVQPSGKNW CEEDMENGGCEYLCLPAPQINDHSPKYTCSCPSGYNVEENGRDCQSTATTVTYSETKDTN TTEISATSGLVPGGINVTTAVSEVSVPPKGTSAAWAILPLLLLVMAAVGGYLMWRNWQHK NMKSMNFDNPVYLKTTEEDLSIDIGRHSASVGHTYPAISVVSTDDDLA >gi568815589f:2522190_2752949|GENSCAN_predicted_CDS_2|2667_bp atgggcacgtccgcgctctgggcgctctggctgctgctcgcgctgtgctgggcgccccgg gagagcggcgccaccggaaccgagccgctgcttcgcctcgtttctcgccctcttgacggg cgccgagggagaaaagccaaatgtgaaccctcccaattccagtgcacaaatggtcgctgt attacgctgttgtggaaatgtgatggggatgaagactgtgttgacggcagtgatgaaaag aactgtgtaaagaagacgtgtgctgaatctgacttcgtgtgcaacaatggccagtgtgtt cccagccgatggaagtgtgatggagatcctgactgcgaagatggttcagatgaaagccca gaacagtgccatatgagaacatgccgcatacatgaaatcagctgtggcgcccattctact cagtgtatcccagtgtcctggagatgtgatggtgaaaatgattgtgacagtggagaagat gaagaaaactgtggcaatataacatgtagtcccgacgagttcacctgctccagtggccgc tgcatctccaggaactttgtatgcaatggccaggatgactgcagcgatggcagtgatgag ctggactgtgccccgccaacctgtggcgcccatgagttccagtgcagcacctcctcctgc atccccatcagctgggtatgcgacgatgatgcagactgctccgaccaatctgatgagtcc ctggagcagtgtggccgtcagccagtcatacacaccaagtgtccagccagcgaaatccag tgcggctctggcgagtgcatccataagaagtggcgatgtgatggggaccctgactgcaag gatggcagtgatgaggtcaactgtccctctcgaacttgccgacctgaccaatttgaatgt gaggatggcagctgcatccatggcagcaggcagtgtaatggtatccgagactgtgtcgat ggttccgatgaagtcaactgcaaaaatgtcaatcagtgcttgggccctggaaaattcaag tgcagaagtggagaatgcatagatatcagcaaagtatgtaaccaggagcaggactgcagg gactggagtgatgagcccctgaaagagtgtcatataaacgaatgcttggtaaataatggt ggatgttctcatatctgcaaagacctagttataggctacgagtgtgactgtgcagctggg tttgaactgatagataggaaaacctgtggagatattgatgaatgccaaaatccaggaatc tgcagtcaaatttgtatcaacttaaaaggcggttacaagtgtgaatgtagtcgtggctat caaatggatcttgctactggcgtgtgcaaggcagtaggcaaagagccaagtctgatcttc actaatcgaagagacatcaggaagattggcttagagaggaaagaatatatccaactagtt gaacagctaagaaacactgtggctctcgatgctgacattgctgcccagaaactattctgg gccgatctaagccaaaaggctatcttcagtgcctcaattgatgacaaggttggtagacat gttaaaatgatcgacaatgtctataatcctgcagccattgctgttgattgggtgtacaag accatctactggactgatgcggcttctaagactatttcagtagctaccctagatggaacc aagaggaagttcctgtttaactctgacttgcgagagcctgcctccatagctgtggaccca ctgtctggctttgtttactggtcagactggggtgaaccagctaaaatagaaaaagcagga atgaatggattcgatagacgtccactggtgacagcggatatccagtggcctaacggaatt acacttgaccttataaaaagtcgcctctattggcttgattctaagttgcacatgttatcc agcgtggacttgaatggccaagatcgtaggatagtactaaagtctctggagttcctagct catcctcttgcactaacaatatttgaggatcgtgtctactggatagatggggaaaatgaa gcagtctatggtgccaataaattcactggatcagagctagccactctagtcaacaacctg aatgatgcccaagacatcattgtctatcatgaacttgtacagccatcaggtaaaaattgg tgtgaagaagacatggagaatggaggatgtgaatacctatgcctgccagcaccacagatt aatgatcactctccaaaatatacctgttcctgtcccagtgggtacaatgtagaggaaaat ggccgagactgtcaaagtactgcaactactgtgacttacagtgagacaaaagatacgaac acaacagaaatttcagcaactagtggactagttcctggagggatcaatgtgaccacagca gtatcagaggtcagtgttcccccaaaagggacttctgccgcatgggccattcttcctctc ttgctcttagtgatggcagcagtaggtggctacttgatgtggcggaattggcaacacaag aacatgaaaagcatgaactttgacaatcctgtgtacttgaaaaccactgaagaggacctc tccatagacattggtagacacagtgcttctgttggacacacgtacccagcaatatcagtt gtaagcacagatgatgatctagcttga >gi568815589f:2522190_2752949|GENSCAN_predicted_peptide_3|113_aa MKAYPRLGRKGGLMDLQFHVAGEASQSYQKNYMLDIQETHDPNWDNRNGFLEEETSEPRP GRIRAGCLTDDSRQCTSTRAKCAHLASSRQQNGQQQQLALTQLSLLLPLPSPH >gi568815589f:2522190_2752949|GENSCAN_predicted_CDS_3|342_bp atgaaggcatacccgagactgggaagaaaaggaggtttaatggacttacagttccacgtg gctggggaggcctcacaatcatatcagaagaattatatgctggatatacaagagactcat gacccaaactgggacaacaggaatggcttcctggaagaagagacatctgagcccagacct ggaagaataagagccggatgcttaacagacgacagcagacaatgcacctcaaccagggct aagtgtgcacacttggccagcagccgccaacagaacgggcagcagcagcagttagcactg acgcaactatcgctgctattgccactaccatctccccactaa >gi568815589f:2522190_2752949|GENSCAN_predicted_peptide_4|73_aa MHTYVHCSTIHNSKDVESTQVATKSGLDKENAVFTDASFEISKSPQEHHLAPVTPTPGSP RHCSNASVRADVP >gi568815589f:2522190_2752949|GENSCAN_predicted_CDS_4|222_bp atgcacacctatgttcactgcagcactattcacaatagcaaagacgtggaatcaacccag gtggccaccaaaagtggactggataaagaaaatgcagttttcacagatgcctcttttgaa atcagcaaatctcctcaggagcatcatctggctcctgtcacccctactccagggagccct cgtcactgcagtaatgccagtgtcagagcagatgttccataa >gi568815589f:2522190_2752949|GENSCAN_predicted_peptide_5|662_aa MLKQSERRRSWSYRPWNTTENEGSQHRRSICSLGARSGSQASIHGWTEGNYNYYIEEDED GEEEDQWKDDLAEEDQQAGEVTTAKPEGPSDPPALLSTLNVNVGGHSYQLDYCELAGFPK TRLGRLATSTSRSRQLSLCDDYEEQTDEYFFDRDPAVFQLVYNFYLSGVLLVLDGLCPRR FLEELGYWGVRLKYTPRCCRICFEERRDELSERLKIQHELRAQAQVEEAEELFRDMRFYG PQRRRLWNLMEKPFSSVAAKAIGVASSTFVLVSVVALALNTVEEMQQHSGQGEGGPDLRP ILEHVEMLCMGFFTLEYLLRLASTPDLRRFARSALNLVDLVAILPLYLQLLLECFTGEGH QRGQTVGSVGKVGQVLRVMRLMRIFRILKLARHSTGLRAFGFTLRQCYQQVGCLLLFIAM GIFTFSAAVYSVEHDVPSTNFTTIPHSWWWAAVSTFALGFPILFPSPVSCSSLPWLSATR LWLLILVFPPTPNRRIQLTKRRWMSKVVERELSRSVNSSSHMSMAVAKNKRENASPIMQT LHKFLFMAFAQPIGQSKSHGQAASQRAGQVSISTVGYGDMYPETHLGRFFAFLCIAFGII LNGMPISILYNKFSDYYSKLKAYEYTTIRRERGEVNFMQRARKKIAECLLGSNPQLTPRQ EN >gi568815589f:2522190_2752949|GENSCAN_predicted_CDS_5|1989_bp atgctcaaacagagtgagaggagacggtcctggagctacaggccctggaacacgacggag aatgagggcagccaacaccgcaggagcatttgctccctgggtgcccgttccggctcccag gccagcatccacggctggacagagggcaactataactactacatcgaggaagacgaagac ggcgaggaggaggaccagtggaaggacgacctggcagaagaggaccagcaggcaggggag gtcaccaccgccaagcccgagggccccagcgaccctccggccctgctgtccacgctgaat gtgaacgtgggtggccacagctaccagctggactactgcgagctggccggcttccccaag acgcgcctaggtcgcctggccacctccaccagccgcagccgccagctaagcctgtgcgac gactacgaggagcagacagacgaatacttcttcgaccgcgacccggccgtcttccagctg gtctacaatttctacctgtccggggtgctgctggtgctcgacgggctgtgtccgcgccgc ttcctggaggagctgggctactggggcgtgcggctcaagtacacgccacgctgctgccgc atctgcttcgaggagcggcgcgacgagctgagcgaacggctcaagatccagcacgagctg cgcgcgcaggcgcaggtcgaggaggcggaggaactcttccgcgacatgcgcttctacggc ccgcagcggcgccgcctctggaacctcatggagaagccattctcctcggtggccgccaag gccatcggggtggcctccagcaccttcgtgctcgtctccgtggtggcgctggcgctcaac accgtggaggagatgcagcagcactcggggcagggcgagggcggcccagacctgcggccc atcctggagcacgtggagatgctgtgcatgggcttcttcacgctcgagtacctgctgcgc ctagcctccacgcccgacctgaggcgcttcgcgcgcagcgccctcaacctggtggacctg gtggccatcctgccgctctaccttcagctgctgctcgagtgcttcacgggcgagggccac caacgcggccagacggtgggcagcgtgggtaaggtgggtcaggtgttgcgcgtcatgcgc ctcatgcgcatcttccgcatcctcaagctggcgcgccactccaccggactgcgtgccttc ggcttcacgctgcgccagtgctaccagcaggtgggctgcctgctgctcttcatcgccatg ggcatcttcactttctctgcggctgtctactctgtggagcacgatgtgcccagcaccaac ttcactaccatcccccactcctggtggtgggccgcggtgagtacctttgccctgggcttt cccatcctcttccccagcccagtgagctgctcctccctcccctggttatcagccaccagg ctttggcttctgatcctcgtcttcccccccacccccaatcgccgcatacagctaacaaaa cggcgatggatgtcaaaagtggtggaaagagaactcagcagatcagtaaactccagcagc cacatgtcgatggctgtggcaaagaacaagagagagaatgcaagccccatcatgcaaaca cttcataagtttcttttcatggcatttgctcagcccattggccagagtaagtcacatggc caagctgcaagtcaaagggcagggcaggtgagcatctccaccgtgggctacggagacatg tacccagagacccacctgggcaggttttttgccttcctctgcattgcttttgggatcatt ctcaacgggatgcccatttccatcctctacaacaagttttctgattactacagcaagctg aaggcttatgagtataccaccatacgcagggagaggggagaggtgaacttcatgcagaga gccagaaagaagatagctgagtgtttgcttggaagcaacccacagctcaccccaagacaa gagaattag >gi568815589f:2522190_2752949|GENSCAN_predicted_peptide_6|85_aa MVELAAFDRRDESFIGTEVCRLSFPINHLYDLDEENQQGSVGYSRVGNSGRPLLLIGLRE QQEGKVFLGPVNAVVKQGYSTEPRL >gi568815589f:2522190_2752949|GENSCAN_predicted_CDS_6|258_bp atggtggagctggcagcctttgacaggagagatgaatcatttattggcactgaggtatgc cgattatcatttcctatcaatcatctgtatgatctggatgaagagaaccaacaaggctcg gtaggatattcgagagttggcaacagtgggaggccattactactcataggcctgagggaa caacaggagggaaaggtgtttctaggacctgtaaatgctgtggttaaacaaggctactca actgagccacggctgtag >gi568815589f:2522190_2752949|GENSCAN_predicted_peptide_7|301_aa MVPSTLEKKVAAGNDTVRLGEIPKLQSTSTPASVQALDAIVRRNSRMSQKIVKAWRYTAK QNVHTQGSAGLLKSHTQRGLGLLPLWVSLTKGWNIHENSWKKVEISQNGSATHFYTKYGS SRNFHGTGSLINVLKTTPPPLLTPTLPPPIRGLCSHAEYKQDLDHKAGEGTNPADTVILD FQIPELGNKFLLFKPSSLCYFVTVVLANEYKVTGCRQHKTASGVIISRDNIRTQIPLPSK PEVPRPQIADWYWSMACYELGYPAGGEQVRKASSVFTAAPHHSHYCLSFSSCQISGGIRF S >gi568815589f:2522190_2752949|GENSCAN_predicted_CDS_7|906_bp atggtcccttccactttagaaaaaaaagttgcagctggaaatgatactgttaggctggga gagatccccaaactccagagcacttcaaccccagccagtgtccaggctcttgacgctatc gtgagaaggaattcaaggatgagtcagaaaatagtgaaagcatggagatatactgcgaag caaaacgtacacactcaggggagtgcaggcctactcaagagtcacacgcaacggggtttg gggctgctacccttatgggtttctttaaccaaggggtggaacattcatgaaaattcctgg aaaaaggtggaaatttctcagaacggcagtgccacccatttttataccaaatatgggtct tcccggaactttcatggcactggaagcctcatcaatgtgctgaagaccacccctcctcca ctcctcactcccacactcccacccccaattagaggcctctgcagccatgctgagtataag caggatttggatcacaaagcaggggaaggaaccaatcctgctgacacagtgatcttggac ttccaaattccagaactgggaaataaatttctgttgtttaagccatccagcttgtgctac tttgttacagtggtcctagcaaatgaatacaaggtcactggatgtcgtcagcacaaaact gccagtggggttatcattagcagagataacatccgaactcaaattcctttaccctctaaa ccagaggtccccagaccccagattgcggactggtactggtccatggcctgttatgaactg ggctaccccgcaggaggtgaacaggtccgcaaagcttcatctgtgtttacagctgctccc catcactcacattactgcctgagcttctcctcctgtcagatcagcggtggcattagattc tcctag