GENSCAN 1.0    Date run: 5-Nov-116    Time: 18:17:27

Sequence gi568815594r:73881207_74081953 : 200747 bp : 37.08% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  10122  10131   10  2  1   91   94     7 0.428   2.42
 1.02 Term +  18037  18167  131  2  2   72   39   162 0.948   7.06
 1.03 PlyA +  18858  18863    6                               1.05

 2.02 PlyA -  19545  19540    6                               1.05
 2.01 Sngl -  25604  24363 1242  2  0   58   39   395 0.462  27.17
 2.00 Prom -  28436  28397   40                              -7.65

 3.00 Prom +  28731  28770   40                              -3.65
 3.01 Sngl +  37424  37723  300  1  0   71   43   176 0.322   6.94
 3.02 PlyA +  37891  37896    6                               1.05

 4.04 PlyA -  38636  38631    6                               1.05
 4.03 Term -  57864  57532  333  0  0   28   49   322 0.377  15.93
 4.02 Intr -  58411  58158  254  2  2   23   30   245 0.381   8.43
 4.01 Init -  59088  59028   61  0  1   77   81    48 0.600   4.46
 4.00 Prom -  60309  60270   40                              -9.95

 5.00 Prom +  61003  61042   40                              -5.95
 5.01 Init +  61533  61684  152  2  2   76   93   138 0.994  12.66
 5.02 Term +  62886  63312  427  0  1   45   49   240 0.780   9.59
 5.03 PlyA +  64805  64810    6                               1.05

 6.03 PlyA -  65216  65211    6                               1.05
 6.02 Term -  68550  68201  350  1  2   96   49   206 0.845  11.16
 6.01 Init -  70709  70448  262  2  1   62   19   138 0.039   1.57
 6.00 Prom -  75946  75907   40                              -3.65

 7.00 Prom +  77940  77979   40                              -6.95
 7.01 Init +  80462  80831  370  1  1   59    4   267 0.437  12.60
 7.02 Term +  81002  81429  428  0  2   10   44   240 0.505   6.28
 7.03 PlyA +  81802  81807    6                              -1.95

 8.10 PlyA -  81852  81847    6                               1.05
 8.09 Term -  83733  83537  197  1  2   86   41   106 0.868   2.29
 8.08 Intr -  84128  83872  257  1  2   73   46   167 0.859   7.16
 8.07 Intr - 100939 100657  283  2  1   32   81   364 0.239  25.65
 8.06 Intr - 106168 106086   83  0  2   94   64    53 0.228   1.86
 8.05 Intr - 106446 106311  136  1  1   66   95    77 0.967   4.91
 8.04 Intr - 106859 106750  110  1  2  100   75    48 0.309   3.71
 8.03 Intr - 116889 116806   84  2  0  114   47    53 0.562   1.82
 8.02 Intr - 117132 117000  133  1  1  106   78    80 0.878   7.58
 8.01 Init - 117375 117267  109  0  1   70   81   201 0.763  16.12
 8.00 Prom - 121007 120968   40                              -3.95

 9.00 Prom + 123454 123493   40                              -4.85
 9.01 Init + 130185 130270   86  2  2   11  110    85 0.274   1.78
 9.02 Intr + 132358 132435   78  1  0   63  121    63 0.326   4.85
 9.03 Intr + 134004 134125  122  0  2   53   53    63 0.471  -1.48
 9.04 Intr + 134316 134399   84  1  0   55   94    72 0.468   3.37
 9.05 Intr + 137841 137960  120  1  0   78   82    28 0.218   0.75
 9.06 Term + 141069 141226  158  1  2   59   35    99 0.243  -1.19
 9.07 PlyA + 141798 141803    6                               1.05

10.05 PlyA - 142639 142634    6                               1.05
10.04 Term - 151850 151671  180  2  0  154   49    27 0.566   2.03
10.03 Intr - 156970 156887   84  1  0  126   89    99 0.910  12.90
10.02 Intr - 157207 157125   83  2  2   96   73    92 0.782   6.94
10.01 Init - 157405 157306  100  1  1   80   94   205 0.999  18.77
10.00 Prom - 157518 157479   40                              -4.85

11.05 PlyA - 157713 157708    6                              -5.80
11.04 Term - 158274 158005  270  0  0   -8   54   276 0.822   9.10
11.03 Intr - 162011 161945   67  1  1   80   91    32 0.009   0.69
11.02 Intr - 173204 173069  136  0  1   56   75   107 0.004   4.91
11.01 Init - 178072 178012   61  1  1   87   99    46 0.331   7.10
11.00 Prom - 188501 188462   40                              -4.95

12.03 PlyA - 189124 189119    6                               1.05
12.02 Term - 192533 192463   71  0  2   75   39    63 0.480  -2.78
12.01 Intr - 195253 195169   85  1  1  108  121    31 0.764   6.87

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  79227  79028  200  0  2   63   42   148 0.815   6.12
S.002 Term + 173524 173663  140  2  2   62   49   173 0.915   7.94

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_1|46_aa
MAEAMFIEYQQPALNFRHFEDVHETQPLISRDSESIVDRRGANITV
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_1|141_bp
atggcagaagcaatgttcatcgaatatcaacaacctgcactaaatttcaggcactttgag
gatgttcatgaaacacagcccttgatctcaagagattcagagtctattgtggatcggagg
ggagcaaacatcacagtttag
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_2|413_aa
MIVYLENPIVSAQNLLKLISNFSKVSRYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS
KRIKYLGIQLTRDVKDLKENYKPLLSEIKEDTNKWKNIPCSWVGRINIMKMAILLKVIYR
FNAIPTKLQMTFFTELEKATLKFIWNQKRACITKSILSQKNKAGGITLPDFKLYYKAMVT
KTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAIC
RKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLEENLGSTIQDIGMGKDFMSETPKAMAT
KARIDQWDLIKLKSFCTAKETAIRVNRQPTKWEKIFATYSSDKGLISRIYNEVKQIYQKK
TNNPIKKWMKDMNRHFSKEDIYAAKKTHEKMLTITGHQRKANQNHNEIPSHTS
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_2|1242_bp
atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc
aacttcagcaaagtctcaagatacaaaatcaatgtacaaaaatcacaagcattcttatac
accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca
aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcaaggagaac
tacaaaccactgctcagtgaaataaaagaggatacaaacaaatggaagaatattccatgc
tcatgggtaggaagaatcaatatcatgaaaatggccatactgctcaaggtaatttataga
ttcaatgccatccccaccaagctacaaatgactttcttcacagaattggaaaaagctact
ttaaagttcatatggaaccaaaaaagagcctgcatcaccaagtcaatcctaagccaaaag
aacaaagctggaggcatcacactacctgacttcaaactatactacaaggctatggtaacc
aaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagagccc
tcagaaataatgccgcatatctacaactatctgatctttgacaaacctgagaaaaacaag
caatggggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgt
agaaagctgaaactggatcctttccttacaccttatacaaaaattaattcaagatggatt
aaagacttaaacattagacctaaaaccataaaaaccctagaagaaaacctaggcagtacc
attcaggacataggcatgggcaaggacttcatgtctgaaacaccaaaagcaatggcaaca
aaagccagaattgaccaatgggatctaattaaactaaagagcttctgcacagcaaaagaa
actgccatcagagtgaacaggcaacctacaaaatgggagaaaattttcgcaacctactca
tctgacaaagggctaatatccaggatctacaatgaagtcaaacaaatttaccagaaaaaa
acaaacaaccccatcaaaaagtggatgaaggacatgaacagacacttctcaaaagaagac
atttatgcagccaaaaaaacacatgaaaaaatgctcaccatcactggccatcagagaaaa
gcaaatcaaaaccacaatgagataccatctcacaccagttag
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_3|99_aa
MTWICMTVSGLPERSDKALGSRDTRMRRRQNSLGMKVKADKKKNCLTVVDFAIFPDKLLQ
KYLQVSMKIREHPYRFIPVFPVKSVSQMEKEKKRPIKRN
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_3|300_bp
atgacgtggatctgcatgacagtctctggtcttccagagagaagtgacaaagctctaggt
tccagagatacaagaatgaggaggaggcaaaacagcttgggaatgaaggtgaaagctgac
aaaaagaaaaactgtttgacagttgttgactttgcaatttttcctgataaactcttacaa
aagtatctccaggttagtatgaagataagagaacatccctatcgatttatcccagttttc
ccagtcaagtctgtgtcccaaatggaaaaagagaaaaagagaccaattaaaaggaactag
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_4|215_aa
MSLRKLTIMVEGKGGPNIPHACGCHQRSSIKVSLLPAVTSKSESPKEPEQLRKLFIGGLS
FETTDESLRSHFEQRRTLTDCAVMRDPNTKCSKGFGFVTYATMEEKYHTVNGHSCEARKA
LSKQEVASASSSQRGEVVLETLVVVMEVVSVEMTTLVMEKTSGVIVALVAATVVVDMVAV
GMTIVDLVMMEAILEVVEATMILAITAISLQILDP
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_4|648_bp
atgagcctcaggaagcttacaatcatggtagaaggcaaagggggacccaacatacctcat
gcctgtggatgccaccaaagaagcagcattaaagtctctcttctccctgctgttacatct
aagtcagagtctcctaaagaacccgaacagctgaggaagctcttcattggagggttgagc
tttgaaacaactgatgagagcctaaggagccattttgagcaaaggagaacgctcacggac
tgtgcggtaatgagagatccaaacacaaagtgctccaagggctttgggttcgtcacatat
gccactatggaggagaaataccacactgtgaatggccacagctgtgaagctaggaaagcc
ctgtcaaagcaagaggtggctagtgcttcctctagccaaagaggcgaagtggttctggaa
actttggtggtggtcatggaggtggtttcggtggaaatgacaactttggtcatggagaaa
acttcaggggtcatagtagctttggtggcagccacggtggtagtggatatggtggcagtg
gggatgactatagtggatttggtaatgatggaagcaattttggaggtggtggaagctaca
atgattttggcaattacagcaatcagtcttcaaattttggacccatga
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_5|192_aa
MQEESELDRRQCDRTKGQKERCEEPKQWRCREMQKGSYKARNADSLRKLERVCKHWQGSS
WATHTPREEETRWVSGTVGAGCYPSRVPLRRPQGAGTHLPLAGRRLPGAGAAAPPAVAGE
RGGQRGARRSSPGSEGRSGGGARSGRAGREVPAAQALWLPKIGEPFLCMIGAEKSGTPGP
GKFSELQVYPEY
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_5|579_bp
atgcaggaggagtcagagttagacagaaggcaatgtgacagaacaaagggacagaaagaa
agatgtgaagaacctaagcaatggagatgtagggagatgcagaaagggagctataaggca
aggaatgcagatagccttcggaagctggaaagggtctgcaagcactggcaaggcagttca
tgggccacccatactcctagggaagaagagactcgctgggtgagcgggactgttggcgcg
gggtgctaccccagccgtgtcccgctccggagaccccagggcgccgggacccatctgccg
ctcgccggccggaggctaccaggagcaggagcagcagcgccgcccgcagtagccggggag
cgcgggggacagcggggggcgcggcgcagttcaccgggctcagaaggcaggtcgggcggc
ggtgcgaggagcgggagagctggcagggaggtgcctgcagcccaggctctgtggctcccc
aagatcggtgaaccctttttatgcatgattggggctgaaaagtctggaacccccgggcca
gggaaattctcggagctccaggtctatcccgagtactga
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_6|203_aa
MGSDGEKAAYWINVISKQWPTYTIHIQVKKFEGLVDTGAEINIPHNSYSAPSQHTMENMG
FVPGLGLSPKHEGITKSLPVTVKENRAVSSHTDLPATQNYSYWAYVPFPPLIRPLTWIDA
PAEIYTNDSVWMPGATDDCCPAQPGEGTAFNVTMGYKYPPLCLGHAPGCIHLETQVWAAY
LPEISATEEPGYLISGLSLSPLK
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_6|612_bp
atgggctctgatggtgaaaaagccgcttattggattaatgtaatttctaaacaatggccc
acctacaccatacacattcaagtaaaaaagtttgagggcctagttgatactggggctgaa
attaatattccacataactcttatagtgctcccagtcagcatacgatggaaaacatgggg
tttgttcctgggcttggtctcagtccaaagcatgaagggattactaaatcccttccagtt
actgtaaaagaaaacagggcagtttcctcccacactgatttacctgctacacaaaattat
tcttactgggcttatgtgccttttcctccacttattcgacctctcacctggatagatgct
cctgcagaaatctacactaacgatagtgtgtggatgcctggagccacagatgactgttgc
cctgctcaaccaggagaaggcactgcatttaatgttaccatgggttataaataccctcct
ctgtgccttggacatgcacctggttgtatccatctagaaactcaagtctgggctgcttat
cttccggagatatcagctacagaggaaccgggatatttgatctctggcctctccctttct
cctttaaaataa
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_7|265_aa
MWESLELPRDLLNGFDQNAGSDMDNKVQAEVVSDGDKELGKYDKGDYCCTKRLATFCSYP
RDLWNFEFERDDLRYLVEEISKWQSVQEEAEHKSLKNLQPDNAIEMKDPFSEEKFKPAAE
ICIRIWFPVTANLAVAKRGQGIVWAMASEGASPKPWQLSCGVRPLVYRSQELRFGNLYLD
FRGCMEMSGCSGRNLLQGQGLHEEPVLWQYGREVWGWSNHTESPLGHYLWELSEEGHCPL
DPRMVDPLTACAVCLEKPQKLNTSP
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_7|798_bp
atgtgggaaagtttggaacttccaagagacttgttgaatggctttgaccaaaatgctggt
agtgatatggacaataaagtccaggctgaggtggtctcagatggagataaggaactaggg
aagtatgacaaaggagactattgttgtacaaagagactggcgacattttgctcctatcct
agagatttgtggaattttgaatttgagagagatgatttaagatatctagtagaagaaatt
tctaagtggcaaagtgttcaagaggaagcagagcataaaagtttgaaaaatttgcagcct
gacaatgcaatagaaatgaaagacccgttttctgaggagaaattcaagccagctgcagaa
atttgcataaggatttggttccctgtgacagccaatctagctgtggctaaaaggggccaa
ggtatagtttgggccatggcttcagagggtgcaagccccaagccttggcaactttcatgt
ggtgttaggcctctggtgtacagaagtcaagaattgaggttcgggaacctctacctagat
ttcagaggatgtatggaaatgtctggatgttcaggcagaaatttgctgcaggggcagggc
ctgcatgaagaacctgtgctatggcagtatggaagggaagtgtggggttggagcaaccac
acagaatccccactggggcactacctatgggagctgtcagaagaaggccactgtcctcta
gaccccagaatggtagatccactgacagcatgtgctgtgtgcctggaaaagccacagaaa
ctcaacaccagcccatga
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_8|463_aa
MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHP
KMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGARPLHALQVLL
LLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHPKNIQSLEVIG
KGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAEPKSSQYYLSFRTAVPRCPLQA
SGLEGQPGIKRAGEAQESLATETQPEFPIALSTEILLEALPQHELRSRVLRLTPRAAVPG
VAAPATCGRLRQRETLHTLEAHPIASALADSCLTGEEIQQSSPYSHTPACILPPHTEASP
TSIATPHIALLVSACRWQILLSLSCQRTGIHAVLTPPNGHREGTQNCARQCPALKPTSPE
DKPTNMRKNQCKNSENSKSQSAFLPLSNHITSPARVLNRAEMA
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_8|1392_bp
atgagcctcctgtccagccgcgcggcccgtgtccccggtccttcgagctccttgtgcgcg
ctgttggtgctgctgctgctgctgacgcagccagggcccatcgccagcgctggtcctgcc
gctgctgtgttgagagagctgcgttgcgtttgtttacagaccacgcaaggagttcatccc
aaaatgatcagtaatctgcaagtgttcgccataggcccacagtgctccaaggtggaagtg
gtagcctccctgaagaacgggaaggaaatttgtcttgatccagaagccccttttctaaag
aaagtcatccagaaaattttggacggtgcgagaccacttcatgccttgcaggtgctgctg
cttctgtcattgctgctgactgctctggcttcctccaccaaaggacaaactaagagaaac
ttggcgaaaggcaaagaggaaagtctagacagtgacttgtatgctgaactccgctgcatg
tgtataaagacaacctctggaattcatcccaaaaacatccaaagtttggaagtgatcggg
aaaggaacccattgcaaccaagtcgaagtgatagccacactgaaggatgggaggaaaatc
tgcctggacccagatgctcccagaatcaagaaaattgtacagaaaaaattggcagaaccc
aagtcttcccagtactatcttagtttccgcaccgcagttcctcggtgtccacttcaggct
tccggactggaaggacagccgggaataaaacgtgccggcgaggctcaggagtcattggcc
acagagacccagcccgagtttcccatcgcactgagcactgagatcctgctggaagctctg
ccgcagcatgagctccgcagccgggttctgcgcctcacgccccgggctgctgttcctggg
gttgctgctcctgccacttgtggtcgccttcgccagcgggaaaccttgcataccctggag
gcccaccccatagcttctgcactggcagactcatgcctgactggtgaagagatccagcag
agcagcccctacagccacacaccagcctgcattctccctccccatactgaagcttccccc
acatccattgctactccccacattgctttgctggtgtctgcctgcaggtggcagattttg
ctttccttgtcctgccagcgcacaggaatacatgcggtcctcacccctcccaatgggcat
agagaaggcacacagaactgtgctcgccagtgccctgcacttaagccaacatcaccggaa
gataagcccacaaatatgagaaagaatcagtgcaagaactctgaaaattcaaaaagccag
agtgccttccttcctctaagcaaccatatcacctctccagcaagggttctgaacagggct
gaaatggcttaa
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_9|215_aa
MWHFIILFQETATATLTTSLLNQQPSTSRLFQACVVFILPVSAYNVTFSVTIVLRPIRGV
NPLSCKSGTGLHPTRALADCLIVLWFTPQLTSITGMSDLKIHLLLVNASNIEFYGKPLIL
NDREKYSLTFLSESNTTMEVGYEIDLESYEQNFFKREIEWNRKFEKGVMFSLYAWRYPYP
VLVLSTLSIPVEAFANFLVRLGIFPRTSMDSTADT
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_9|648_bp
atgtggcacttcatcatcttatttcaagaaactgccacagccaccttaaccacctccctg
ctcaatcagcaaccatcaacatccaggctctttcaggcttgtgtcgtttttatccttcca
gtgtcagcttacaatgtcaccttctcagtgaccattgttctaaggcctatccggggagtg
aatcctctcagctgcaagtctgggacgggacttcatcctactagagccctagctgattgc
ttgatagttctttggtttacaccacagttgacatccataacaggaatgtctgacttaaag
attcatttgttactggtcaatgcatctaacattgagttttacggaaaacccctgatactt
aatgacagagagaagtattctctgacttttctgagtgagtcaaataccactatggaagta
ggttatgaaatagatttagagagttatgagcagaacttttttaaaagggaaatagaatgg
aatagaaaatttgaaaaaggagttatgttctcactctatgcgtggcggtatccatatcct
gtcttggtactttctacgctttccattcctgttgaagctttcgctaacttccttgtgagg
ctgggaattttccccaggacttctatggattcaacagcagatacataa
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_10|148_aa
MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV
NSHTQEWEESLSQPRIPHGSENHRKDTEQDELHLPESLAASRTSMDFPIFMPLITWLSPT
GAFSIPQCCPIYLENSPSFKTHISNDFL
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_10|447_bp
atggcccacgccacgctctccgccgcccccagcaatccccggctcctgcgggtggcgctg
ctgctcctgctcctggtggccgccagccggcgcgcagcaggagcgtccgtggtcactgaa
ctgcgctgccagtgcttgcagacactgcagggaattcacctcaagaacatccaaagtgtg
aatagccacactcaagaatgggaagaaagcttgtctcaaccccgcatcccccatggttca
gaaaatcatcgaaaagatactgaacaagatgaactccatctacctgagtcacttgctgct
tctaggacttccatggactttccaatcttcatgcctttgattacttggttgtctcctact
ggagcgttttccattcctcagtgttgtcccatctacttggaaaactctccatcctttaaa
actcacataagtaacgatttcttgtag
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_11|177_aa
MGIQNSPALLLMAVIVFGTFAVSVDSDLYTELRCVYVKSTFVLHPRNIHNLELVSAGPHC
SKDEVMMEQCLSLGSSKMQNLSHEPAMQREEGRYAGYKRRGHVIQPWLPRTLTLNSNFDT
DNLLPPNGKRKQGILSVIREYAKQGTSRTFFSGIRDDGCTFTESMMLDVHEITLNRK
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_11|534_bp
atgggtattcagaactcaccagcactcctcctgatggctgtcattgtgtttggcacattt
gctgtaagtgtagacagtgacttgtacactgaactgcgctgcgtgtatgtgaagtcaacc
tttgtacttcatcccagaaacatccacaatttggagttggtctcagcaggaccccattgc
agcaaagacgaagtaatgatggagcagtgcctaagtttaggctcctccaaaatgcagaat
ttgagtcatgagcctgccatgcagagggaggaaggacgttatgcaggatacaaaagaaga
ggtcatgttatacagccctggcttccacggacactaacactgaattcaaattttgacact
gataatctgttgccaccaaatggaaaacgtaaacaaggtattctaagtgtgattagagaa
tatgcaaaacaaggaacaagtagaacattcttctctggaatccgagacgatggctgtact
ttcacagagagcatgatgttagatgtacatgaaataacgctaaaccgaaaatga
>gi568815594r:73881207_74081953|GENSCAN_predicted_peptide_12|51_aa
GLEKLRKGSRCNEIIFNIKQNIETVKLKVPQNKTVCGNEVFTEVTRLSGVH
>gi568815594r:73881207_74081953|GENSCAN_predicted_CDS_12|156_bp
ggactggaaaaactcagaaagggttccagatgtaatgagattatatttaacattaagcaa
aatatagaaactgtgaaattaaaagtacctcagaataagactgtatgtggaaatgaagtc
tttacagaggtaaccaggttaagtggagttcattag