GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:05:31 Sequence gi568815595r:138845595_139046722 : 201128 bp : 45.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1603 1729 127 0 1 104 60 48 0.520 3.92 1.02 Term + 11854 11885 32 2 2 136 50 18 0.743 0.82 1.03 PlyA + 15112 15117 6 1.05 2.00 Prom + 15706 15745 40 -3.66 2.01 Init + 19240 19242 3 0 0 88 115 0 0.554 2.70 2.02 Intr + 30325 30453 129 0 0 96 78 33 0.518 3.99 2.03 Intr + 54007 54099 93 0 0 115 -4 64 0.009 0.06 2.04 Intr + 90273 90669 397 2 1 26 80 206 0.213 7.75 2.05 Intr + 92201 92387 187 0 1 46 105 124 0.623 8.65 2.06 Intr + 92563 92710 148 1 1 29 93 47 0.308 -0.46 2.07 Intr + 93007 93208 202 0 1 45 56 146 0.461 5.96 2.08 Term + 98378 98586 209 0 2 1 36 147 0.140 -1.70 2.09 PlyA + 98614 98619 6 1.05 3.06 PlyA - 98655 98650 6 1.05 3.05 Term - 101179 99998 1182 1 0 52 54 1732 0.872 157.97 3.04 Intr - 101937 101803 135 0 0 89 49 60 0.720 2.96 3.03 Intr - 104681 104517 165 2 0 71 56 77 0.802 2.96 3.02 Intr - 105946 105830 117 1 0 72 98 137 0.782 13.76 3.01 Init - 107384 107289 96 2 0 60 77 67 0.776 3.11 3.00 Prom - 107888 107849 40 -5.06 4.03 PlyA - 108284 108279 6 1.05 4.02 Term - 115234 114717 518 2 2 -23 38 403 0.904 18.68 4.01 Init - 127596 127422 175 0 1 52 95 72 0.212 1.85 4.00 Prom - 140559 140520 40 -4.46 5.03 PlyA - 141115 141110 6 1.05 5.02 Term - 142474 142377 98 2 2 124 47 87 0.862 6.33 5.01 Init - 152370 152313 58 0 1 62 97 40 0.556 3.77 5.00 Prom - 158584 158545 40 -3.66 6.06 PlyA - 158882 158877 6 -0.45 6.05 Term - 160949 159874 1076 0 2 -4 43 1028 0.676 81.39 6.04 Intr - 175171 174341 831 2 0 29 30 868 0.001 66.47 6.03 Intr - 178330 178200 131 0 2 56 40 67 0.162 -0.96 6.02 Intr - 182551 182406 146 1 2 75 39 117 0.102 4.58 6.01 Init - 182807 182607 201 2 0 60 86 84 0.499 4.28 6.00 Prom - 182882 182843 40 -9.55 7.02 PlyA - 183014 183009 6 1.05 7.01 Sngl - 184097 183399 699 2 0 56 42 262 0.845 14.61 7.00 Prom - 186919 186880 40 -2.76 8.02 PlyA - 187611 187606 6 1.05 8.01 Sngl - 199026 198238 789 0 0 69 44 863 0.751 73.93 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 72663 72543 121 2 1 24 32 196 0.900 5.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:138845595_139046722|GENSCAN_predicted_peptide_1|52_aa MAMRVTSGNPHCYTPTSAMTVYKCHGNVRKLPYGNLYGLKRGDQCISFVKVG >gi568815595r:138845595_139046722|GENSCAN_predicted_CDS_1|159_bp atggcgatgagagtgacctctggtaatcctcactgctacactcccaccagtgccatgaca gtttacaaatgccatggcaacgtcaggaagttaccctatggtaacctgtatggtctaaaa aggggagatcaatgtatcagctttgtgaaagtaggttag >gi568815595r:138845595_139046722|GENSCAN_predicted_peptide_2|455_aa MRHHGTLSTLLQNFRYTVSPIFSETIQFSNPIRNGNEFSLAGNECAKLVPVFKPWDLLLA MLKMLYEDFLMADSLGAAPSGRRGALNLVGETSALQETEAAKRLGHHQVRLKRGGQVDRD PWAPINAMVAESKGDEHSLGPSPESPSVRLWPCLVGRGSRRAPQSGGIASAELRLSALSL GPSTGASGVCGEKSRIEKYGRVSSVAAVHTRARLSVPGDTVWHRRSCSAVGATWEVGFRV LLWSAGKSGSGSAGQRFGAAGLLYLSSAGRSGKSAPGSLLPSQAMVKGGEARVCHGLGEN AAAGSLWAAALARGPKTVKESVAQKIRGRALEESPRPGEGTEAPLRGAAGTGDARVRIWP ANRAQRSRKPESPQRLSTIPFLGIGSVAPCERTTAEKVFLLTLNSRNGPVMTQKTNNPRK TKQQRQPQTPATPVALNLCLTHSEKDKILLKIFLF >gi568815595r:138845595_139046722|GENSCAN_predicted_CDS_2|1368_bp atgagacaccacgggaccttatctaccctcttgcagaatttcagatacactgtgtctcca atattctcggaaaccatccagtttagtaaccctatcagaaacggaaatgagtttagcttg gcaggaaatgagtgtgccaagcttgttcctgtcttcaagccatgggatttgttgttggct atgcttaaaatgctctatgaagatttcctcatggctgactccttgggagcagccccttcc ggtcgtcgaggcgctcttaatttggttggagagacaagtgcactacaagaaacagaggca gccaagcgcctgggccaccatcaagtaaggttgaaaaggggagggcaggtagatcgggac ccgtgggcgccaatcaatgctatggtggcggagagtaaaggggacgaacacagcctgggg ccatccccggagtcccccagcgtccgcctgtggccgtgcctggttggccgcggatcccgg cgggcgccgcaaagcggcgggattgccagcgcagagctccggctctctgccttgtccctg ggtccgagcaccggagcctctggtgtctgcggggagaagtctcggattgagaaatacggg agggtctcgtcagtggctgcagtacacacgcgagcacgactgtctgtgcctggagacacc gtatggcaccggaggagctgctcagccgtcggtgccacctgggaagtagggtttcgggtg ctgctctggagcgcgggaaagtccggatccgggtctgctggtcagcgcttcggcgccgcc gggctcctttatctctcatccgccggccggtcagggaagtcggcccccgggagcctcctg ccctcacaagccatggtgaagggaggcgaagccagggtttgccacggcctgggagaaaat gcggctgcagggtctctttgggctgcagcgctggcccggggccccaagactgttaaggag agcgttgcgcagaaaattcgaggccgggcgctggaggagtctccgcggcccggagaaggc acagaggcgcccctgagaggcgcagctggaacaggcgatgcacgggttcggatctggccg gccaaccgagcccagcggtcgcgaaaaccagagtcgccacagaggttgagcacgattcca tttctggggatcgggtccgtcgcgccttgtgaacgaaccacggcagaaaaagtctttctg ctgacgttaaacagccgcaacgggccagtgatgacacagaaaacaaacaatcccaggaaa acaaagcagcagcgacagccacaaacgcctgccaccccggttgctctcaatctttgttta actcattctgaaaaagataaaattttgctgaaaatcttcctcttttaa >gi568815595r:138845595_139046722|GENSCAN_predicted_peptide_3|564_aa MRDKRLHTGYRVQCSGDKCTKISEITIKELIHLASTMKKADNCPAFCLIDDGAHTPWDKM VLEDSYPNISKDSRPWGEKRLYQSRVDCSRRLSGPPRRGARLRGGAGAETALNRRFTGTV SAGGATPRARSSASRRQQTAEQRSRFLSPESREEAWSAFPRWGPSFLGFGRRLERRAEAA RPRRAGFVMMASYPEPEDAAGALLAPETGRTVKEPEGPPPSPGKGGGGGGGTAPEKPDPA QKPPYSYVALIAMAIRESAEKRLTLSGIYQYIIAKFPFYEKNKKGWQNSIRHNLSLNECF IKVPREGGGERKGNYWTLDPACEDMFEKGNYRRRRRMKRPFRPPPAHFQPGKGLFGAGGA AGGCGVAGAGADGYGYLAPPKYLQSGFLNNSWPLPQPPSPMPYASCQMAAAAAAAAAAAA AAGPGSPGAAAVVKGLAGPAASYGPYTRVQSMALPPGVVNSYNGLGGPPAAPPPPPHPHP HPHAHHLHAAAAPPPAPPHHGAAAPPPGQLSPASPATAAPPAPAPTSAPGLQFACARQPE LAMMHCSYWDHDSKTGALHSRLDL >gi568815595r:138845595_139046722|GENSCAN_predicted_CDS_3|1695_bp atgagggataaaagattacacactgggtaccgtgtacaatgctcgggtgacaagtgcacc aaaatctcagaaattaccattaaagaacttatccatttggcctccaccatgaaaaaggcc gacaactgccctgcattctgcctgattgacgatggggcacatactccttgggacaaaatg gtgttggaggactcataccccaacatcagcaaggattcccggccctggggagaaaagcgt ctgtatcagagccgtgtagattgctcgcgccggctgtcgggacctccgaggcgcggggcc aggctgcgaggaggcgcgggagctgagactgcgctcaatcggcggttcactggaaccgta tccgcaggcggcgctacgccccgagctagaagttcagcctcaagacggcagcaaacggca gagcaaaggagtcgttttctttcacctgaaagccgcgaggaggcttggagcgcctttcct cgctggggcccgagcttcctgggctttggccggcgcctggagcggagagcagaggcggcc cggccgcggcgcgccggctttgtcatgatggccagctaccccgagcccgaggacgcggcg ggggccctgctggccccagagaccggtcgcacagtcaaggagccagaagggccgccgccg agcccaggcaagggcggtgggggtggcggcgggacagccccggagaagccggacccggcg cagaagcccccgtactcgtacgtggcgctcatcgccatggcgatccgcgagagcgcggag aagaggctcacgctgtccggcatctaccagtacatcatcgcgaagttcccgttctacgag aagaataagaagggctggcaaaatagcatccgccacaacctcagcctcaacgagtgcttc atcaaggtgccgcgcgagggcggcggcgagcgcaagggcaactactggacgctggacccg gcctgcgaagacatgttcgagaagggcaactaccggcgccgccgccgcatgaagaggccc ttccggccgccgcccgcgcacttccagcccggcaaggggctcttcggggccggaggcgcc gcaggcgggtgcggcgtggcgggcgccggggccgacggctacggctacctggcgcccccc aagtacctgcagtctggcttcctcaacaactcgtggccgctaccgcagcctccctcaccc atgccctatgcctcctgccagatggcggcagccgcagcggctgcagcagctgcggctgca gccgcgggccccggtagccctggcgcggccgctgtggtcaaggggctggcgggcccggcc gcctcgtacgggccgtacacacgcgtgcagagcatggcgctgccccccggcgtagtgaac tcgtacaatggcctgggaggcccgccggccgcacccccgcctccgccgcacccccacccg catccgcacgcacaccatctgcacgcggccgccgcaccgccgcctgccccaccgcaccac ggggccgccgcgccgccgccgggccagctcagccctgccagcccagccaccgccgcgccc ccggcgcccgcgcccaccagtgcgccgggcctgcagttcgcttgtgcccggcagcccgag ctcgccatgatgcattgctcttactgggaccacgacagcaagaccggcgcgctgcattcg cgcctcgatctctga >gi568815595r:138845595_139046722|GENSCAN_predicted_peptide_4|230_aa MVDPALRVALSAGMSGHPHLGAENTPSSPVPSTTPKTKLGTCSMLDRGCYSNTFKSQAAW ESHRASGRQEEGEVTSVQDPSSQELLWACASVPVHLNPERRSRGSPRASCIAASRKDITR LSRALVPIAGCRFRRASAGKERGTKVRKPPVAGVSPASGPEGRATLVLHPESYTLNTRQA WNNTGKRAFLPPPRNPAGLPAVFVLQPSPILNFPPLHATASLSMNTYEDP >gi568815595r:138845595_139046722|GENSCAN_predicted_CDS_4|693_bp atggtggacccagcattgcgggttgctctctcggcaggcatgtctggccatccacacctg ggggcagagaacactccatcttcccctgtgccctccacaacacccaagacaaagctgggc acatgcagcatgctcgataggggctgctactcaaatactttcaaaagccaagcagcgtgg gagagccatcgggcatcaggaagacaggaggaaggcgaggtgaccagtgtgcaggacccc agctctcaggagctgctctgggcttgcgcgtctgtgcccgtgcacctgaaccccgagcgg cgaagccggggctccccgcgggcttcctgcatcgctgcctccaggaaagacatcacccgc ctgtcccgggccctggtccccatcgccggctgccgcttccggcgggcctctgcggggaag gagcgcggtacgaaagtgaggaagccccccgtcgctggggtgagccccgcttccggtccg gagggacgcgccacgctcgtcctacaccctgagtcctacaccctgaacacccgtcaggcc tggaacaacaccgggaaacgtgccttccttcctccacccaggaacccggccggactgccc gcagtctttgtcctgcagccctctcccatcctgaatttcccgcctcttcatgctactgcc tcactatccatgaacacatacgaggacccctaa >gi568815595r:138845595_139046722|GENSCAN_predicted_peptide_5|51_aa MIKYVGNPNESTEKLLDIISSPLPLRQCPEQCLAQIEGTLMKELTFEGEHV >gi568815595r:138845595_139046722|GENSCAN_predicted_CDS_5|156_bp atgatcaaatatgtaggaaatcctaatgaatcaacagaaaaacttctagatataataagc tcacctttaccactccgccagtgtccagagcagtgcctggcccagattgagggaacatta atgaaggagctaacttttgagggcgagcatgtatga >gi568815595r:138845595_139046722|GENSCAN_predicted_peptide_6|794_aa MSELLFTIASKRIKYLGIQLTRDVKDLFEENYKPLLNEIKEDTNKWKNIPCSWIGRINIV KMAILPKNWKNYFKVHMEPKKARIAKTVLSQKNKAGGIMLPDVKLYYKAIVTKTAWWELN NENTWTQKGEHHTPELVKGWEERGGIALGDIPNVNDELIASPTTDICYFKEDFTAALPTS AARPRSAVHSAVEAMVSRPRSPSAFPAPWWGQQPGGPGPAKRLRLEEPAGPEPRAAPSLE DPAGDPAVDALTSIVVLAAGCALRVPLDDVDLVLEPAPTSILRVSLGGHTLILIPEVLLS SVDERSGAQHDSSAGLEVDVFLGAVREDVVVELEFCASVPEIAAQEEAYEEDADPEFPEL RMDSPTGSAAGLYPSSRSMFIPYREGPIPEPCALAPNPSSERRSPRPIFDLEFRLLEPVP SSPLQPLPPSPCVGSPGHLTSHVISHRTCIRVCARKAQDSIARGSLSSTDCGLREAPLEL PGLSPRRAVLRPFRPNPPRLAPTPVTSKRTSLPALPTSSARPKSAVEAMGSRPRSPSAFP APWWGQQPGGPGPAKRLRLEEPAGPEPRVAPSLEDPAGTPAVGALTSIVVLAAGCALRVP LDDVDLVLELPPTSILRVSLDGHTLILIPEVLLSSVDERSGAQDDSSAGLEVDVFLGALR EDVVVEQEVFCASVPEIAAQEEAYEEDADPEFPELQMDSAAGSAAGLYSSARSMFSPYRE GPIPEPCALAPNPSSEGHSPGPFFDPEFRLLEPVPSSPLQPLPPSPRVGSPGPHAHPPLP KRPPCKARRRLFQE >gi568815595r:138845595_139046722|GENSCAN_predicted_CDS_6|2385_bp atgagtgaactcctattcacaattgcttcaaagagaataaaatacctaggaatccagctt acaagggatgtgaaggacctattcgaagagaactacaaaccactgctcaacgaaataaaa gaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaatatcgtg aaaatggccatactgcccaagaattggaaaaactactttaaagttcatatggaaccaaaa aaagcccgcattgccaagacagtcctaagccaaaagaacaaagctggaggcatcatgcta cctgatgtcaaactatactacaaggctatagtaaccaaaacagcatggtgggaattgaac aatgagaacacttggacacagaaaggggaacatcacacaccggagctcgtcaagggctgg gaggagaggggagggatagcattaggagatatacctaatgtaaatgacgagttaatagcc tcccctaccaccgacatctgttacttcaaagaggacttcaccgccgcgctccccacgtcc gctgctaggcccaggagcgccgtccacagcgccgtcgaggcgatggtcagccggccccgc agccccagcgccttccctgctccctggtggggacagcagccaggaggacccggccctgcc aagcgcctccgattggaggagcccgcgggccccgaaccccgcgcggcacccagcctggaa gacccggcgggggacccggccgtggacgcgctcacctccatagtggtcctggccgcgggc tgtgccctgcgtgtgcccctggacgacgtcgacctggtgctggagcccgcaccaacgtcg atcctgcgagtgtctctcggtggacacaccctcatcctgatcccagaggtcctcctgagc tccgtcgacgaacgctcaggagcgcagcacgactcgtctgccgggctggaagtggacgtt ttcctgggcgctgtcagggaggacgtcgtcgtcgagctggaattctgcgcatctgtccca gagatcgccgcccaggaagaggcctacgaggaggacgcggaccccgagttcccggagctc cggatggactccccaaccggctcagccgctgggctctacccctcctctagaagtatgttc atcccctaccgggagggccccatcccagaaccctgtgctctggcccccaaccccagttca gagagacgttctccacgccccatctttgacctggaattccgccttctggagcctgtcccc agctcacctctccaacctctacctccctctccgtgcgtggggagtccaggtcacctgacg tcccacgttattagtcatcggacgtgcatccgggtctgcgccaggaaggcccaagattcc atcgcgcgaggctctctcagcagcacggactgcggtctccgggaggcgcccctggagctc ccaggactctccccgcgccgcgcggtccttcgccccttccgccccaacccgcctcgccta gcaccgacacctgttacctcaaaaaggacttcgctgcccgcgctccctacgtcctctgct aggcccaagagcgccgtcgaggctatgggcagccggccccgcagccccagcgccttccct gcgccctggtggggacagcagccaggaggacccggccctgccaagcgcctccgattggag gagcccgcgggccccgaaccccgcgtggcgcccagcctggaagacccggcgggtaccccg gccgtgggcgcgctcacctccatagtggtcctggccgcgggctgtgccctgcgtgtgccc ctggacgacgtcgacctggtgctggagctcccgccaacgtcgatcctgcgagtgtctctc gatggacacaccctcatcctgatcccagaggtcctcctgagctctgtcgacgaacgctca ggagcgcaggacgactcgtctgctgggctggaagtggacgttttcctgggcgctctcagg gaggacgtcgtcgttgagcaggaagtcttctgcgcatctgtcccagagatcgccgcccag gaagaggcctacgaggaggacgcggaccccgagtttccggagctccagatggactccgca gccggctcagccgctgggctctactcctccgccagaagtatgttcagcccctaccgggag ggccccatcccagaaccctgtgctctggcccccaaccccagttcagagggacactctcca ggccccttcttcgacccggaattccgccttctggagcctgtccccagctcacctctccaa cctctacctccctctccgcgcgtggggagtccaggtccccacgcgcacccgccgctcccg aaacgccctccgtgcaaggcccgcagacgactgttccaggaatga >gi568815595r:138845595_139046722|GENSCAN_predicted_peptide_7|232_aa MKREKNQIDAIKNDKGDITTDRTEIQTTIREYYKHLYANKLENLEEMEKCLDTYTLPRLK QEEVESLNRPITGSEIKAIINSLPTKKSPGPDGFPAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKPGRDTTKKKENFRPISLMNIDAKILNKITATQIQQHIKKLIYH DQVGFIPGMQGWFNTRKSINVIHHINKTKDKNHMIISIDAEKAFDKIQQPSC >gi568815595r:138845595_139046722|GENSCAN_predicted_CDS_7|699_bp atgaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccacc gatcgcacagaaatacaaactaccatcagagaatactataaacacctctacgcaaataaa ctagaaaatctagaagaaatggaaaaatgcctggacacatacaccctcccaagactaaaa caggaagaagttgaatccctgaatagaccaataacaggctctgaaattaaggcaataatt aatagcctaccaaccaaaaaaagtccaggaccagacggattcccagccgaattctaccag aggtacaaagaggagctggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagcctggcaga gacacaacaaaaaaaaaagagaattttagaccaatctccctgatgaacatcgatgcaaaa atcctcaataaaataacagcaacccaaatccagcagcacatcaaaaagcttatctaccac gatcaagtgggcttcatccctgggatgcaaggctggttcaacacacgcaaatcaataaat gtaatccatcacataaacaaaaccaaagacaaaaaccacatgattatctcaatagatgca gaaaaggccttcgataaaattcaacagccttcatgctaa >gi568815595r:138845595_139046722|GENSCAN_predicted_peptide_8|262_aa MGSRPCSPSACLAPWWGQQPGGPGPAKRSRLEEPAGPESRAAPSPEDPAGTPAVDALTSM VVLDAGCALRVPLEDVDLVLELAPMSVLRVSLGGHTLIVIPEVLLSSVDECSGAQGDWSA GLEVDVFLGAHGEDVVVEQEVCASVPEIAAEEEAYEEDADSEFPELWMDSAAGSAAGLYP SARSMFSPYREGPIRGPCALAPNPSSERRSPRPIFDLEFHLLEPVPSSPLQPLPPSPSPG PHARPELPERPPCKVRRRLFQE >gi568815595r:138845595_139046722|GENSCAN_predicted_CDS_8|789_bp atgggcagccggccctgcagccccagcgcctgccttgcgccctggtggggacagcagcca ggaggaccaggccctgccaagcgcagccgattggaggagcccgcgggccccgaatcccga gcggcgcccagcccggaagacccggcggggaccccggccgtggacgcgctcacctccatg gtggtcctggacgcgggctgtgccctgcgtgtgcccctggaggacgtcgacctggtgctg gagctcgcgccaatgtcggtcctgcgagtgtctcttggtggacacaccctcatcgtgatc cccgaggtcctcctgagctccgtcgacgaatgctcaggagcgcagggcgactggtctgcc ggcctggaagtggacgttttcctgggcgctcacggggaagacgtcgtcgtcgagcaggaa gtctgcgcatctgtcccagagatcgctgccgaggaagaggcctacgaggaggacgcggac tctgagttcccggagctctggatggactccgcagccggctcagccgctgggctctacccc tccgctagaagtatgttcagcccctaccgggagggccccatccgagggccctgtgctctg gcccccaaccccagttcagagagacgctctccacgccccatcttcgacctggaattccat cttctggagcctgtccccagctcacctctccaacctctacctccctctccgagtccaggt ccccacgcgcgcccggagctcccagagcgccctccgtgcaaggtccgaagacgcctgttc caggaatga