GENSCAN 1.0	Date run: 4-Nov-116	Time: 12:59:49

Sequence gi568815597f:162969485_163174557 : 205073 bp : 37.93% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3098   3220  123  2  0   53   39   115 0.350   1.88
 1.02 Intr +   5531   5664  134  2  2   83   94    82 0.824   7.67
 1.03 Intr +  44400  44525  126  1  0  104   23    78 0.418   2.63
 1.04 Intr +  45245  45389  145  0  1   64  110    45 0.577   2.72
 1.05 Term +  56982  57063   82  0  1   56   53   101 0.206  -0.41
 1.06 PlyA +  57713  57718    6                                1.05

 2.08 PlyA -  57839  57834    6                                1.05
 2.07 Term -  61711  61629   83  2  2   83   43    55 0.070  -2.62
 2.06 Intr -  80169  80088   82  0  1   68   95    57 0.274   2.69
 2.05 Intr -  81674  81570  105  2  0   71  103    43 0.435   3.59
 2.04 Intr -  83692  83512  181  0  1   27   23   140 0.309   0.35
 2.03 Intr -  89108  89009  100  0  1  102   78    76 0.826   6.25
 2.02 Intr -  91825  91732   94  1  1   58   45    64 0.409  -2.28
 2.01 Init -  96686  96495  192  2  0   77   81    76 0.150   4.81
 2.00 Prom -  96860  96821   40                               -5.15

 3.00 Prom +  97240  97279   40                              -10.45
 3.01 Init +  97377  97432   56  2  2   70  106    34 0.780   4.21
 3.02 Intr +  99333  99508  176  0  2   60   71    95 0.978   3.66
 3.03 Intr +  99768 100044  277  1  1  126   94   225 0.979  22.55
 3.04 Intr + 102911 103015  105  2  0   96   94    56 0.976   5.41
 3.05 Intr + 103321 103382   62  1  2   88  101    26 0.999   1.36
 3.06 Intr + 103972 104138  167  2  2  105   92   129 0.999  13.76
 3.07 Intr + 104837 105058  222  1  0  106   -8   272 0.405  16.80
 3.08 Intr + 107244 107292   49  2  1   75   61    23 0.501  -4.37
 3.09 Intr + 108256 109161  906  2  0    8  -57   447 0.596  12.52
 3.10 Term + 109480 110249  770  2  2   67   48   279 0.718  14.37
 3.11 PlyA + 110655 110660    6                               -0.45

 4.00 Prom + 111027 111066   40                               -3.65
 4.01 Init + 124138 124282  145  0  1   92   65   131 0.988  11.53
 4.02 Intr + 128642 128802  161  0  2   41   91    60 0.508   0.39
 4.03 Intr + 129380 129661  282  1  0   60  -18   197 0.194   2.99
 4.04 Term + 129692 130153  462  1  0   30   43   386 0.534  22.37
 4.05 PlyA + 132763 132768    6                                1.05

 5.02 PlyA - 133751 133746    6                                1.05
 5.01 Sngl - 140226 139582  645  0  0  100   33   248 0.824  16.43
 5.00 Prom - 148567 148528   40                               -6.45

 6.00 Prom + 158786 158825   40                               -4.85
 6.01 Init + 162012 162165  154  2  1   39   77   143 0.850   8.49
 6.02 Term + 162650 162966  317  0  2   31   48   198 0.372   4.22
 6.03 PlyA + 164193 164198    6                                1.05

 7.06 PlyA - 164680 164675    6                                1.05
 7.05 Term - 178019 177858  162  2  0  100   39   180 0.898  11.25
 7.04 Intr - 183232 183066  167  2  2  122   73   108 0.516  11.46
 7.03 Intr - 192492 192431   62  2  2   61   93    79 0.779   3.16
 7.02 Intr - 198884 198774  111  1  0  128   81   115 0.989  13.38
 7.01 Init - 200255 200038  218  2  2   51   53   117 0.146   2.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:162969485_163174557|GENSCAN_predicted_peptide_1|203_aa
XTRKKTSSRMEAPVASLSLLESLTSQRAHIPAAEEQRRLNGVHTYLPVDDSVGLIVSQNF
YETGENSSSKNSSELEAKILSQLVTLDLATQQSYWFPGWYWRLSAQFCDVNHLHSGLSAM
DSSSCSSRVFLANLPKDLCKTKSEMASLRTERAHRALPAASSTPEFCLALKICLRSRRKP
LVSQVECDGVDMIFHYHRLDQGS

>gi568815597f:162969485_163174557|GENSCAN_predicted_CDS_1|612_bp
nnaactaggaagaaaacatcatcacgaatggaggctccagtggcctctttgtccctttta
gagtctcttacctcccaaagagcccacattcctgcagctgaggagcagagacgtctgaat
ggagtgcacacatatttgcctgttgatgatagtgtgggcctaattgtttctcagaacttc
tatgagactggagaaaacagttctagtaagaacagcagtgaattggaagcaaaaattttg
tcccaacttgtaactttggatctagccacccagcagagctattggtttccgggctggtac
tggcggctgtctgcacagttctgtgatgtcaaccatcttcattcaggtctctcagccatg
gatagcagcagctgctccagtagagtgtttctggcaaacctccccaaggacctctgcaag
acaaagtcagaaatggcttccctaaggacagagagagcccacagggctcttcctgctgct
tcctctacccctgaattttgcttggctctcaaaatttgtctcaggtccaggagaaagccc
ctggtgagccaggtggaatgtgatggtgtagatatgattttccactatcatcgattagat
caaggatcttga

>gi568815597f:162969485_163174557|GENSCAN_predicted_peptide_2|278_aa
MEPQKDEGARSQRMVRNYLRGTMHIIQVMNTQGARSQRMVRNYLRGTMHIIQVMNTQGAR
SQRMHLSFRSKAGIKTSEKCPCRFPSTGATQGRFHRKQDDGVNVSGVSELSNDFNLQSST
GINITKKLNPSPQCDAIRRKGLLEVMGHDDGTFINGISALTEEIPEGILTLFPPYKDTRR
NWEAATQKKVSVGVTVCGIGTSLFGYKRETEMTALHLLEEEREQALAQSVLMCFRTHVLM
KLSQQAVGWECAKLSNYIRATSGHFQEEKGKKPQNKQM

>gi568815597f:162969485_163174557|GENSCAN_predicted_CDS_2|837_bp
atggagcctcagaaggatgagggggcaagaagtcagcggatggtgagaaattacttaagg
ggtacaatgcacatcattcaggtgatgaatacccagggggcaagaagtcagcggatggtg
agaaattacttaaggggtacaatgcacatcattcaggtgatgaatacccagggggcaaga
agtcagcggatgcatttaagttttaggagtaaagctggaatcaagacatcagaaaagtgc
ccgtgcaggttcccaagcacaggagctacccaaggacgattccataggaagcaagatgat
ggagtcaatgtctctggtgtcagtgaattatctaatgactttaatcttcagtcttcaact
ggcattaatataactaaaaagttaaaccctagcccccagtgtgatgctattaggaggaag
ggccttttggaggtaatgggacatgatgatgggacctttattaatgggattagtgccctt
acagaagaaatcccagaagggattctcaccctctttccaccatataaggatacaaggaga
aactgggaagctgcaacccagaagaaggtctcagtaggtgttacagtatgtggcataggg
acttcactatttggatacaagagagaaacagaaatgactgcacttcatttgctggaagaa
gaaagggagcaggccctagcacagtcagtgctcatgtgcttccgcacacatgttctcatg
aagctttcacagcaagctgtgggctgggagtgcgctaagctaagtaactatatcagggca
acttcaggtcatttccaagaagagaaaggcaaaaagcctcaaaataagcagatgtga

>gi568815597f:162969485_163174557|GENSCAN_predicted_peptide_3|929_aa
MMYQSSPPQTQSASPTVKNYFSPPSCLHNNQGNFRKSLRGDDDPASSLLEIVRIRSWTMY
NMMLLIQKRKGIGSQLLRAGEAEGDRGAGTAERSSDWLDGRSWAIKETPTGLAGRRSEDS
DNIFTGEEAKYAQSRSHSSSCRISFLLANSKLLNKMCKGLAGLPASCLRSAKDMKHRLGF
LLQKSDSCEHNSSHNKKDKVVICQRVSQEEVKKWAESLENLISHECGLAAFKAFLKSEYS
EENIDFWISCEEYKKIKSPSKLSPKAKKIYNEFISVQATKEVNLDSCTREETSRNMLEPT
ITCFDEAQKKIFNLMEKDSYRRFLKSRFYLDLVNPSSCGAEKQKGAKSSADCASLASKLM
LPVKHTHLNNKDNVYQGPPFEKHSLFENKDTTYENLWDTFKAVCRGKFIALNAHKRKQER
SKIDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFEKIN
KIDRLLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDK
FLDTYTLPRPNQEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQTYKEELVPF
LLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRI
QQHIKKLIHHDQVVLEALARAIRQEKEIKGIQLGEEEVKLSLFADDMIVYLENPIVSAQN
LLKLISNFSKVSGYKINVQKSQALLYINNRQTESQIMSELPFTIASKRIKYLGIQLIRDV
KDLFKENYKPLLNEMKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMTFF
TELEKTTLKFMWNQKRARITKSILSQKNKAGGITLPDFKLYCKATVTKTAWYWYQNRDID
QWNRTEPSEIMPHIYNYLIFDKPDKNKKW

>gi568815597f:162969485_163174557|GENSCAN_predicted_CDS_3|2790_bp
atgatgtaccagtcctcgcctcctcaaacacaatctgcaagtcccacagtgaaaaactat
ttctctccaccttcttgtttgcataacaaccaaggcaacttccgcaaatcactgcgtgga
gacgatgatcctgccagctcccttttggaaatcgtgaggatcagatcttggaccatgtat
aatatgatgcttctaatccaaaagaggaaaggcattgggagtcagctcctaagggctgga
gaggcagagggagacagaggagctggtactgcagagcggtcgtctgattggctggacggt
cgtagctgggctataaaagagacccctacaggcttagcaggaagacgctcagaggattct
gacaatatctttaccggagaagaggcaaagtacgctcaaagccgaagccacagctcctcc
tgccgcatttctttcctgcttgcgaattccaagctgttaaataagatgtgcaaagggctt
gcaggtctgccggcttcttgcttgaggagtgcaaaagatatgaaacatcggctaggtttc
ctgctgcaaaaatctgattcctgtgaacacaattcttcccacaacaagaaggacaaagtg
gttatttgccagagagtgagccaagaggaagtcaagaaatgggctgaatcactggaaaac
ctgattagtcatgaatgtgggctggcagctttcaaagctttcttgaagtctgaatatagt
gaggagaatattgacttctggatcagctgtgaagagtacaagaaaatcaaatcaccatct
aaactaagtcccaaggccaaaaagatctataatgaattcatctcagtccaggcaaccaaa
gaggtgaacctggattcttgcaccagggaagagacaagccggaacatgctagagcctaca
ataacctgctttgatgaggcccagaagaagattttcaacctgatggagaaggattcctac
cgccgcttcctcaagtctcgattctatcttgatttggtcaacccgtccagctgtggggca
gaaaagcagaaaggagccaagagttcagcagactgtgcttccctggcctcaaagctaatg
cttccagtgaaacacacgcatcttaataataaggataatgtatatcaaggaccacctttt
gaaaaacactcattattcgagaacaaagacacaacatacgagaatctctgggatacattc
aaagcagtgtgtagagggaaatttatagcactaaatgcccacaagagaaagcaggaaaga
tctaaaattgataccctaacatcacaattaaaagaactagaaaagcaagagcaaacacat
tcaaaagctagcagaagacaagaaataactaagatcagagcagaactgaaggaaatagag
acacaaaaaacccttcaaaaaattaatgaatccaggagctggttttttgaaaagattaac
aaaattgatagactgctagcaagactaataaagaagaaaagagagaagaatcaaatagac
acaataaaaaatgataaaggggatatcaccaccgatcccacagaaatacaaactaccatc
agagaatactataaacacctctacgcaaataaactagaaaatctagaagaaatggataaa
ttcctcgatacatacaccctcccaagaccaaaccaggaagaagttgaatctctgaataga
ccaataacaggctctgaaattgaggcaataatcaatagcttaccaaccaaaaaaagtcca
ggaccagatggattcacagctgaattctaccagacgtacaaagaggagctggtaccattc
cttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttatgag
gccagcatcatcctgataccaaagcctggcagagacacaaccaaaaaagagaattttaga
ccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatc
cagcagcacatcaaaaagcttatccaccatgatcaagtggtgttggaagctctggccagg
gcaattaggcaggagaaggaaataaagggtattcaattaggagaagaggaagtcaaattg
tccctgtttgcagatgacatgattgtatatctagaaaaccccatcgtctcagcccaaaat
ctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaa
tcacaagcactcttatacatcaataacagacaaacagagagccaaatcatgagtgaactc
ccattcacaattgcttcaaagagaataaaatatctcggaatccaacttataagggatgtg
aaggacctcttcaaggagaactacaaaccactgctcaatgaaatgaaagaggatacaaac
aaatggaagaacattccatgctcatgggtaggaagaatcaatatcatgaaaatggctata
ctgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttc
acagaattggaaaaaactactttaaagtttatgtggaaccaaaaaagagcccgcatcacc
aagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaacta
tactgcaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagac
caatggaacagaacagagccctcagaaataatgccacatatctacaactatctgatcttt
gacaaacctgacaaaaacaagaaatggtga

>gi568815597f:162969485_163174557|GENSCAN_predicted_peptide_4|349_aa
MPSRQQATDQQPQNHVEFGWGSWRRARATKRPNSKRKPSPFWLPHLLRGDLRVFTRVTVQ
WGKGNDQTFQGLLDTDSELMLIPGYLKCHCCSPVKVGTYGGQAHQKKFAFSWQGQQHALA
LLPQGVSTLALCHLIRRDLDHFLLPQDITLVHYIDDIMLIGSSEQEVANTLDLSVGHLHA
RGWEIQGPSTSVKFLGVKEKLLHLATPTTKKVVQGLVGLFGFWRQHSPHLGVLLWPIYQV
TQKAASFEWGPEQEKALQQVQAAVQAALIPGPYDPADPVIFEVSVADRDAVWRLWQASIG
ESQQRALGFWSKALRSSANNYSPFERQLLACYWALVETEHLTMGHQVTM

>gi568815597f:162969485_163174557|GENSCAN_predicted_CDS_4|1050_bp
atgccttcacgccagcaggccactgaccagcagccacagaaccatgtggagtttggctgg
ggcagttggaggagagcccgggccaccaagcggcccaactccaagagaaaaccatctccc
ttctggctcccccatctgctgagaggagacctccgggtttttaccagggtaactgtgcag
tggggaaagggaaatgatcagacatttcagggactactggacactgattctgagctgatg
ttgattccagggtacctaaaatgtcattgttgttctccagttaaagtagggacttatgga
ggtcaggcccaccagaagaaatttgccttcagctggcaaggccagcaacatgcccttgct
ctcctacctcagggtgtatcaactctggctttgtgtcatcttattcggagagaccttgat
cactttttgcttccacaagatatcacactggtccactacattgatgatattatgctaatt
ggatccagtgagcaagaagtagcaaatacactggatttatcagtgggacatttgcatgcc
agaggatgggaaattcagggaccttctacctcagtaaaatttcttggggtgaaggaaaag
ttgctgcatttggccactcctacaaccaagaaggtggtacaaggcctagtgggcctattt
ggattttggagacaacacagtcctcacttgggtgtgttactctggcccatttatcaagtg
acccaaaaggctgccagttttgagtggggtccagaacaagagaaagctctgcaacaggtc
caggctgctgtgcaagctgctctgatacctgggccatatgacccagcagatccagtgatc
tttgaggtgtcagtggcagatagggatgctgtttggagactttggcaggcctccataggt
gaatcacagcagagggctctaggattttggagcaaggccctccgatcttctgcaaataac
tactctccttttgagagacagctcttggcctgttactgggctttggtggaaaccgaacat
ttgactatgggtcatcaagtcaccatgtga

>gi568815597f:162969485_163174557|GENSCAN_predicted_peptide_5|214_aa
MAERGQHRAQAVASEGGSPKPWHLPCVIESRIEVWEPPPRFQKMYRNAWIPRQKFAAGAG
PSWRTSASPVWKGYVESEPPHGVPTGALPSGAVRRGPPSSRPQDDRSTDSLHYAPEKSTD
TQHQPVKAARTGAVSCKDTAAVLPKTMGIHLLHQRNLDVRPGLKGDHFVALKFDCPAGFW
TCMGPISPLFWPISPISNGCSYPIPVLPLYLGSS

>gi568815597f:162969485_163174557|GENSCAN_predicted_CDS_5|645_bp
atggctgaaaggggccaacatagagctcaagctgtggcctcagagggtggaagccccaag
ccttggcatcttccatgtgttattgagtcaagaattgaggtttgggaacctccacctaga
tttcagaagatgtatagaaacgcctggatacctaggcagaagtttgctgcaggggcgggg
ccctcatggagaacctctgctagcccagtgtggaagggatatgtggagtcagaaccccca
catggagtgcctactggggcactgcctagtggagctgtgagaagagggccaccgtcctcc
agaccccaggatgatagatccactgacagcttgcactatgcacctgaaaaaagcacagac
actcaacaccagcctgtgaaagcagccaggacgggggctgtatcctgcaaagacacagcg
gcagtgctgcccaagaccatgggcatccacctcttgcatcagcgtaacctggatgtgaga
cctggactcaaaggagatcattttgtagctttaaaatttgactgccctgctggattttgg
acttgcatgggccctatatcccctttgttttggccaatttctcccatttcgaatggctgt
agttacccaatacctgtacttccattgtatctaggaagtagctag

>gi568815597f:162969485_163174557|GENSCAN_predicted_peptide_6|156_aa
MWKQLWNWVTGRNWKSLEGSEEDGKTRESLGFLRDFFNDCDPNADRSMDSEGLEALEKRM
VLAARPRGPLHCSAAGHCSLILAALAPAVGQRGLDTAWAAALESESHHKPRHLSHGVTSD
GLQNATVKEAWQLPSKFQSMYQKPCHRSRVPMERLY

>gi568815597f:162969485_163174557|GENSCAN_predicted_CDS_6|471_bp
atgtggaagcagctgtggaactgggtaacaggcagaaattggaagagtttggagggctca
gaagaagatgggaaaaccagggaaagtttgggatttcttagagactttttcaatgattgt
gacccaaatgctgatagaagtatggacagtgaaggcctagaagccttggagaaaagaatg
gttttggcagccaggcctagggggccacttcactgctcagctgcaggacactgttccctc
atcctggctgctctagctcccgctgtgggtcaaaggggcctagatacagcttgggctgca
gctttggagagtgaaagccaccataagcctcggcatctttcacatggtgttacatctgat
ggcttgcagaatgcaacagtgaaggaggcttggcagcttccatctaaatttcagagtatg
tatcagaaaccctgccacagaagcagagtccccatggagagactctactag

>gi568815597f:162969485_163174557|GENSCAN_predicted_peptide_7|239_aa
MMDNVIILQMRELRSGSKKWSFYFSKEDIYAAKKHMKKCSSSLAIREMQIKTTVRYHLTP
VRMAIIKKSGNNRAKEIKIKLGILLQKPDSVGDLVIPYNEKPEKPAKTQKTSLDEALQWR
DSLDKLLQNNYGLASFKSFLKSEFSEENLEFWIACEDYKKIKSPAKMAEKAKQIYEEFIQ
TEAPKEVNIDHFTKDITMKNLVEPSLSSFDMAQKRIHALMEKDSLPRFVRSEFYQELIK

>gi568815597f:162969485_163174557|GENSCAN_predicted_CDS_7|720_bp
atgatggataacgtcatcattttacagatgagggaactgaggtctggcagtaaaaagtgg
tcattttacttctcaaaagaagatatttatgcagccaaaaaacacatgaaaaaatgctca
tcatcactggccatcagagaaatgcaaatcaaaaccacagtgagataccatctcacacca
gttagaatggcgatcatcaaaaagtcaggaaacaacagggccaaggagattaagatcaag
ttgggaattctcctccagaagccagactcagttggtgaccttgtcattccgtacaatgag
aagccagagaaaccagccaagacccagaaaacctcgctggacgaggccctgcagtggcgt
gattccctggacaaactcctgcagaacaactatggacttgccagtttcaaaagtttcctg
aagtctgaattcagtgaggaaaaccttgagttctggattgcctgtgaggattacaagaag
atcaagtcccctgccaagatggctgagaaggcaaagcaaatttatgaagaattcattcaa
acggaggctcctaaagaggtgaatattgaccacttcactaaggacatcacaatgaagaac
ctggtggaaccttccctgagcagctttgacatggcccagaaaagaatccatgccctgatg
gaaaaggattctctgcctcgctttgtgcgctctgagttttatcaggagttaatcaagtag