GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:23:02 Sequence gi568815594r:176115603_176369108 : 253506 bp : 37.45% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 194 377 184 1 1 55 70 136 0.459 6.94 1.02 Intr + 9502 9753 252 2 0 65 99 90 0.831 4.28 1.03 Intr + 15952 16136 185 2 2 56 76 89 0.582 3.09 1.04 Intr + 19506 19674 169 2 1 86 110 82 0.963 8.80 1.05 Intr + 21918 22009 92 1 2 45 94 54 0.406 0.49 1.06 Intr + 30925 31096 172 0 1 48 98 47 0.100 0.29 1.07 Intr + 34441 34571 131 2 2 83 84 31 0.887 1.69 1.08 Intr + 34866 34991 126 2 0 51 79 62 0.694 1.56 1.09 Intr + 36210 36365 156 2 0 41 80 204 0.985 14.19 1.10 Intr + 40477 40541 65 0 2 74 94 35 0.801 -0.70 1.11 Intr + 44392 44524 133 1 1 66 98 103 0.769 8.83 1.12 Intr + 45309 45400 92 2 2 35 61 81 0.354 -2.03 1.13 Intr + 47512 47691 180 1 0 -25 95 155 0.079 2.96 1.14 Intr + 53070 53181 112 0 1 92 91 80 0.998 8.16 1.15 Intr + 56773 56914 142 0 1 48 115 110 0.978 8.71 1.16 Intr + 57665 57767 103 0 1 74 61 36 0.733 -2.19 1.17 Intr + 59015 59116 102 2 0 54 93 51 0.455 0.57 1.18 Intr + 61869 62052 184 0 1 8 103 106 0.673 2.97 1.19 Term + 63858 63977 120 2 0 71 44 74 0.752 -1.21 1.20 PlyA + 64494 64499 6 1.05 2.17 PlyA - 64807 64802 6 1.05 2.16 Term - 74495 74340 156 2 0 68 42 108 0.357 1.15 2.15 Intr - 75237 75083 155 1 2 25 95 70 0.530 0.27 2.14 Intr - 77245 77025 221 0 2 80 111 120 0.985 10.52 2.13 Intr - 79976 79743 234 1 0 76 49 155 0.161 6.28 2.12 Intr - 84136 84042 95 1 2 14 99 78 0.018 -0.66 2.11 Intr - 91981 91837 145 0 1 44 98 76 0.078 3.56 2.10 Intr - 92501 92384 118 0 1 39 55 66 0.034 -3.00 2.09 Intr - 101407 101216 192 2 0 51 116 190 0.995 16.64 2.08 Intr - 105687 105598 90 1 0 102 36 59 0.631 1.15 2.07 Intr - 105998 105848 151 2 1 40 88 96 0.957 3.61 2.06 Intr - 106818 106711 108 0 0 42 101 53 0.715 1.56 2.05 Intr - 109739 109660 80 0 2 106 90 30 0.913 3.35 2.04 Intr - 121869 121692 178 0 1 50 96 124 0.281 7.97 2.03 Intr - 134004 133874 131 1 2 64 90 80 0.029 5.29 2.02 Intr - 153595 153311 285 2 0 96 95 342 0.987 32.09 2.01 Init - 158704 158611 94 1 1 84 81 25 0.375 1.99 2.00 Prom - 164287 164248 40 -4.65 3.08 PlyA - 164797 164792 6 1.05 3.07 Term - 179944 179516 429 1 0 75 36 216 0.221 9.42 3.06 Intr - 184480 184251 230 2 2 66 119 31 0.465 0.67 3.05 Intr - 204615 204383 233 2 2 58 62 304 0.615 21.29 3.04 Intr - 205082 204908 175 0 1 63 53 142 0.741 6.28 3.03 Intr - 205652 205561 92 1 2 82 81 6 0.150 -1.98 3.02 Intr - 217526 217395 132 1 0 83 103 16 0.221 1.44 3.01 Init - 219726 219635 92 0 2 76 87 24 0.261 1.11 3.00 Prom - 222671 222632 40 -3.25 4.00 Prom + 228488 228527 40 -6.35 4.01 Sngl + 228796 229428 633 0 0 72 47 536 0.991 43.83 4.02 PlyA + 230509 230514 6 1.05 5.00 Prom + 235008 235047 40 -6.05 5.01 Init + 240773 240789 17 1 2 41 99 33 0.021 -1.14 5.02 Intr + 252096 252254 159 0 0 104 91 65 0.077 6.58 5.03 Term + 253221 253380 160 0 1 69 38 157 0.039 5.23 5.04 PlyA + 253397 253402 6 -1.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 79960 79743 218 1 2 90 49 121 0.819 6.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:176115603_176369108|GENSCAN_predicted_peptide_1|899_aa LDHRYNEFKLHAIMSEHKKTITAISWCPHNPDLFASGSTDNLVIIWNVAEQKVIAKLDST KGNKNQKHVLRPESLEGTDEEDPVTALEWDPLSTDYLLVVNLHYGIRLVDSESLSCITTF NLPSAAASVQCLAWVPSAPGMFITGVSVQSPTKNHYTSSTSEAVPPPTLTQNQAFSLPPG HAVCCFLDGGVGLYDMGAKKWDFLRDLGHVETIFDCKFKPDDPNLLATASFDGTIKVWDI NTLTAVYTSPGNEGVIYSLSWAPGGLNCIAGGTSRNGAFIWNVQKGKIIQRFNENSQMLR CSLESLRHSFAVGPIKAFLGFDQKTCIGTSKPNYLTCHERKDTCSSGPYNNNYAIEPGTP PLLCGKVSRDIRQEIEKLTANSQVKKLRWFSECLSPPGGSDNLWNLVAVIKGQDDSLLPQ NYCKGIMHLKHLIKFRTSEAQELTTVKMSKFGGGIGVPAKEERLKEAAEIHLRLGQIQRY CELMVELGEWDKALSIAPGVSVKYWKKLMQRRADQLIQEDKDDVIPYCIAIGDVKKLVHF FMSRGQLKEALLVAQAACEGNMQPLHVSVPKGASYSDDIYKEDFNDFYATEEIIFLLSKL AMAYLIRGNELELAVCVGTVLGESAAPATHYALELLARKCMMISVWNLAADLLLMIPDNE LHLIKLCAFYPGCTEEINDLHDKCKLPTVEECMQLAETARADDNIFETVKYYLLSQEPEK ALPIGISFVKEYISSSDWTLDTIYPVLDLLSYIRTEKLLLHTCTEARNELLILCGYIGAL LAIRRQYQSIVPALYEYTRSLEDSPYTPPSDSQRMIYATLLKRLKEESLKGIIGPDYVTG SNLPSHSDIHISCLTGLKIQGPVFFLEDGKSAISLNDALMWAKVNPFSPLGTGIRLNPF >gi568815594r:176115603_176369108|GENSCAN_predicted_CDS_1|2700_bp ttggatcaccgttataatgaattcaaacttcacgcaattatgtctgaacataaaaaaaca atcacagcaatttcttggtgtccacataatcctgatctgtttgcaagtggcagtactgat aatttagtgatcatttggaatgttgcagaacaaaaagtcattgctaaactcgacagtaca aaaggtaataaaaatcagaaacatgttttgagaccagaatctcttgaagggacagatgaa gaggatccagttacggccttggaatgggacccactatctactgattatcttctagtggtt aatttgcattatggaattcgcctggtagattctgaatcactttcttgcataacaacattt aatcttcccagtgcagcagcttctgtacagtgcttagcctgggttcccagtgctcctggg atgtttataactggagtttcagtccaatctccaaccaaaaatcattatacatcctcaaca agcgaagcagttccacccccaactttaacacagaatcaagcattttctcttcctcctggt catgcagtgtgttgtttcttggatggtggagttggactttatgatatgggagctaagaag tgggattttcttagagacttgggacatgtggaaactatctttgactgcaaattcaaacct gacgatcctaatcttttagcaacagcttcatttgatggcactataaaagtctgggatata aacacattaacagcagtgtacacatccccgggtaatgaaggtgttatttattccctttct tgggctccaggtggtttaaattgtattgctgggggaacttcccgaaatggtgcttttatt tggaatgttcaaaagggcaaaattatacaacgatttaatgagaactcacagatgctgcgc tgcagcctggagagcctcagacactcatttgcagtggggcctataaaagcctttcttggg tttgaccaaaaaacatgcattgggacttcaaagcctaactacctaacatgtcatgaaagg aaagacacctgttctagtggcccctacaataacaattatgctatagaaccaggcactcct cctctactgtgtggtaaagtgtcaagagatattagacaggaaatagaaaaactaactgct aattctcaagtgaaaaaactaagatggttctcagaatgtttatctcctccaggtggcagt gacaatttatggaacttggttgctgtgataaaaggacaggatgatagcttacttcctcag aactactgcaaaggaataatgcacttgaaacatctgattaaatttagaacatctgaagct caagaactaacaacagtcaagatgtctaaatttggtggtggtattggtgtacctgctaaa gaggaaagactgaaggaagctgctgaaatccacttgagattaggacaaattcagagatac tgtgaacttatggttgaacttggagagtgggacaaagccctgtcaattgcaccaggagtc tctgtgaaatactggaagaagttaatgcagaggagagctgaccaattaatccaggaagat aaggatgatgtcattccatactgcatagccattggtgatgtgaaaaagctagtccatttt ttcatgtcaagaggtcagcttaaagaagctctgcttgttgcacaggctgcttgtgaagga aatatgcagcccttacatgtttccgtgcctaaaggagcttcatattctgatgatatctac aaggaagactttaatgacttctatgcaactgaagaaattattttcttattgagcaagctt gctatggcatacctgattcgcggaaatgaactggagttggcagtctgtgtgggcacagta ctaggagagtctgcagcaccagcaacccactatgccttagaattactggcgagaaagtgc atgatgatttcagtatggaatttggcagctgatcttcttctgatgattcctgataatgaa ctacatttaataaaactctgtgctttctacccaggatgtactgaagagataaatgacctt catgataagtgtaagctacccacagtggaagaatgtatgcagttagctgagacagcccgt gcagatgacaatatatttgaaactgtaaaatattacttgttaagtcaagaacctgaaaaa gcccttcctattggtattagctttgttaaagaatacatcagtagctcagactggactttg gataccatataccctgttcttgacctactgagctatattcgtactgaaaaattactcttg catacgtgtactgaagctcgaaatgagttgctgatattatgtggttacattggtgcatta ctggctatcagaagacagtaccaaagcattgttccagcactttatgagtacacaagatca ttagaagactctccgtatacacccccttctgattcacaaagaatgatttatgcaacttta ttaaagagactaaaagaagagtcactgaaaggaattattggaccagattatgtgactgga tcaaatcttccaagtcattctgatattcacatttcttgtcttacgggattaaaaatccag ggccctgtgtttttccttgaagacgggaaatctgctatctccttgaatgatgctttgatg tgggcaaaggtgaatccattctcacctttagggactggaatacgactcaatccattctga >gi568815594r:176115603_176369108|GENSCAN_predicted_peptide_2|810_aa MTLLLHITVKSILLSDQLLQVQLTRSRCSNEGPAVQEHPGREPGTFETWIQLKTAADSSA AMSVLEENRPFAQQLSNVYFTILSLFCFKLFVKISLAILSHFYIVKGNRKEAARIAAEFY GVTQGQAPMGEELKSLITLSGRADIQQICSPNQGEQRVPLRLQVGIKNSRSICTYDTWLS QTKNVNQTMSADIAVEMNCIKQVLTKRRLVSIEECSSKRKASWGILTSQGSWADRSPLHE AASQGRLLALRTLLSQGYNVNAVTLDHVTPLHEACLGDHVACARTLLEAGANVNAITIDG VTPLFNACSQGSPSCAELLLEYGAKAQLESCLPSPTHEAASKGHHECLDILISWGIDVDQ EIPHLGTPLYVACADVQKGKYWDTPLHAAAQQSSTEIVNLLLEFGADINAKNTELLRPID VATSSSMVERILLQHEGRELAAMPLQLMQIRVTLTANTHYLYSSRDKAIKSLNLVRNCSR ENFREYKCPTHTEENIQWNCCPKWIHAGCMMKGCDWLHIRSNHKNITLVHYIDDILLSRP HEQEEASSLDTLERCSNPESVMAAAGQEKGYLTQTAAALDKSPSLSPQLAAPIRGRPKKC LVYPHAPKSSRLSRSVLRWLQGLDLSFFPRNINRIKSIQDDFVNFTDYSYQMRLPLVSRS TVSKSIKDNIRLSELLSNPNMLTNELKAEFLILLHMLQRKLGRKLNPGRPDLITQTLRSR DLSSAGIKTHVGEGEVRDLECDRVSTCYCYKGQHGKHEKIIDIVKKSTVKIPVLFCSILI TGARSPQSRTRCLLNQSGPSHVHPLRVVSP >gi568815594r:176115603_176369108|GENSCAN_predicted_CDS_2|2433_bp atgactttattacttcatattactgtcaaatctatattactgtcagatcagctacttcag gtgcaactaactagatccaggtgctccaatgaaggaccagctgttcaggagcatcccgga cgagagccaggcacatttgagacttggatccaactaaagaccgccgcagattcttctgca gcaatgtcggtgttagaagaaaatcggccgtttgctcaacaattatccaatgtctacttt acaatactttcgctgttctgttttaagctttttgtgaaaatcagccttgccatcctcagt catttctacatagtgaaaggcaaccgcaaggaagcggcaaggatagcagctgaattttat ggagtaacccaaggacaagcaccaatgggagaagaactgaagtctcttattaccctcagt ggtagagcagacattcagcagatctgctctcccaatcagggagagcagagagtcccacta agactgcaggttggaataaagaacagcaggagcatttgcacttatgacacctggttgtca cagacaaaaaacgtcaaccaaactatgtccgcagatattgctgttgaaatgaactgcatt aagcaggtcctcacgaaaagaaggctagtgagtatagaggaatgttcttctaaacgaaaa gctagctggggtattctgaccagtcaaggttcctgggcagatcgatcaccactacatgaa gcagcaagtcaaggtcgccttcttgctctgagaacattattatcacagggttataatgta aatgcagtaaccttagaccatgtcaccccattgcacgaagcctgccttggagatcacgtg gcatgtgccagaactctgctggaagcaggagctaatgtaaatgcaatcacgatagatggc gtgactccgttattcaacgcatgctcccaaggcagtccaagctgtgcagagctgcttctg gagtatggtgccaaagcccagctggagtcatgtcttccatccccaacgcatgaggccgcc agtaaaggtcaccatgaatgtcttgacatcctgatatcctggggcatagatgttgaccaa gaaattcctcatttgggaactcctctctatgtagcttgtgctgacgtacagaaaggcaaa tattgggatactccattacatgctgctgctcaacaatccagcacagaaattgtaaactta ctgctagaatttggagcagatatcaatgccaaaaatacagagcttctgcgacctatagat gtagctacgtctagcagtatggtggaaaggatattgcttcaacatgaaggaagggagctg gcagctatgcctctgcagttgatgcaaataagggtgactcttactgccaacactcattat ctttatagctccagagataaggcaattaaaagcttgaatctggtgaggaattgctcaaga gagaactttagagaatataaatgccctacacatacagaagaaaacatccagtggaattgc tgtcctaagtggattcatgcaggctgcatgatgaagggctgtgactggctgcacattcgt tccaatcacaagaacatcacactagtccactatattgatgatatcctgctgagtcgacct catgaacaggaagaagcaagttccttagatactttagaaagatgcagcaacccagaaagc gtcatggctgccgccggccaggaaaaagggtatttgacacagactgcggcagccctagac aagtcaccgtcactttcgccacagctagcagctcccatccgagggaggcctaagaagtgt ctggtctatccgcatgcgccgaagagctcccgcttgtctcgttccgttctgcgttggctt cagggtctggatctcagcttcttccccaggaacatcaacagaattaaaagtatccaggat gactttgtgaatttcacggactatagctaccagatgcgtttacccctggtttccaggtct acagtttcgaagtctattaaagataacattaggttatcagaattactaagcaatcccaac atgctgaccaatgaacttaaagcggagttcctcatccttttacatatgttgcaaagaaaa ttaggcagaaaattgaatccaggcagacctgatctaatcacacaaacccttagaagcaga gatctttcttcagctggaatcaagacacatgtaggagaaggagaagtaagagatctggag tgtgatagggtctcaacctgctattgctacaagggacaacatggaaagcatgagaagatt atagacattgtaaagaaaagcactgtgaagatccctgtcctgttctgttccattctaatt accggtgcacgcagcccccagtcacgtacccgctgcttgctcaatcaatcaggaccctct cacgtgcacccccttagagttgtgagcccttaa >gi568815594r:176115603_176369108|GENSCAN_predicted_peptide_3|460_aa MAVKLIGCNRGSKLFKQGKKMANTKLKTAERCLFPTILNRYTSGMCINYAHEKGTLQSSF KLEVYLDIKVYLLKWADNGSADVHPRSIRTISNKHQYSNLKKKKKEPPGNPTEARPRARR QRLEGLACGPRAAPDAREAPAQAGGGAGSAVVLLPHWPGVGGGNMIRETCSRTGTLLSLK AVVMKQPKVSAAITLSEKANSEFARDSTVFIAIVSHESGLHTHTQAPVPAADRLPAIRAV CARLHPGEINSLVAHTKPVWWSLHMDAREIWCHDSDRGTSLGRSIPCPPALCCMRKIHLR PQVLRPTSPKNISPILNPFCLAQGVFFLCVSSTYMCLPANWTGTCTLVFLTPKIQFANRT EELPVPLMTPTRQKRVIPLIPLLVSLRLSAFTIALGTGIAGISTSVTTFRSLSNDFSASI TDVSQTLSVLQAQVDSLAAVVLQNRRGLDLLLKKEDSVYS >gi568815594r:176115603_176369108|GENSCAN_predicted_CDS_3|1383_bp atggcagtaaagctgataggatgcaacagaggtagtaaacttttcaagcaaggcaagaaa atggctaacacaaaactgaagaccgcagaaaggtgcctatttccaactatactcaacaga tacacaagtggaatgtgtataaattatgcacatgaaaagggaacacttcaaagctccttc aagcttgaggtttatttggatataaaggtatacttattaaagtgggctgacaatggcagt gctgatgtccaccccaggtctattcgaacaatatctaacaaacatcagtacagtaattta aaaaaaaaaaaaaaagagccgccaggaaaccccacggaagcccgacccagagcgcggcgg cagcgcctggagggcctcgcatgcggaccacgagcggcaccggacgcgcgggaggcgcca gcccaggcgggcggcggcgccggctctgctgttgtgttgctccctcattggccaggcgtg ggtggagggaacatgatccgcgagacgtgcagccgcaccgggacgctcctgtctttgaag gcggtggtgatgaagcagccgaaggtgagcgccgccatcacgctcagcgagaaggcgaac agtgagttcgcccgcgacagcaccgtgttcatcgcgatcgtctcccacgagtccggactc cacacgcacacccaggctcccgttccggcggcggaccggctccctgcgatccgcgccgtc tgcgcccgcctgcacccaggtgaaataaacagccttgttgctcacacaaagcctgtttgg tggtctcttcacatggacgcacgtgaaatttggtgccatgactcagatcgggggacctcc cttgggagatcaatcccctgtcctcctgctctttgctgcatgagaaagatccacctacga cctcaggtcctcagaccgaccagcccaaagaacatctcaccaattttaaatccgttttgc ctcgcacaaggtgtcttcttcctctgtgtatcctctacctacatgtgtctacctgctaat tggacaggcacatgcacactagttttccttactcccaaaattcaatttgcaaataggacc gaagagctccctgttcccctcatgacaccgacacgacaaaaaagagttattccactaatt cccttgcttgtcagtttaagactttctgccttcactattgctctcggtactggaatagca ggcatttcaacctctgtcacgaccttccgtagcctctctaatgacttctctgctagcatc acagacgtatcacaaactttatcagtcctccaggcccaagttgactctttagctgcagtt gtcctccaaaaccgccgaggccttgacttactgctgaaaaaggaggactctgtatattct taa >gi568815594r:176115603_176369108|GENSCAN_predicted_peptide_4|210_aa MGQVWALVRSTLEPFHTDDEEKGEYNEVTEEVTEQVCLPAKAKVAKEGEVYPYPSAPPPY FEEKEWPDPPDLSFLEDAGQKVIAPVTVQAAPQAIALSSIQAGIQQARREGDLEAWQFPI RIHPPDQQGNIIATFEPFPFKLLKESKQAINQYGQGSPFVMELLKNVAVSSQMIPTDWDA LAQACLTLTQFLQFKTLWADEVSIQAACNA >gi568815594r:176115603_176369108|GENSCAN_predicted_CDS_4|633_bp atgggacaagtgtgggctctggttcgttccaccttggaaccttttcacactgatgatgag gagaaaggagagtataatgaagtaacagaagaagtaacagagcaagtttgtttgcctgct aaagctaaagtggcaaaggagggagaggtttatccctatccttctgcaccccctccttat tttgaagaaaaagagtggcctgaccctccagatctttcttttctggaggatgctgggcaa aaagtgattgccccagtgactgttcaagcagcgcctcaagcgatcgctctcagttctatt caggcaggaattcagcaagctagaagagagggtgatttagaggcttggcagttccctatt agaatacaccccccagatcaacagggaaatattatagctacatttgagccttttcctttt aaattactcaaagaatctaaacaagctattaatcaatatggacaaggttctccttttgta atggaactgttaaagaatgttgctgtttccagtcagatgattcctactgactgggatgct cttgctcaagcttgtctaactcttactcagttcttacaatttaaaaccttgtgggcagat gaagtttccattcaggctgcttgtaatgcctag >gi568815594r:176115603_176369108|GENSCAN_predicted_peptide_5|111_aa MLAVVRGCDCSKCLVESFCFASTALRISVGRFYFALCSLTDKPVANTYGWELAVANVARP AMEESTIPVPTAEVLSTILAVEAPTLLQSMTSKMWPETEMPAWPQCQVAKE >gi568815594r:176115603_176369108|GENSCAN_predicted_CDS_5|336_bp atgctggcagttgtcagggggtgtgactgtagcaaatgtttggtggaatctttttgcttt gcttccacagctttacgcatttcagttgggaggttttattttgcgctgtgcagtttgact gacaagccagtagctaacacttatgggtgggagctggcagtggccaatgtggccaggcct gcgatggaggagagcacaattccagtgcctactgctgaggtgctttctacgattctggct gtggaggcccctaccctgctccagagcatgacctccaaaatgtggcctgagactgaaatg cctgcatggccacagtgccaggtggccaaagaatga