GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:19:24 Sequence gi568815582f:72913_176716 : 103804 bp : 53.73% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4135 4288 154 0 1 68 90 113 0.941 7.64 1.02 Intr + 5147 5421 275 0 2 115 99 85 0.702 10.10 1.03 Intr + 6513 6788 276 2 0 87 95 117 0.997 10.55 1.04 Term + 8693 9355 663 1 0 96 43 471 0.892 37.70 1.05 PlyA + 9468 9473 6 -5.12 2.00 Prom + 9507 9546 40 -14.12 2.01 Init + 9636 9704 69 2 0 42 70 58 0.322 0.40 2.02 Intr + 10140 10344 205 2 1 64 84 314 0.980 27.90 2.03 Term + 12489 12865 377 1 2 81 53 482 0.989 39.26 2.04 PlyA + 12911 12916 6 1.05 3.24 PlyA - 12925 12920 6 1.05 3.23 Term - 13958 13793 166 1 1 98 52 354 0.999 30.50 3.22 Intr - 15978 15786 193 1 1 92 94 256 0.997 25.67 3.21 Intr - 16990 16801 190 1 1 82 21 221 0.943 14.48 3.20 Intr - 17175 17104 72 0 0 98 38 43 0.567 0.40 3.19 Intr - 19813 19684 130 0 1 101 105 117 0.968 15.90 3.18 Intr - 20413 20307 107 1 2 106 65 232 0.968 22.21 3.17 Intr - 22204 21915 290 2 2 28 9 163 0.590 -0.09 3.16 Intr - 23101 23045 57 2 0 117 68 96 0.931 9.85 3.15 Intr - 24298 24178 121 1 1 3 54 70 0.307 -4.33 3.14 Intr - 25389 25233 157 2 1 38 80 237 0.834 18.43 3.13 Intr - 25530 25405 126 2 0 31 60 107 0.697 2.50 3.12 Intr - 25634 25578 57 1 0 85 21 86 0.331 0.09 3.11 Intr - 27597 27460 138 2 0 43 131 172 0.850 17.09 3.10 Intr - 37694 37613 82 0 1 71 110 76 0.264 7.29 3.09 Intr - 39863 39710 154 2 1 106 101 262 0.837 29.46 3.08 Intr - 44463 44389 75 0 0 63 96 21 0.104 0.61 3.07 Intr - 45934 45812 123 1 0 76 79 18 0.076 0.99 3.06 Intr - 46343 46214 130 1 1 68 80 84 0.118 6.80 3.05 Intr - 57679 57610 70 2 1 78 103 112 0.135 10.43 3.04 Intr - 65422 65238 185 0 2 117 108 92 0.777 14.05 3.03 Intr - 65932 65734 199 2 1 53 44 143 0.857 5.33 3.02 Intr - 67524 67403 122 2 2 65 48 57 0.866 0.04 3.01 Init - 69881 69799 83 2 2 107 89 55 0.873 6.04 3.00 Prom - 72448 72409 40 -2.71 4.00 Prom + 78022 78061 40 -6.40 4.01 Init + 79998 80092 95 2 2 95 105 197 0.987 21.91 4.02 Intr + 80980 81184 205 1 1 79 91 535 0.832 52.63 4.03 Term + 81360 81488 129 2 0 93 53 199 0.993 15.39 4.04 PlyA + 81574 81579 6 1.05 5.00 Prom + 84850 84889 40 -4.81 5.01 Init + 90249 90304 56 2 2 44 105 143 0.970 12.30 5.02 Intr + 91570 91774 205 1 1 79 91 535 0.832 52.63 5.03 Term + 92116 92244 129 0 0 77 53 193 0.999 13.19 5.04 PlyA + 92324 92329 6 1.05 6.00 Prom + 93027 93066 40 -5.21 6.01 Init + 93086 93177 92 1 2 94 67 117 0.751 10.20 6.02 Intr + 93356 93560 205 2 1 60 93 487 0.811 46.13 6.03 Term + 93668 93796 129 1 0 106 52 234 0.991 20.09 6.04 PlyA + 93832 93837 6 1.05 7.00 Prom + 97882 97921 40 -7.20 7.01 Init + 100001 100095 95 1 2 107 78 136 0.957 14.31 7.02 Intr + 100213 100417 205 1 1 143 105 560 0.996 62.93 7.03 Term + 100560 100688 129 2 0 104 43 154 0.607 10.99 7.04 PlyA + 100777 100782 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:72913_176716|GENSCAN_predicted_peptide_1|455_aa MACASVLPSVSHLTRPGAQLLFKAFSGAPFQDQGHRSQSSGKAPRGFLCGRGTASGPGIG PDLAVGAGKRIHLHVARPAPGAQPVPGAHCPPSPGFRPLLRRRRSAPVLGVLPWSAAAGL RAGVRVPRSPGPSRRMPARSGAQFCRRMGQKKQRPARAGQPHSSSDAAQAPAEQPHSSSD AAQAPCPRERCLGPPTTPGPYRSIYFSSPKGHLTRLGLEFFDQPAVPLARAFLGQEGLLN APHTDPFFPPPYLREGLLNAPHTDPFFPPPYLREGLLNAPHTDPFFPPPYLREALLNAPG ADPFFPPPYLREGLLNAPGADPFFPPPYLREGLLNAPRADPFFPPPYLQEALLNAPRADP FFPPPYLREALLNAPHAGPFFPPPYLREGLLNAPHAGPFFPPPYLREALLNAPHAGPFFP PPYLREALLNAPRADPFFPPPYLREALLNAPPASI >gi568815582f:72913_176716|GENSCAN_predicted_CDS_1|1368_bp atggcctgtgcgtctgtcctgcccagcgtctcccacctgacgcggccaggtgcccagttg ctcttcaaggccttttcaggtgctcctttccaggaccagggtcacagaagtcaaagctca ggaaaagcccctcgagggtttttgtgcggcagaggcaccgcatcgggcccgggaatcggt ccggacctggcggtgggcgctgggaagaggatccacctccacgtggcccgccccgccccg ggggcgcagccagttcccggcgctcactgccccccttctcccggcttccgtccccttctg cgcaggcgccgctccgccccggtcctaggggtgcttccgtggtcggcggctgctgggctc cgcgccggggtccgagtcccacgaagccccggcccgagccgccggatgcccgcgcgcagc ggggcccagttttgccgacggatggggcaaaagaagcagcgaccagctagagcagggcag ccacacagctcgtccgacgcagcccaggcacctgcagagcagccacacagctcgtccgat gcagcccaggcaccttgccccagggagcgctgcttgggaccgcccaccactccgggccca taccgcagcatctatttctcaagcccaaagggccaccttacccgactggggttggagttc ttcgaccagccggcagtccccctggcccgggcatttctgggacaggaaggcctcctgaat gctccccacactgaccccttcttcccaccaccctatctccgggaaggcctcctgaatgct ccccacactgaccccttcttcccaccaccctatctccgggaaggcctcctgaatgctccc cacactgaccccttcttcccaccaccctacctccgggaagccctcctgaatgctcccggc gctgaccccttcttcccaccaccctatctccgggaaggcctcctgaatgctcccggcgct gaccccttcttcccaccaccctacctccgggaaggcctcctgaatgctccccgcgctgac cccttcttcccaccaccctacctccaggaagccctcctgaatgctccccgcgctgacccc ttcttcccaccaccctatctccgggaagccctcctgaatgctccccacgctggccccttc ttcccaccaccctacctccgggaaggcctcctgaatgctccccacgctggccccttcttc ccaccaccctacctccgggaagccctcctgaatgctccccacgctggccccttcttccca ccaccctatctccgggaagccctcctgaatgctccccgcgctgaccccttcttcccacca ccctatctccgggaagccctcctgaatgctcccccggcgagcatctaa >gi568815582f:72913_176716|GENSCAN_predicted_peptide_2|216_aa MRVPCGHLSSPGLGDFTGSGIAQVLVRRLPNGTELRGRIVETEAYLGPEDEAAHSRGGRQ TPRNRGMFMKPGTLYVYIIYGMYFCMNISSQGDGACVLLRALEPLEGLETMRQLRSTLRK GTASRVLKDRELCSGPSKLCQALAINKSFDQRDLAQDEAVWLERGPLEPSEPAVVAAARV GVGHAGEWARKPLRFYVRGSPWVSVVDRVAEQDTQA >gi568815582f:72913_176716|GENSCAN_predicted_CDS_2|651_bp atgagggtcccttgtggccacctcagcagcccaggtttaggggacttcacgggctctggg attgcccaggtcctagtccggcgacttcctaatggcacagaactccgaggccgcatcgtg gagaccgaggcatacctggggccagaggatgaagccgcccactcaaggggtggccggcag accccccgcaaccgaggcatgttcatgaagccggggaccctgtacgtgtacatcatttac ggcatgtacttctgcatgaacatctccagccagggggacggggcttgcgtcttgctgcga gcactggagcccctggaaggtctggagaccatgcgtcagcttcgcagcaccctccggaaa ggcaccgccagccgtgtcctcaaggaccgcgagctctgcagtggcccctccaagctgtgc caggccctggccatcaacaagagctttgaccagagggacctggcacaggatgaagctgta tggctggagcgtggtcccctggagcccagtgagccggctgtagtggcagcagcccgggtg ggcgtcggccatgcaggggagtgggcccggaaacccctccgcttctatgtccggggcagc ccctgggtcagtgtggtcgacagagtggctgagcaggacacacaggcctga >gi568815582f:72913_176716|GENSCAN_predicted_peptide_3|1008_aa MDMRVALGSRVLLRGQGRTQDLCGLGSWNCQPVFHSECSLLHSHQQERDSRFSTPLLTLV FLITAIVGARGAPVPQTRTVRLQTAPRNSVASRRPRACAFRCLRLRLRLRRSPGRRLSTS SGRCRRSLASPERGTPPPPPAPSSSGPLRPRPRPHGGMRDNTSPISVILVSSGSRGNKLL FRYPFQRSQEHPASQTSKPRSRYAASNTGDHADEQDGDSRFSDVILATILATKSEMCGQK FELKIDNVRFVGHPTLLQHALGQVGFSLGMHRPSGQMWLSLGRQAWSILALGGSELRLTP ALVKISKTDPSPKREAPTMILFNVVFALRANADPSVINCLHNLSRRIATVLQHEERRCQY LTREAKLILALQDEVSAMADGNEGPQSPFHHILPKCKLARDLKEAYDSLCTSGVVRLHIN SWLEVSFCLPHKIHYAASSLIPPEAIERSLKAIRCIPDAHPVQDLVQDLVHTWCIPGAYP VHTWCRTWCIPGAYLVQDLVHTRCRVPALRVFRDSPYHALLLLSDEKSLLGELPIDCSPA LVRVIKTTSAVKNLQQLAQDADLALLQAMGSAVAASHVGVAQSHGTIEILPEGGSRWCLW VAPSSGGGQQGVAAGVAVRKPLGPSPGPYQREAEEDLIKRGSDVATETEWSDVTASQETL AVEEAGRILFWSFQREPGPAGTLCLPLSTDFGFSISRMDSILERLLQGEAGGGDHYAEQS WGKVFQLAAHLVYWGKAIIIYPLCENNVYMLSPNASVCLYSPLAEQFSHQFPSHDLPSVL AKFSLPVSLSEFRNPLAPAVQEEGERALQSWLAAGGSLSEKRAFLPTQLIQMVVWMLQRR LLIQLHTYVCLMASPSEEEPRPREDDVPFTARVGGRSLSTPNALSFGSPTSSDDMTLTSP SMDNSSAELLPSGDSPLNQRMTENLLASLSEHERAAILSVPAAQNPEDLRMFARLLHYFR GRHHLEEIMYNENTRRSQLLMLFDKFRSVLVVTTHEDPVIAVFQALLP >gi568815582f:72913_176716|GENSCAN_predicted_CDS_3|3027_bp atggacatgagggtggcgctgggatccagagtgctgcttcgagggcagggacgcacccag gacctctgtgggctggggtcatggaactgccagcctgttttccatagcgagtgtagcctc ttacattcccaccagcaggagagagattcccgtttctccacacccctgctgacgcttgtc tttttgattacagccatcgtaggagcaagaggggcccctgtgccccaaacgcgaaccgta cggctccagacagcaccgcggaactcggtggcttccagaaggccccgcgcctgcgcattc cgctgcctgcgcctgcgcctgcgcctgcgccgttctcccggccgccgccttagcacctcc tccggacggtgtcgccgaagtctcgcgagcccggagcgtggcacgccccctccgcctccg gctccgtcctcctctggccccctccgcccccggccccggccccacggcgggatgcgggac aacaccagccccatcagcgtgattctggtgagctcggggagcaggggcaataagctgctg ttcaggtaccccttccagagaagccaggagcacccggcgtcccagacaagtaagccgcgt agcagatacgctgccagcaacacgggcgaccatgctgatgagcaggacggcgattccagg ttttcagatgttattctggcaacaattttggcaaccaagtctgaaatgtgtggccaaaaa tttgaactgaagattgataatgtgcgatttgttgggcacccaacactgctacagcatgct ctggggcaggtggggttctccctgggcatgcacagaccctctgggcagatgtggctctcc ctgggccggcaggcctggtccattctggctctggggggctcagaactgaggctgacacca gctctggtgaagatctccaaaacagatccttccccgaagagggaagcacctactatgatt ctttttaatgtggtgtttgcactgagggccaacgcagacccgtcagtgataaactgtctg cataacctgtcccgtcgtatcgccaccgtgctgcagcacgaggagcgccgctgccagtac ctcacccgggaggccaagctgatcctggcgctccaggatgaggtgtccgccatggctgat ggaaatgaaggtcctcagtccccattccatcacatcctgcccaagtgcaagctggccagg gacctcaaggaagcttatgacagcctgtgcacgtcgggcgtagttcggcttcacatcaac agctggctggaggtgagcttctgcctgccccacaagatccactatgcggcctccagtctg atccccccagaggccatcgaacggagcctgaaagccatccggtgcatacctgatgcacac ccggtgcaggacctcgtgcaggacctggtgcatacctggtgcatacctggtgcatacccg gtgcatacctggtgcaggacctggtgcatacccggtgcatacctggtgcaggacctggtg catacccggtgcagggttcctgcacttcgtgtgttccgtgacagcccctaccatgccctg ctgctgctcagtgatgagaagtccttgctgggtgagcttcctattgactgctcccctgcc ctagtgcgggtgatcaagaccacatctgctgtgaagaacctgcagcagctagcccaagat gcggacctggccttgctgcaggccatgggatcagctgtggctgctagtcatgtgggcgtg gcccagagccacgggaccattgaaatactgcctgagggtgggagcaggtggtgcctgtgg gtggcgccctcgtctggtggcggtcagcagggcgttgctgctggtgtggcggtgcggaag cccctggggcccagccctggcccttatcagagggaggcagaggaagatctgataaaaaga ggcagcgatgtggccacagagacagagtggagtgacgtgaccgcaagccaagagacactg gccgttgaagaggcaggacgaattctcttctggagcttccagagggagcctggtcctgct ggcaccttatgcttgcccttaagtactgattttggattttcaatttccagaatggacagc atcttggagaggctgctgcagggagaggcaggaggtggagaccactatgctgagcagagc tgggggaaagttttccagcttgcagctcatctggtgtactggggcaaggccatcatcatc tacccgctgtgtgagaacaacgtctacatgctgtctcccaatgccagcgtatgtctgtac tccccgctggccgagcagttctcccaccagttcccatctcatgacctgccgtccgttctt gccaagttctccttgccggtctccttgtcagaatttaggaatcccctggcccccgctgtg caggaggagggtgagcgggccttgcagagctggctggcagctgggggctccctttctgag aagagggcctttctgcctacccagctcatccagatggtggtgtggatgctgcagcgccgg cttctcatccagctgcacacctatgtctgcctgatggcctcacccagcgaggaggagccc cgtccgcgagaggacgacgtccccttcactgcccgggtcggcggtcgcagcctcagcacg cccaacgccctcagctttggctccccaaccagcagcgatgacatgaccctcaccagcccc agcatggacaactccagcgcagagctacttcccagcggggactcgccactgaaccagagg atgacggagaacctgctggccagcctgtcggagcatgaacgcgcagccatcctcagtgta cccgcagcccagaaccctgaggacctccgcatgtttgccaggctccttcactacttccgc ggccgccaccacctggaggagattatgtacaacgagaacacgcggcgctcccagctgctc atgctgtttgacaagttccgcagcgtgctggtggtgaccacccacgaggaccctgtcatt gccgtcttccaggctctgctcccctga >gi568815582f:72913_176716|GENSCAN_predicted_peptide_4|142_aa MSLTKTERTIIVSMWAKISTQADTIGTETLERLFLSHPQTKTYFPHFDLHPGSAQLRAHG SKVVAAVGDAVKSIDDIGGALSKLSELHAYILRVDPVNFKLLSHCLLVTLAARFPADFTA EAHAAWDKFLSVVSSVLTEKYR >gi568815582f:72913_176716|GENSCAN_predicted_CDS_4|429_bp atgtctctgaccaagactgagaggaccatcattgtgtccatgtgggccaagatctccacg caggccgacaccatcggcaccgagactctggagaggctcttcctcagccacccgcagacc aagacctacttcccgcacttcgacctgcacccggggtccgcgcagttgcgcgcgcacggc tccaaggtggtggccgccgtgggcgacgcggtgaagagcatcgacgacatcggcggcgcc ctgtccaagctgagcgagctgcacgcctacatcctgcgcgtggacccggtcaacttcaag ctcctgtcccactgcctgctggtcaccctggccgcgcgcttccccgccgacttcacggcc gaggcccacgccgcctgggacaagttcctatcggtcgtatcctctgtcctgaccgagaag taccgctga >gi568815582f:72913_176716|GENSCAN_predicted_peptide_5|129_aa MWAKISTQADTIGTETLERLFLSHPQTKTYFPHFDLHPGSAQLRAHGSKVVAAVGDAVKS IDDIGGALSKLSELHAYILRVDPVNFKLLSHCLLVTLAARFPADFTAEAHAAWAKFLSVV SSVLTEKYR >gi568815582f:72913_176716|GENSCAN_predicted_CDS_5|390_bp atgtgggccaagatctccacgcaggccgacaccatcggcaccgagactctggagaggctc ttcctcagccacccgcagaccaagacctacttcccgcacttcgacctgcacccggggtcc gcgcagttgcgcgcgcacggctccaaggtggtggccgccgtgggcgacgcggtgaagagc atcgacgacatcggcggcgccctgtccaagctgagcgagctgcacgcctacatcctgcgc gtggacccggtcaacttcaagctcctgtcccactgcctgctggtcaccctggccgcgcgc ttccccgccgacttcacggccgaggcccacgccgcctgggccaagttcctatcggtcgta tcctctgtcctgaccgagaagtaccgctga >gi568815582f:72913_176716|GENSCAN_predicted_peptide_6|141_aa MLSAQERAQIAQVWDLIAGHEAQFGAELLLRLFTVYPSTKVYFPHLSACQDATQLLSHGQ RMLAAVGAAVQHVDNLRAALSPLADLHALVLRVDPANFPLLIQCFHVVLASHLQDEFTVQ MQAAWDKFLTGVAVVLTEKYR >gi568815582f:72913_176716|GENSCAN_predicted_CDS_6|426_bp atgctcagcgcccaggagcgcgcccaaatcgcgcaggtctgggacctgattgcgggccac gaggcgcaattcggggcggagctgctgctcaggctcttcacggtgtaccccagcaccaag gtctacttcccgcacctgagcgcctgccaggacgcgacgcagctgctgagccacgggcag cgcatgctggcggctgtgggcgcggcggtgcagcacgtggacaacctgcgcgccgcgctg agcccgctggcggacctgcacgcgctcgtgctgcgcgtggacccagccaactttccgctg ctaatccagtgtttccacgtcgtgctggcctcccacctgcaggacgagttcaccgtgcaa atgcaagcggcgtgggacaagttcctgactggtgtggccgtggtgctgaccgaaaaatac cgctga >gi568815582f:72913_176716|GENSCAN_predicted_peptide_7|142_aa MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP AVHASLDKFLASVSTVLTSKYR >gi568815582f:72913_176716|GENSCAN_predicted_CDS_7|429_bp atggtgctgtctcctgccgacaagaccaacgtcaaggccgcctggggtaaggtcggcgcg cacgctggcgagtatggtgcggaggccctggagaggatgttcctgtccttccccaccacc aagacctacttcccgcacttcgacctgagccacggctctgcccaggttaagggccacggc aagaaggtggccgacgcgctgaccaacgccgtggcgcacgtggacgacatgcccaacgcg ctgtccgccctgagcgacctgcacgcgcacaagcttcgggtggacccggtcaacttcaag ctcctaagccactgcctgctggtgaccctggccgcccacctccccgccgagttcacccct gcggtgcacgcctccctggacaagttcctggcttctgtgagcaccgtgctgacctccaaa taccgttaa