GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:06:52 Sequence gi568815582f:27240204_27463827 : 223624 bp : 46.75% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 2186 2181 6 1.05 1.04 Term - 4735 4656 80 2 2 51 42 114 0.060 1.03 1.03 Intr - 17361 17232 130 0 1 68 84 226 0.022 20.47 1.02 Intr - 28112 27960 153 2 0 -5 98 128 0.458 4.77 1.01 Init - 39999 39982 18 0 0 86 86 24 0.305 0.96 1.00 Prom - 42669 42630 40 -4.36 2.00 Prom + 42703 42742 40 -1.96 2.01 Init + 47185 47301 117 0 0 103 94 0 0.181 2.30 2.02 Term + 50042 50143 102 1 0 106 47 68 0.487 2.78 2.03 PlyA + 50264 50269 6 1.05 3.03 PlyA - 51612 51607 6 1.05 3.02 Term - 64157 64053 105 2 0 88 43 105 0.567 4.41 3.01 Init - 66524 65742 783 2 0 53 14 1069 0.142 88.70 3.00 Prom - 67684 67645 40 -2.96 4.02 PlyA - 68869 68864 6 1.05 4.01 Sngl - 84647 84270 378 2 0 103 42 127 0.918 5.83 4.00 Prom - 88517 88478 40 -4.46 5.00 Prom + 95678 95717 40 -4.36 5.01 Init + 100001 100070 70 1 1 56 119 100 0.857 9.11 5.02 Intr + 101918 102056 139 0 1 102 65 157 0.910 14.32 5.03 Intr + 104666 104817 152 2 2 96 89 208 0.997 21.51 5.04 Intr + 106264 106415 152 2 2 76 76 170 0.916 14.48 5.05 Intr + 112337 112423 87 1 0 98 76 44 0.566 4.37 5.06 Intr + 112595 112650 56 1 2 80 96 37 0.582 1.48 5.07 Intr + 114817 114986 170 1 2 42 79 53 0.455 -0.61 5.08 Intr + 115605 115704 100 1 1 99 89 144 0.975 14.77 5.09 Intr + 118713 118791 79 0 1 120 91 39 0.737 6.95 5.10 Intr + 120563 120612 50 1 2 96 52 31 0.474 -2.22 5.11 Term + 122049 123627 1579 0 1 102 43 894 0.806 76.08 5.12 PlyA + 124347 124352 6 1.05 6.00 Prom + 140558 140597 40 -5.86 6.01 Init + 142872 142919 48 2 0 111 92 38 0.853 5.77 6.02 Intr + 148581 148737 157 2 1 57 65 99 0.576 4.08 6.03 Term + 148853 148884 32 0 2 128 35 -3 0.319 -3.58 6.04 PlyA + 149186 149191 6 1.05 7.04 PlyA - 149837 149832 6 1.05 7.03 Term - 162391 162006 386 2 2 86 54 127 0.502 4.15 7.02 Intr - 164614 164591 24 2 0 96 72 33 0.337 0.50 7.01 Init - 168599 168581 19 2 1 89 98 18 0.598 3.23 7.00 Prom - 170270 170231 40 -3.66 8.00 Prom + 172688 172727 40 -3.96 8.01 Init + 189869 189917 49 1 1 94 107 139 0.998 15.51 8.02 Intr + 194066 194246 181 0 1 99 119 164 0.776 19.53 8.03 Intr + 196032 196085 54 0 0 85 103 37 0.887 2.99 8.04 Intr + 197285 197484 200 2 2 62 73 270 0.998 21.89 8.05 Intr + 202801 202913 113 2 2 80 65 130 0.612 10.00 8.06 Intr + 204339 204516 178 2 1 82 73 158 0.915 13.19 8.07 Intr + 204974 205073 100 0 1 112 78 -13 0.879 -0.73 8.08 Intr + 205804 205885 82 1 1 121 105 116 0.999 16.24 8.09 Term + 208331 209080 750 1 0 136 39 634 0.992 56.75 8.10 PlyA + 209851 209856 6 1.05 9.04 PlyA - 210101 210096 6 1.05 9.03 Term - 221359 221147 213 1 0 106 48 228 0.849 17.83 9.02 Intr - 222283 222091 193 0 1 91 64 291 0.943 26.49 9.01 Intr - 223389 223338 52 1 1 125 98 44 0.973 7.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:27240204_27463827|GENSCAN_predicted_peptide_1|126_aa MAILPKPLPTTLFVPPPSVLPSVLCTEFLTLYKSAAAYVTAPRPDAFKIEEFSEFGKGST RRMGVMTDVHRRFLQLLMTHGVLEEWDVKRLQTHCYKVHDQNSQSMVTTCTTHLPRFVDG YVLSVS >gi568815582f:27240204_27463827|GENSCAN_predicted_CDS_1|381_bp atggccatactgcccaagcctttgccgaccaccctctttgtcccgccgccctccgtgctc ccctctgttttgtgtacggagtttcttactttgtacaagtccgcagccgcttatgttaca gccccgaggccagatgcgtttaaaatcgaggagttctcggagtttggaaagggcagcaca aggagaatgggcgtcatgactgatgtccaccggcgcttcctccagttgctgatgacccat ggcgtgctagaggaatgggacgtgaagcgcttgcagacgcactgctacaaggtccatgac caaaattcacaaagcatggtcaccacttgtaccacacatcttcctcggttcgtggacgga tacgtcttgagtgtcagctga >gi568815582f:27240204_27463827|GENSCAN_predicted_peptide_2|72_aa MVDLHPKWSCFRLTVGDIQRMSQATKNRESSLEDIALLQTDCGTAHEAKHLLVDQEASPI HGLSRQPAIVLI >gi568815582f:27240204_27463827|GENSCAN_predicted_CDS_2|219_bp atggtggatctgcatccaaaatggagttgctttcgcctcacagtaggcgacatacagagg atgtcacaagcaacaaaaaacagggaaagcagcttggaggatatagctttacttcagaca gactgcggaactgcccatgaggccaagcacctattagtagaccaagaagcatcacccatc catggcctaagtcggcagccagctattgttctcatctga >gi568815582f:27240204_27463827|GENSCAN_predicted_peptide_3|295_aa MMLTVMVVVVMMMVMMVMVAVVTMLTVMMVMVAVVTMLTVMMVMVAVVTMLTVMMVMVAV VTMLTVMMVMVAVVTMLTVMMVMVAVVTMLMVMMVMVAVVMMMLMVVVVMLMVMMVVVVM MLMVMMVVVVMMLMVMMVMVGVVMMTVMMVMVAVVMMLMVMVMVAVVMMMLMVVVVMMLM VMMVVVVVMLTVMMVMVGVVMMTVMMVMVVVVMVLMVMMVVMMLMVMMAMVVVIMVVVAV VMMIVMVAVMMLMVVVVVMMMPAEIPLSLKACDFLSGAFFSLPDKLLPVYPKHIG >gi568815582f:27240204_27463827|GENSCAN_predicted_CDS_3|888_bp atgatgttgacggtgatggtggtggtggtgatgatgatggtgatgatggtgatggtggcg gtggtgacgatgttgacggtgatgatggtgatggtggcggtggtgacgatgttgacggtg atgatggtgatggtggcggtggtgacgatgttgacggtgatgatggtgatggtggcggtg gtgacgatgttgacggtgatgatggtgatggtggcggtggtgacgatgttgacggtgatg atggtgatggtggcggtggtgacgatgttgatggtgatgatggtgatggtggcggtggtg atgatgatgttgatggtggtggtggtgatgttgatggtgatgatggtggtggtggtgatg atgttgatggtgatgatggtggtggtggtgatgatgttgatggtgatgatggtgatggtg ggggtggtgatgatgacggtgatgatggtgatggtggcggtggtgatgatgttgatggtg atggtgatggtggcagtggtgatgatgatgttgatggtggtggtggtgatgatgttgatg gtgatgatggtggtggtggtggtgatgttgacggtgatgatggtgatggtgggggtggtg atgatgacggtgatgatggtgatggtggtggtggtgatggtgttgatggtgatgatggtg gtgatgatgttgatggtgatgatggcgatggtcgtggtgataatggtagtggtggcagtg gtgatgatgatagtgatggtggcggtgatgatgttgatggtggtagtggtggtgatgatg atgcctgctgaaatccccctctccttgaaggcctgtgacttcctgagtggagccttcttc agccttccagacaagctccttcctgtttacccgaagcacattggatag >gi568815582f:27240204_27463827|GENSCAN_predicted_peptide_4|125_aa MESREEKQICWGRPRAPAHCGHFESTQSTGNSIRDPGSQACLQKIPEAQRRKQSLRKLCP EATYPGTAASLASIFLQPVILLYFSAIFVAYCTLGPYVFGAHIPYCKASSHKASVDPAHS HLPRA >gi568815582f:27240204_27463827|GENSCAN_predicted_CDS_4|378_bp atggagagccgagaagagaagcagatttgctggggaagaccacgcgctccggcgcactgt ggacactttgagtccacacagtcaacaggcaactccatccgggatcctggaagccaggcc tgtcttcagaagattccagaggcccaaaggaggaaacagagcctgaggaaactgtgtcca gaagccacctacccaggaacagcagccagcctggcctccatctttctgcaacctgttatc ttgctttatttctctgcaatctttgtggcatattgcacactcgggccttatgtgtttggt gctcatatcccctactgcaaggccagctcacacaaggccagtgtcgatcccgctcacagc catctccccagagcctag >gi568815582f:27240204_27463827|GENSCAN_predicted_peptide_5|877_aa MGWLCSGLLFPVSCLVLLQVASSGNMKVLQEPTCVSDYMSISTCEWKMNGPTNCSTELRL LYQLVFLLSEAHTCIPENNGGAGCVCHLLMDDVVSADNYTLDLWAGQQLLWKGSFKPSEH VKPRAPGNLTVHTNVSDTLLLTWSNPYPPDNYLYNHLTYAVNIWSENDPADFRIYNVTYL EPSLRIAASTLKSGISYRARDASNGNPAIRYTCRGVSASKYLLSSSYMASPVLDTGESAM NKADRNPHSCGADILEGETKSKHIKKERNHMDLDDTYREPFEQHLLLGVSVSCIVILAVC LLCYVSITKIKKEWWDQIPNPARSRLVAIIIQDAQGSQWEKRSRGQEPAKCPHWKNCLTK LLPCFLEHNMKRDEDPHKAAKEMPFQGSGKSAWCPVEISKTVLWPESISVVRCVELFEAP VECEEEEEVEEEKGSFCASPESSRDDFQEGREGIVARLTESLFLDLLGEENGGFCQQDMG ESCLLPPSGSTSAHMPWDEFPSAGPKEAPPWGKEQPLHLEPSPPASPTQSPDNLTCTETP LVIAGNPAYRSFSNSLSQSPCPRELGPDPLLARHLEEVEPEMPCVPQLSEPTTVPQPEPE TWEQILRRNVLQHGAAAAPVSAPTSGYQEFVHAVEQGGTQASAVVGLGPPGEAGYKAFSS LLASSAVSPEKCGFGASSGEEGYKPFQDLIPGCPGDPAPVPVPLFTFGLDREPPRSPQSS HLPSSSPEHLGLEPGEKVEDMPKPPLPQEQATDPLVDSLGSGIVYSALTCHLCGHLKQCH GQEDGGQTPVMASPCCGCCCGDRSSPPTTPLRAPDPSPGGVPLEASLCPASLAPSGISEK SKSSSSFHPAPGNAQSSSQTPKIVNFVSVGPTYMRVS >gi568815582f:27240204_27463827|GENSCAN_predicted_CDS_5|2634_bp atggggtggctttgctctgggctcctgttccctgtgagctgcctggtcctgctgcaggtg gcaagctctgggaacatgaaggtcttgcaggagcccacctgcgtctccgactacatgagc atctctacttgcgagtggaagatgaatggtcccaccaattgcagcaccgagctccgcctg ttgtaccagctggtttttctgctctccgaagcccacacgtgtatccctgagaacaacgga ggcgcggggtgcgtgtgccacctgctcatggatgacgtggtcagtgcggataactataca ctggacctgtgggctgggcagcagctgctgtggaagggctccttcaagcccagcgagcat gtgaaacccagggccccaggaaacctgacagttcacaccaatgtctccgacactctgctg ctgacctggagcaacccgtatccccctgacaattacctgtataatcatctcacctatgca gtcaacatttggagtgaaaacgacccggcagatttcagaatctataacgtgacctaccta gaaccctccctccgcatcgcagccagcaccctgaagtctgggatttcctacagggcacgg gatgcctctaatggcaatcctgccattagatacacctgccgtggtgtatctgccagcaaa tatttgctgagttcctcctacatggctagccctgtgctagacactggggaatcggcgatg aacaaagcagatagaaatccccactcttgtggagctgacattctggagggagagacaaaa agcaaacatataaagaaagaaagaaatcacatggatctggatgacacctacagggagccc ttcgagcagcacctcctgctgggcgtcagcgtttcctgcattgtcatcctggccgtctgc ctgttgtgctatgtcagcatcaccaagattaagaaagaatggtgggatcagattcccaac ccagcccgcagccgcctcgtggctataataatccaggatgctcaggggtcacagtgggag aagcggtcccgaggccaggaaccagccaagtgcccacactggaagaattgtcttaccaag ctcttgccctgttttctggagcacaacatgaaaagggatgaagatcctcacaaggctgcc aaagagatgcctttccagggctctggaaaatcagcatggtgcccagtggagatcagcaag acagtcctctggccagagagcatcagcgtggtgcgatgtgtggagttgtttgaggccccg gtggagtgtgaggaggaggaggaggtagaggaagaaaaagggagcttctgtgcatcgcct gagagcagcagggatgacttccaggagggaagggagggcattgtggcccggctaacagag agcctgttcctggacctgctcggagaggagaatgggggcttttgccagcaggacatgggg gagtcatgccttcttccaccttcgggaagtacgagtgctcacatgccctgggatgagttc ccaagtgcagggcccaaggaggcacctccctggggcaaggagcagcctctccacctggag ccaagtcctcctgccagcccgacccagagtccagacaacctgacttgcacagagacgccc ctcgtcatcgcaggcaaccctgcttaccgcagcttcagcaactccctgagccagtcaccg tgtcccagagagctgggtccagacccactgctggccagacacctggaggaagtagaaccc gagatgccctgtgtcccccagctctctgagccaaccactgtgccccaacctgagccagaa acctgggagcagatcctccgccgaaatgtcctccagcatggggcagctgcagcccccgtc tcggcccccaccagtggctatcaggagtttgtacatgcggtggagcagggtggcacccag gccagtgcggtggtgggcttgggtcccccaggagaggctggttacaaggccttctcaagc ctgcttgccagcagtgctgtgtccccagagaaatgtgggtttggggctagcagtggggaa gaggggtataagcctttccaagacctcattcctggctgccctggggaccctgccccagtc cctgtccccttgttcacctttggactggacagggagccacctcgcagtccgcagagctca catctcccaagcagctccccagagcacctgggtctggagccgggggaaaaggtagaggac atgccaaagcccccacttccccaggagcaggccacagacccccttgtggacagcctgggc agtggcattgtctactcagcccttacctgccacctgtgcggccacctgaaacagtgtcat ggccaggaggatggtggccagacccctgtcatggccagtccttgctgtggctgctgctgt ggagacaggtcctcgccccctacaacccccctgagggccccagacccctctccaggtggg gttccactggaggccagtctgtgtccggcctccctggcaccctcgggcatctcagagaag agtaaatcctcatcatccttccatcctgcccctggcaatgctcagagctcaagccagacc cccaaaatcgtgaactttgtctccgtgggacccacatacatgagggtctcttag >gi568815582f:27240204_27463827|GENSCAN_predicted_peptide_6|78_aa MGTPPHTPFSSLASLQVHAIGKLEVSESRGPGSSPQLTTNEVLEKKYPRFLVSPLGQVGD MFYIVSQRGLFPKSMTYP >gi568815582f:27240204_27463827|GENSCAN_predicted_CDS_6|237_bp atggggaccccgccccacacgccgttcagctctttagcctcgctccaggtgcatgcaatc ggcaagctggaagtgtcagagagtcgtggccctggaagcagccctcaactaacaacaaat gaagtcctggaaaagaaatatcccagatttcttgtttctcctttgggacaagttggagac atgttctatattgtctcccagagaggattatttcccaaatctatgacttatccttaa >gi568815582f:27240204_27463827|GENSCAN_predicted_peptide_7|142_aa MGLKVKDAATELQKEIHSVCWGQSDGLLGVAAGFSVGVTDEGLRKQAEAWGENPHPARVT NQAVIRPVTLMTGSWASRQVSHRAADRGKRRQGRYQAAAAGMHGPQTDRHSCVAVSHTQG EAGRGPDAELTVTQQRDASGSV >gi568815582f:27240204_27463827|GENSCAN_predicted_CDS_7|429_bp atggggctgaaagtgaaagatgcagcaactgagctgcagaaagaaatccattccgtgtgc tgggggcagtctgatgggcttctgggggtggcagctgggttcagcgtgggggtgacagat gagggtctgagaaagcaggcagaggcctggggtgagaatccacacccagcccgagtcacc aatcaggctgtcataagacctgtcactctgatgacgggcagctgggcctccaggcaggtg tcccacagagcagcagacagaggaaagagacggcaaggaaggtaccaggcggccgcagca gggatgcacgggccgcagacagacagacacagctgcgtggctgtcagccacactcaggga gaggcaggcagagggccagacgccgagcttacggtcactcagcagagagacgccagtggg tctgtctga >gi568815582f:27240204_27463827|GENSCAN_predicted_peptide_8|568_aa MPRGWAAPLLLLLLQGALEGMERKLCSPKPPPTKASLPTDPPGWGCPDLVCYTDYLQTVI CILEMWNLHPSTLTLTWILSNNTGCYIKDRTLDLRQDQYEELKDEATSCSLHRSAHNATH ATYTCHMDVFHFMADDIFSVNITDQSGNYSQECGSFLLAESRQYNISWRSDYEDPAFYML KGKLQYELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELQVRAGPMPGSSY QGTWSEWSDPVIFQTQSEELKEGWNPHLLLLLLLVIVFIPAFWSLKTHPLWRLWKKIWAV PSPERFFMPLYKGCSGDFKKWVGAPFTGSSLELGPWSPEVPSTLEVYSCHPPRSPAKRLQ LTELQEPAELVESDGVPKPSFWPTAQNSGGSAYSEERDRPYGLVSIDTVTVLDAEGPCTW PCSCEDDGYPALDLDAGLEPSPGLEDPLLDAGTTVLSCGCVSAGSPGLGGPLGSLLDRLK PPLADGEDWAGGLPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECDFTSP GDEGPPRSYLRQWVVIPPPLSSPGPQAS >gi568815582f:27240204_27463827|GENSCAN_predicted_CDS_8|1707_bp atgccgcgtggctgggccgcccccttgctcctgctgctgctccagggagccctcgagggg atggagaggaagctctgcagtcccaagccaccccccaccaaggcctctctccccactgac cctccaggctggggctgccccgacctcgtctgctacaccgattacctccagacggtcatc tgcatcctggaaatgtggaacctccaccccagcacgctcacccttacctggatactttct aataatactgggtgctatatcaaggacagaacactggacctcaggcaagaccagtatgaa gagctgaaggacgaggccacctcctgcagcctccacaggtcggcccacaatgccacgcat gccacctacacctgccacatggatgtattccacttcatggccgacgacattttcagtgtc aacatcacagaccagtctggcaactactcccaggagtgtggcagctttctcctggctgag agcagacagtataatatctcctggcgctcagattacgaagaccctgccttctacatgctg aagggcaagcttcagtatgagctgcagtacaggaaccggggagacccctgggctgtgagt ccgaggagaaagctgatctcagtggactcaagaagtgtctccctcctccccctggagttc cgcaaagactcgagctatgagctgcaggtgcgggcagggcccatgcctggctcctcctac caggggacctggagtgaatggagtgacccggtcatctttcagacccagtcagaggagtta aaggaaggctggaaccctcacctgctgcttctcctcctgcttgtcatagtcttcattcct gccttctggagcctgaagacccatccattgtggaggctatggaagaagatatgggccgtc cccagccctgagcggttcttcatgcccctgtacaagggctgcagcggagacttcaagaaa tgggtgggtgcacccttcactggctccagcctggagctgggaccctggagcccagaggtg ccctccaccctggaggtgtacagctgccacccaccacggagcccggccaagaggctgcag ctcacggagctacaagaaccagcagagctggtggagtctgacggtgtgcccaagcccagc ttctggccgacagcccagaactcggggggctcagcttacagtgaggagagggatcggcca tacggcctggtgtccattgacacagtgactgtgctagatgcagaggggccatgcacctgg ccctgcagctgtgaggatgacggctacccagccctggacctggatgctggcctggagccc agcccaggcctagaggacccactcttggatgcagggaccacagtcctgtcctgtggctgt gtctcagctggcagccctgggctaggagggcccctgggaagcctcctggacagactaaag ccaccccttgcagatggggaggactgggctgggggactgccctggggtggccggtcacct ggaggggtctcagagagtgaggcgggctcacccctggccggcctggatatggacacgttt gacagtggctttgtgggctctgactgcagcagccctgtggagtgtgacttcaccagcccc ggggacgaaggacccccccggagctacctccgccagtgggtggtcattcctccgccactt tcgagccctggaccccaggccagctaa >gi568815582f:27240204_27463827|GENSCAN_predicted_peptide_9|152_aa XFTESFGAANISQAARERDCESVCFIGRPWRVVDGHLNLPVCKGMMEAMLYHIMTRPGIP ESSLLRHYQGVLQPVAVLELLQGLESLGCIRKRWLRKPRPVSLFSTPVVEEVEVPSSLDE SPMAFYEPTLDCTLRLGRVFPHEVNWNKWIHL >gi568815582f:27240204_27463827|GENSCAN_predicted_CDS_9|459_bp nggttcacagagagtttcggagctgccaacatctcccaggcagcacgggaaagggactgt gagagtgtctgcttcatcggccggccgtggcgtgtcgtggatggccacctgaaccttcct gtatgcaagggtatgatggaggccatgctgtaccacatcatgaccaggcctggcatcccc gagagctccctgctgcgccactaccagggggtcctgcagcccgtcgccgtgctggagttg ctccagggcctggagtccctcggctgcatccggaagcgctggctgagaaagccaaggcct gtctcgctcttctctacacccgtggtggaagaggtggaagtgccctccagcctggacgag agccccatggctttctatgagcccaccttggactgtaccctccggctgggccgtgtgttc ccccacgaggtcaactggaacaagtggatccacctctag