GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:39:03 Sequence gi568815585r:98584727_98826449 : 241723 bp : 44.43% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1026 1021 6 1.05 1.03 Term - 15883 15878 6 1 0 122 48 0 0.624 -2.73 1.02 Intr - 16802 16694 109 1 1 61 69 97 0.888 5.39 1.01 Init - 19193 19024 170 2 2 64 48 97 0.912 2.31 1.00 Prom - 19326 19287 40 -4.06 2.10 PlyA - 19791 19786 6 1.05 2.09 Term - 25979 25794 186 2 0 -6 46 153 0.057 -0.81 2.08 Intr - 34140 34108 33 0 0 82 97 15 0.132 0.22 2.07 Intr - 41089 41036 54 1 0 72 55 76 0.496 1.88 2.06 Intr - 56377 56106 272 2 2 1 36 262 0.004 9.56 2.05 Intr - 65505 65343 163 0 1 10 69 109 0.012 0.75 2.04 Intr - 70571 70309 263 0 2 2 74 228 0.051 10.01 2.03 Intr - 76048 75925 124 1 1 94 45 55 0.177 1.96 2.02 Intr - 76318 76296 23 2 2 92 100 -1 0.141 -1.24 2.01 Init - 77396 77345 52 2 1 65 94 47 0.217 4.32 2.00 Prom - 79119 79080 40 -5.26 3.00 Prom + 92595 92634 40 -2.76 3.01 Init + 93415 93578 164 0 2 58 50 177 0.530 10.10 3.02 Intr + 93971 94097 127 2 1 72 89 68 0.758 5.98 3.03 Term + 97141 97152 12 0 0 105 38 6 0.438 -4.50 3.04 PlyA + 97302 97307 6 -0.45 4.21 PlyA - 99093 99088 6 1.05 4.20 Term - 100189 99998 192 1 0 39 49 215 0.997 10.12 4.19 Intr - 101571 101464 108 0 0 89 80 176 0.994 17.38 4.18 Intr - 101666 101639 28 1 1 71 97 17 0.792 -0.98 4.17 Intr - 103851 103744 108 2 0 47 103 127 0.701 9.50 4.16 Intr - 117803 117754 50 2 2 99 91 56 0.944 4.48 4.15 Intr - 119709 119563 147 0 0 116 103 68 0.991 11.53 4.14 Intr - 122633 122415 219 2 0 69 38 109 0.605 2.50 4.13 Intr - 124041 123960 82 2 1 57 76 154 0.912 10.74 4.12 Intr - 124934 124846 89 2 2 112 63 147 0.970 13.37 4.11 Intr - 125048 125016 33 2 0 116 75 44 0.951 4.32 4.10 Intr - 127217 127128 90 2 0 75 95 19 0.729 1.49 4.09 Intr - 127858 127772 87 1 0 90 95 41 0.953 5.17 4.08 Intr - 131234 131152 83 0 2 86 78 109 0.984 9.06 4.07 Intr - 134594 134511 84 0 0 84 80 46 0.894 3.19 4.06 Intr - 136859 136769 91 2 1 99 94 -13 0.892 -0.03 4.05 Intr - 137177 137078 100 1 1 76 110 76 0.989 8.71 4.04 Intr - 139305 139186 120 2 0 32 67 132 0.960 5.11 4.03 Intr - 141538 141397 142 2 1 40 89 156 0.999 10.31 4.02 Intr - 141723 141642 82 0 1 74 78 118 0.992 8.61 4.01 Init - 146661 146530 132 0 0 82 85 75 0.712 6.79 4.00 Prom - 149857 149818 40 -4.66 5.00 Prom + 151410 151449 40 -5.46 5.01 Init + 151701 151825 125 2 2 71 91 92 0.725 7.44 5.02 Intr + 157385 157446 62 2 2 75 81 42 0.738 0.58 5.03 Intr + 158731 158823 93 2 0 108 105 41 0.822 7.74 5.04 Intr + 160341 160390 50 1 2 44 84 75 0.632 1.10 5.05 Intr + 167825 168086 262 1 1 67 80 216 0.584 15.76 5.06 Term + 168730 169568 839 2 2 89 39 153 0.838 3.63 5.07 PlyA + 169921 169926 6 -0.45 6.03 PlyA - 171453 171448 6 1.05 6.02 Term - 171600 171456 145 2 1 104 44 83 0.494 2.88 6.01 Init - 171877 171867 11 1 2 74 72 11 0.310 -1.79 6.00 Prom - 172366 172327 40 -0.26 7.04 PlyA - 173773 173768 6 1.05 7.03 Term - 175123 174983 141 1 0 69 43 147 0.855 6.23 7.02 Intr - 181452 181330 123 0 0 61 18 112 0.185 2.28 7.01 Init - 186436 186374 63 1 0 48 95 55 0.153 3.35 7.00 Prom - 194570 194531 40 -5.36 8.02 PlyA - 195349 195344 6 1.05 8.01 Sngl - 196659 196480 180 0 0 72 45 203 0.882 9.40 8.00 Prom - 204775 204736 40 -4.06 9.10 PlyA - 205284 205279 6 1.05 9.09 Term - 210022 209900 123 1 0 93 38 171 0.999 10.98 9.08 Intr - 212527 212389 139 0 1 82 94 146 0.999 15.07 9.07 Intr - 212763 212663 101 0 2 76 103 53 0.999 4.51 9.06 Intr - 215752 215562 191 2 2 55 99 323 0.795 29.40 9.05 Intr - 220483 220273 211 1 1 48 80 241 0.961 17.79 9.04 Intr - 223081 222935 147 1 0 78 86 175 0.988 16.73 9.03 Intr - 224739 224626 114 0 0 39 99 226 0.999 19.44 9.02 Intr - 225565 225443 123 1 0 66 64 268 0.739 22.98 9.01 Intr - 239778 239672 107 1 2 154 92 164 0.959 23.33 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:98584727_98826449|GENSCAN_predicted_peptide_1|94_aa MNTPSAGILTPEYNPPCKESELFREMADSRAGAEKILHEPNHLVPENMEIAKNNGNRLNG TKDRSKKTNWKAMAVSIQERDNGVFDQGVIIEVP >gi568815585r:98584727_98826449|GENSCAN_predicted_CDS_1|285_bp atgaacacacctagtgccggcattcttactcccgaatacaaccctccatgtaaagaatca gagctctttagagaaatggctgattccagagctggagcagagaaaatactgcatgagccg aaccatcttgtaccagaaaatatggaaatagcaaagaataatggaaacagactcaacggg accaaggatagaagcaagaaaactaactggaaggccatggcagtctctattcaggagaga gacaatggtgtctttgaccaaggggttatcatagaggtgccctag >gi568815585r:98584727_98826449|GENSCAN_predicted_peptide_2|389_aa MLYRHQNIEPKSTGTEFRPLCSKMKLRISQLAPDQSRGCPTAIQLPLASPGNSLQQGTPE AIPHVSALEFGDYIRKSLKYPKTRVKDFLKRDLQGDKRDMAQAILHPHHFQARMLHWKPS SSSTPANRPHCNDPKDHLTQTWMRKVDPGNPGLGTLRLCQGKMSSDIDKYPFCGAKPPMD ENYGYQAALHFQSNKKRTQGLCFQVGAEKAELQDMIYEVDADSNGRVGFPEFVTMRARKM KDAGSEEEMREAFRVFDKDGNGYISAAELHHAMTNLGEKLTDEEVDEMIREADIDGDGQA KTTTYCLANANCKMPKIRECEDFRGADLQQELLGLDCSKYSPEFANRNDKGDQVLNCRLA VKVLSPVDGKADILRAAQDFCQLVAQKQK >gi568815585r:98584727_98826449|GENSCAN_predicted_CDS_2|1170_bp atgctctatcggcatcaaaatattgagcccaaatctacaggaacagagttccgacccctg tgcagtaagatgaagcttaggatcagccagcttgcccccgaccaatcacgtggctgcccc actgctattcagctgcccctggcttccccaggcaacagcctccaacagggcacacctgaa gccatccctcatgtcagcgccctggaatttggggactatatcaggaagtcactcaaatac cccaaaacaagagtcaaagatttcctgaaacgggaccttcagggagacaaacgggatatg gctcaagccatcctgcatccacaccacttccaagcccgcatgctccactggaagccatca tcttcctccactccagcaaacagaccacattgtaatgatcccaaggaccacctgacgcag acctggatgagaaaagttgatccaggaaatccaggactagggacgcttcgtttatgtcaa ggaaaaatgtcttcagacattgataaatatcccttctgcggggcaaaaccacccatggat gaaaactacggataccaagctgccctccactttcagagcaataagaagaggacacagggc ctctgctttcaagtgggtgcagaaaaagcagagttacaggacatgatttatgaagtagat gctgatagtaatggcagagttggcttccctgaatttgtgacaatgagggcaagaaaaatg aaagacgcaggcagtgaagaagaaatgagagaagcattccgtgtgtttgataaggatggc aatggttatattagtgcagcagaacttcaccatgcgatgacaaaccttggagagaagtta acagatgaagaggttgacgaaatgatcagggaagcagatattgatggtgacggtcaagca aaaacgaccacttactgcctggccaatgccaattgtaaaatgccaaaaataagggagtgt gaagatttcagaggggctgacttgcagcaagaacttctgggcctagattgttcaaaatac tcaccagagtttgcaaataggaacgacaaaggtgatcaagttctaaattgccgtttggca gtgaaggtgctgtccccggtagatggaaaagcagatattctgagagctgctcaggacttt tgccagttagtagcccagaagcaaaagtga >gi568815585r:98584727_98826449|GENSCAN_predicted_peptide_3|100_aa MKKVLEMDGGHDEKVLEMDGGHDEKVLEMDGGHGPPAMRMFFMPLNCRLKNWFKWPMDFH ARLSSPSLFLATPFSVAFTPSAPKPAAPPGEMLAYCVERK >gi568815585r:98584727_98826449|GENSCAN_predicted_CDS_3|303_bp atgaaaaaagttctggagatggatggtggccatgatgaaaaagttctggagatggatggt ggccatgatgaaaaagttctggagatggatggtggccatggtcctccagcaatgaggatg ttcttcatgccactcaactgcagacttaaaaactggttcaaatggcccatggacttccat gctcgcctttcctctccttcgctgtttctggccactcctttttctgtggcttttactccc tctgcccctaagccggctgcccctccaggtgagatgctggcctactgtgtggagaggaaa tga >gi568815585r:98584727_98826449|GENSCAN_predicted_peptide_4|688_aa MTLCTFRLVCTELAKHTLARPALPIRRTLALSLPGVKDSPSPQQSFFGYPLSIFFIVVNE FCERFSYYGMRAILILYFTNFISWDDNLSTAIYHTFVALCYLTPILGALIADSWLGKFKT IVSLSIVYTIGQAVTSVSSINDLTDHNHDGTPDSLPVHVVLSLIGLALIALGTGGIKPCV SAFGGDQFEEGQEKQRNRFFSIFYLAINAGSLLSTIITPMLRVQQCGIHSKQACYPLAFG VPAALMAVALIVFVLGSGMYKKFKPQGNIMGKVAKCIGFAIKNRFRHRSKAFPKREHWLD WAKEKYDERLISQIKMVTRVMFLYIPLPMFWALFDQQGALEIQPDQMQTVNAILIVIMVP IFDAVLYPLIAKCGFNFTSLKKMAVGMVLASMAFVVAAIVQVEIDVPPVVDCVRISFLCK LNTKPIVCIHHIVHPFTPDRHLGYLHLLAIANSAAMNMAVLTLLLDIASFVEDELWAKTN AFMTFDVNKLTRINISSPGSPVTAVTDDFKQGQRHTLLVWAPNHYQVVKDGLNQKPEKGE NGIRFVNTFNELITITMSGKVYANISSYNASTYQFFPSGMVICFTKRPSAPSNMKSVLQA GWLLTVAVGNIIVLIVAGAGQFSKQWAEYILFAALLLVVCVIFAIMARFYTYINPAEIEA QFDEDEKKNRLEKSNPYFMSGANSQKQM >gi568815585r:98584727_98826449|GENSCAN_predicted_CDS_4|2067_bp atgacgctttgcaccttcaggctggtttgcaccgagcttgcaaaacacacgcttgcccga ccagcccttcctattcgccggacactggcgctttccctgcctggtgtgaaagacagccca agtccacagcagagtttctttggttatcccctgagcatcttcttcatcgtggtcaatgag ttttgcgaaagattttcctactatggaatgcgagcaatcctgattctgtacttcacaaat ttcatcagctgggatgataacctgtccaccgccatctaccatacgtttgtggctctgtgc tacctgacgccaattctcggagctcttatcgccgactcgtggctgggaaagttcaagacc attgtgtcgctctccattgtctacacaattggacaagcagtcacctcagtaagctccatt aatgacctcacagaccacaaccatgatggcacccccgacagccttcctgtgcacgtggtg ctgtccttgatcggcctggccctgatagctctcgggactggaggaatcaaaccctgtgtg tctgcgtttggtggagatcagtttgaagagggccaggagaaacaaagaaacagatttttt tccatcttttacttggctattaatgctggaagtttgctttccacaatcatcacacccatg ctcagagttcaacaatgtggaattcacagtaaacaagcttgttacccactggcctttggg gttcctgctgctctcatggctgtagccctgattgtgtttgtccttggcagtgggatgtac aagaagttcaagccacagggcaacatcatgggtaaagtggccaagtgcatcggttttgcc atcaaaaatagatttaggcatcggagtaaggcatttcccaagagggagcactggctggac tgggctaaagagaaatacgatgagcggctcatctcccaaattaagatggttacgagggtg atgttcctgtatattccactcccaatgttctgggccttgtttgaccagcagggagctctt gaaattcagcccgatcagatgcagaccgtgaacgccatcctgatcgtgatcatggtcccg atcttcgatgctgtgctgtaccctctcattgcaaaatgtggcttcaatttcacctccttg aagaagatggcagttggcatggtcctggcctccatggcctttgtggtggctgccatcgtg caggtggaaatcgatgttcctcctgttgtagactgtgtcagaatttccttcctttgtaaa ctgaatactaaacccattgtgtgtatacaccacattgttcatccattcacccccgacagg cacttgggttacctgcaccttttggctattgcaaacagtgccgccatgaacatggctgta ctcacgctcctcttggacatagctagctttgtggaagatgaattgtgggctaagacaaat gcatttatgacttttgatgtaaacaaactgacaaggataaacatttcttctcctggatca ccagtcactgctgtaactgacgacttcaagcagggccaacgccacacgcttctagtgtgg gcccccaatcactaccaggtggtaaaggatggtcttaaccagaagccagaaaaaggggaa aatggaatcagatttgtaaatacttttaacgagctcatcaccatcacaatgagtgggaaa gtttatgcaaacatcagcagctacaatgccagcacataccagttttttccttctggcatg gtgatctgcttcacgaaacgcccatcggctccttccaacatgaagtcggtgcttcaggca ggatggctgctgaccgtggctgttggcaacatcattgtgctcatcgtggcaggggcaggc cagttcagcaaacagtgggccgagtacattctatttgccgcgttgcttctggtcgtctgt gtaatttttgccatcatggctcggttctatacttacatcaacccagcggagatcgaagct caatttgatgaggatgaaaagaaaaacagactggaaaagagtaacccatatttcatgtca ggggccaattcacagaaacagatgtga >gi568815585r:98584727_98826449|GENSCAN_predicted_peptide_5|476_aa MGKNFTSKTPKAMTKKAKIDKWDLIKLKSSCTAKETAIGVNRTQENGKAVRGSTSLRTPE DSGRNDLSGTQDPATFGHADRQLISNLLSASGKGPQYRINRMGTSFLELELRVFSPAGPP PAERTHGGGSQGSCDLPAGRAPGRCYSSDLTSRPGPVAPGTAPALRGGAAGSTRGAVLRG AEVRAPEPSRVDTSPGPELEKTTLKFIRNQKRAPIAKTILSKKNKAGGIMLPDFKLYYKA TVSKTAWHWYQNRYIDQWNRTEASEITPHIYNYLIFDKPDKNKPWEMDSLFNKWCWENWL AICRKLKLNPFLTPYTKINSRWIKDLNVRPKIIKTLEENLGNTIQDIGMGKDFMSKTPRA MATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKMFAIYPSDKGLISRIYKELKQIY KKKIKQPHQKVGKGYEQTLLKRRHLCSQQTHEKMLIITGHQRNANQNHNQIPSHAS >gi568815585r:98584727_98826449|GENSCAN_predicted_CDS_5|1431_bp atgggcaagaacttcacgagtaaaacaccaaaagcaatgacaaaaaaagccaaaatagac aaatgggatctaattaaactaaagagctcctgcacagcaaaagaaactgccatcggagtg aacagaacacaagagaatgggaaggccgtgaggggctccacctccctcaggacccccgag gattctggaagaaatgatctctctgggacccaggatccagccacctttggacatgctgac agacagttgatttccaacctgctctcagcttcagggaaaggtccacagtaccgcatcaac agaatgggaacgtcgtttttagaactagagctccgagtctttagcccggccggcccccca cccgccgagcgtacccatggcggcggctcccagggctcctgcgacctgccggcgggacgt gctcctggcaggtgctactcctccgacctgacgtccaggcccggccccgttgccccaggt acagccccagctctgcgaggcggggccgccggctccacccggggggcggtgctgcgggga gcagaggtccgtgctcccgagccgtcgcgcgtggacaccagccccggcccagaattggaa aaaactactttaaagttcatacggaaccaaaaaagagcccccatagccaagacaatccta agcaaaaagaacaaagctggaggcatcatgctacctgacttcaaactatactacaaggct acagtaagcaaaacagcatggcactggtaccaaaacagatatatagaccaatggaacaga acagaggcctcagaaataacaccacacatctacaactatctgatctttgacaaacctgac aaaaacaagccatgggaaatggattccctatttaataaatggtgctgggaaaactggcta gccatatgtagaaagctgaaactgaatcccttccttacaccttatacaaaaattaattca agatggattaaagacttaaatgtaagacctaaaatcataaaaaccctagaagaaaaccta ggcaataccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaagagca atggcaacaaaagccaaaatagacaaatgggacctaattaaactaaaaagcttctgcaca gcaaaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatgtttgca atctacccatctgacaaaggactaatatctagaatctacaaagaacttaaacaaatttac aagaaaaaaatcaaacaaccccatcaaaaagtgggcaaaggatatgaacagacccttctc aaaagaagacatttatgcagccaacagacacacgaaaaaatgctcatcatcactggtcat cagagaaatgcaaatcaaaaccacaatcagataccatctcatgccagttag >gi568815585r:98584727_98826449|GENSCAN_predicted_peptide_6|51_aa MPKRLLHLEYLASSGRAVTDKNVSSVGAGLSLRWPPTLPNAVTSMVVGTGR >gi568815585r:98584727_98826449|GENSCAN_predicted_CDS_6|156_bp atgcccaagaggcttctccatcttgagtaccttgcctcttctggccgtgctgtaacggat aagaatgtgagctctgtgggcgcaggcctgtctctgcgctggccacccactctgcccaat gccgtcactagcatggtagttggcactgggaggtga >gi568815585r:98584727_98826449|GENSCAN_predicted_peptide_7|108_aa MPPKGLGSGCQNATHEKCSKPHKTASICEVLDSWVFEGSVCKIFPTGEYLIYCFDSHGGE LPGGRKVRIREEDAMAKEAESKEDVGKFYAAGFEDGGEGYTIRHAGGL >gi568815585r:98584727_98826449|GENSCAN_predicted_CDS_7|327_bp atgccaccaaaaggtcttggatcaggctgccaaaatgctacgcacgagaaatgtagcaaa cctcacaaaactgcttccatctgtgaagtactcgacagctgggtctttgaggggtccgtc tgtaagatctttccaactggagaatacctcatctactgtttcgacagtcatggaggggag ctcccgggaggtaggaaagtcaggatcagagaggaagatgcgatggcgaaagaagcagag agcaaggaagatgttggaaaattctatgctgctggctttgaagatggaggagaaggctac acaatcaggcatgccggcggcctctag >gi568815585r:98584727_98826449|GENSCAN_predicted_peptide_8|59_aa MSFICVMTMMTSIIYVMTVMTSIIYVMTTMMSIVYVMTMKMSIVYVTTTMMSIVYVLTT >gi568815585r:98584727_98826449|GENSCAN_predicted_CDS_8|180_bp atgtccttcatctgtgtgatgactatgatgacatccatcatctatgtgatgactgtgatg acgtccattatctatgtaatgaccacgatgatgtccattgtctacgtaatgaccatgaag atgtcaattgtctatgtaacgaccacgatgatgtccattgtctatgtgttgactacatga >gi568815585r:98584727_98826449|GENSCAN_predicted_peptide_9|418_aa XVFRQGCTAFRVITPNIDEEASMMEDVGMQDVHFNEDVLMELLEQCADGLWKAERYELIA DIYKLIIPIYEKRRDFERLAHLYDTLHRAYSKVTEVMHSGRRLLGTYFRVAFFGQGFFED EDGKEYIYKEPKLTPLSEISQRLLKLYSDKFGSENVKMIQDSGKVNPKDLDSKYAYIQVT HVIPFFDEKELQERKTEFERSHNIRRFMFEMPFTQTGKRQGGVEEQCKRRTILTAIHCFP YVKKRIPVMYQHHTDLNPIEVAIDEMSKKVAELRQLCSSAEVDMIKLQLKLQGSVSVQVN AGPLAYARAFLDDTNTKRYPDNKVKLLKEVFRQFVEACGQALAVNERLIKEDQLEYQEEM KANYREMAKELSEIMHEQICPLEEKTSVLPNSLHIFNAISGTPTSTMVHGMTSSSSVV >gi568815585r:98584727_98826449|GENSCAN_predicted_CDS_9|1257_bp ngcgtgtttagacaaggatgcaccgccttcagggtcattaccccaaacatcgacgaggag gcctccatgatggaagacgtggggatgcaggatgtccatttcaacgaggatgtgctgatg gagctccttgagcagtgcgcagatggactctggaaagccgagcgctacgagctcatcgcc gacatctacaaacttatcatccccatttatgagaagcggagggattttgagaggctggcc catctgtatgacacgctgcaccgggcctacagcaaagtgaccgaggtcatgcactcgggc cgcaggcttctggggacctacttccgggtagccttcttcgggcagggattctttgaagat gaagatggaaaggagtatatttacaaggaacccaaactcacaccgctgtcggaaatttct cagagactccttaaactgtactcggataaatttggttctgaaaatgtcaaaatgatacag gattctggcaaggtcaaccctaaggatctggattctaagtatgcatacatccaggtgact cacgtcatccccttctttgacgaaaaagagttgcaagaaaggaaaacagagtttgagaga tcccacaacatccgccgcttcatgtttgagatgccatttacgcagaccgggaagaggcag ggcggggtggaagagcagtgcaaacggcgcaccatcctgacagccatacactgcttccct tatgtgaagaagcgcatccctgtcatgtaccagcaccacactgacctgaaccccatcgag gtggccattgacgagatgagtaagaaggtggcggagctccggcagctgtgctcctcggcc gaggtggacatgatcaaactgcagctcaaactccagggcagcgtgagtgttcaggtcaat gctggcccactagcatatgcgcgagctttcttagatgatacaaacacaaagcgatatcct gacaataaagtgaagctgcttaaggaagttttcaggcaatttgtggaagcttgcggtcaa gccttagcggtaaacgaacgtctgattaaagaagaccagctcgagtatcaggaagaaatg aaagccaactacagggaaatggcgaaggagctttctgaaatcatgcatgagcagatctgc cccctggaggagaagacgagcgtcttaccgaattcccttcacatcttcaacgccatcagt gggactccaacaagcacaatggttcacgggatgaccagctcgtcttcggtcgtgtga