GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:27:50 Sequence gi568815592f:2910023_3119652 : 209630 bp : 45.49% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 PlyA - 513 508 6 1.05 1.12 Term - 5449 5277 173 2 2 98 29 97 0.559 2.79 1.11 Intr - 30465 30293 173 2 2 74 3 173 0.019 6.99 1.10 Intr - 38677 38280 398 1 2 105 47 531 0.256 44.08 1.09 Intr - 39047 38892 156 2 0 107 44 74 0.968 5.11 1.08 Intr - 40813 40753 61 0 1 27 30 90 0.390 -4.16 1.07 Intr - 41823 41718 106 1 1 94 34 42 0.417 -1.33 1.06 Intr - 43164 43018 147 1 0 69 63 125 0.830 8.31 1.05 Intr - 44687 44570 118 2 1 57 62 69 0.974 1.24 1.04 Intr - 45648 45502 147 0 0 82 89 75 0.993 7.43 1.03 Intr - 49320 49146 175 2 1 79 92 205 0.941 19.94 1.02 Intr - 56568 56378 191 0 2 42 106 66 0.006 2.18 1.01 Init - 63886 63800 87 1 0 57 98 52 0.079 3.75 1.00 Prom - 70282 70243 40 -6.36 2.00 Prom + 73566 73605 40 -3.06 2.01 Init + 78649 78687 39 0 0 87 96 34 0.581 4.34 2.02 Intr + 79026 79128 103 2 1 70 24 74 0.659 -1.05 2.03 Intr + 79506 79688 183 1 0 94 33 58 0.321 0.66 2.04 Intr + 89947 90063 117 2 0 63 113 115 0.897 11.94 2.05 Intr + 93164 93200 37 0 1 112 95 -26 0.151 -2.18 2.06 Intr + 96446 96537 92 2 2 125 95 99 0.450 13.94 2.07 Intr + 100003 100167 165 2 0 39 105 160 0.803 12.73 2.08 Intr + 102522 102652 131 1 2 77 72 107 0.751 8.41 2.09 Intr + 105508 105621 114 0 0 85 91 128 0.999 13.44 2.10 Intr + 106862 106963 102 1 0 109 74 86 0.997 9.67 2.11 Term + 109457 109633 177 1 0 74 35 90 0.535 -0.01 2.12 PlyA + 109713 109718 6 1.05 3.03 PlyA - 110668 110663 6 1.05 3.02 Term - 113586 113113 474 0 0 -12 44 324 0.171 12.89 3.01 Init - 117948 117820 129 0 0 88 15 152 0.224 6.51 3.00 Prom - 127681 127642 40 -2.86 4.00 Prom + 129710 129749 40 -2.86 4.01 Init + 135362 135699 338 1 2 107 36 371 0.005 30.55 4.02 Intr + 135939 136102 164 0 2 73 81 97 0.004 7.12 4.03 Intr + 143197 143335 139 2 1 43 115 28 0.013 0.52 4.04 Intr + 144329 144429 101 2 2 89 93 16 0.018 1.95 4.05 Intr + 149163 149288 126 1 0 -3 73 101 0.003 0.15 4.06 Intr + 153573 153678 106 1 1 96 47 62 0.001 2.17 4.07 Intr + 158441 158639 199 2 1 63 89 58 0.003 2.75 4.08 Intr + 166742 166965 224 1 2 124 61 179 0.319 15.73 4.09 Intr + 167757 167913 157 0 1 102 65 264 0.999 25.51 4.10 Intr + 170957 171094 138 1 0 45 92 84 0.962 5.16 4.11 Intr + 173093 173291 199 1 1 56 78 336 0.733 28.32 4.12 Intr + 175237 175386 150 2 0 24 99 97 0.600 4.53 4.13 Intr + 176570 176611 42 0 0 74 100 32 0.659 1.21 4.14 Intr + 179559 179635 77 1 2 64 102 33 0.576 1.53 4.15 Term + 182165 182620 456 1 0 1 39 274 0.220 8.93 4.16 PlyA + 184435 184440 6 1.05 5.00 Prom + 188534 188573 40 -4.96 5.01 Init + 195504 196029 526 2 1 45 99 235 0.216 15.32 5.02 Intr + 200781 200933 153 1 0 98 91 65 0.750 7.84 5.03 Term + 203031 203317 287 1 2 91 40 327 0.971 23.77 5.04 PlyA + 203658 203663 6 1.05 6.00 Prom + 204045 204084 40 -6.56 6.01 Init + 208719 208825 107 2 2 98 62 182 0.827 14.32 6.02 Term + 209263 209290 28 1 1 74 48 53 0.455 -2.55 6.03 PlyA + 209504 209509 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 135362 135778 417 1 0 107 42 479 0.995 41.50 S.002 Term - 154120 153912 209 2 2 116 39 167 0.929 12.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:2910023_3119652|GENSCAN_predicted_peptide_1|643_aa MCDGRTAPTTTSYRVHAVNSVAVEKLCLRHATKITTCTVHRRFTLLQDSNAATDLTGGGA QAVMFAHLLLTSCCVAQFLTDHRRICWGLLHIKSAIMDVLAEANGTFALNLLKTLGKDNS KNVFFSPMSMSCALAMVYMGAKGNTAAQMAQILSFNKSGGGGDIHQGFQSLLTEVNKTGT QYLLRMANRLFGEKSCDFLSSFRDSCQKFYQAEMEELDFISAVEKSRKHINTWVAEKTEG KIAELLSPGSVDPLTRLVLVNAVYFRGNWDEQFDKENTEERLFKVSKAKRESCFLYGVGQ NYRSCGFGEMWNPVMLSEFCSEEGRMCYTRDWGPYGVFDCAYCRNEEKPVQMMFKQSTFK KTYIGEIFTQILVLPYVGKELNMIIMLPDETTDLRTVEKELTYEKFVEWTRLDMMDEEEV EVSLPRFKLEESYDMESVLRNLGMTDAFELGKADFSGMSQTDLSLSKVVHKSFVEVNEEG TEAAAATAAIMMMRCARFVPRFCADHPFLFFIQHSKTNGILFCGRFSSPNKVTIFAVSAT AHKGNADAKCKRTNLITAQKTPTGYHYKLRQPAFILLSGPTHILLIGCTWVIQPLSHGSI PPGPAPSDVSTGPKSTPPTQVSTWDLGFIFDVYAKAALDSSLA >gi568815592f:2910023_3119652|GENSCAN_predicted_CDS_1|1932_bp atgtgtgatggcaggacagcccccacaaccacaagttatcgggttcatgcggtcaacagt gtcgcagttgagaaactctgccttaggcatgcaaccaagatcaccacgtgcacagttcac cgtaggttcactctcctacaagactccaacgctgccactgacctaacaggaggtggagct caggcggtgatgttcgctcacctgctgctcacctcctgctgcgtggcccagttcctaaca gaccacagacggatctgctggggactcctgcatataaagtctgccatcatggatgttctc gcagaagcaaatggcacctttgccttaaaccttttgaaaacgctgggtaaagacaactcg aagaatgtgtttttctcacccatgagcatgtcctgtgccctggccatggtctacatgggg gcaaagggaaacaccgctgcacagatggcccagatactttctttcaataaaagtggcggt ggtggagacatccaccagggcttccagtctcttctcaccgaagtgaacaagactggcacg cagtacttgcttaggatggccaacaggctctttggggaaaagtcttgtgatttcctctca tcttttagagattcctgccaaaaattctaccaagcagagatggaggagcttgactttatc agcgccgtagagaagtccagaaaacacataaacacctgggtagctgaaaagacagaaggt aaaattgcggagttgctctctccgggctcagtggatccattgacaaggctggttctggtg aatgctgtctatttcagaggaaactgggatgaacagtttgacaaggagaacaccgaggag agactgtttaaagtcagcaaggcgaaaagggaaagctgctttctatatggtgtaggccag aactataggtcttgtgggtttggggagatgtggaatcccgtcatgctttctgagttctgc agtgaagaaggaagaatgtgctacaccagagactggggaccatacggcgtctttgactgt gcctattgcaggaatgaggagaaacctgtgcaaatgatgtttaagcaatctacttttaag aagacctatataggagaaatatttacccaaatcttggtgcttccatatgttggcaaggaa ctgaatatgatcatcatgcttccggacgagaccactgacttgagaacggtggagaaagaa ctcacttacgagaagttcgtagaatggacgaggctggacatgatggatgaagaggaggtg gaagtgtccctcccgcggtttaaactagaggaaagctacgacatggagagtgtcctgcgc aacctgggcatgactgatgccttcgagctgggcaaggcagacttctctggaatgtcccag acagacctgtctctgtccaaggtcgtgcacaagtcttttgtggaggtcaatgaggaaggc acggaggctgcagccgccacagctgccatcatgatgatgcggtgtgccagattcgtcccc cgcttctgcgccgaccaccccttccttttcttcatccagcacagcaagaccaacgggatt ctcttctgcggccgcttttcctctccaaataaagtgacaattttcgcagtgagcgctaca gctcataaaggcaacgcagacgccaaatgcaaacgaacaaaccttatcacagcacagaaa accccaacgggttaccactacaagctccggcagcctgcttttattcttttatctggcccc acccacatcctgctgattgggtgcacatgggtgatccagccactctcacatggaagcata cctcctggacctgccccaagtgatgtgtcgactggccccaaatcaacacctccaactcaa gtttccacgtgggatttgggcttcatatttgatgtctatgccaaggctgccctggacagc tccctagcctag >gi568815592f:2910023_3119652|GENSCAN_predicted_peptide_2|419_aa MAPRSRMNRESDRLRPERLLPGESPPLCNQPAAAEGAWVPSAEPAAHDALAYIKICIYGN VTGSPGYPKKGCFLLRGEANARNRKGVCQAVQAVFNGYGIGRSEINLACVSECAVQCGRN LAQLLERSLGRRGSQSCGSYWGVRWSEDNPFPDPPFWLTLLLDSLKRDYAGKPQPPIKSE RRNPPSYAMAGKKVLIVYAHQEPKSFNGSLKNVAVDELSRQGCTVTVSDLYAMNLEPRAT DKDITGTLSNPEVFNYGVETHEAYKQRSLASDITDEQKKVREADLVIFQFPLYWFSVPAI LKGWMDRVLCQGFAFDIPGFYDSGLLQGKLALLSVTTGGTAEMYTKTGVNGDSRYFLWPL QHGTLHFCGFKVLAPQISFAPEIASEEERKGMVAAWSQRLQTIWKEEPIPCTAHWHFGQ >gi568815592f:2910023_3119652|GENSCAN_predicted_CDS_2|1260_bp atggcgccgcgctcccggatgaacagagaaagcgacaggctgcgcccggagcgcctgctg cctggcgagagcccgcccctctgcaaccagcccgctgcagcggagggcgcctgggtgccc tcggccgagccagcagcccacgacgcccttgcatacattaagatatgtatatatggaaat gttaccgggagtcccggttatcccaaaaagggttgtttcctgttgcgtggtgaggccaat gcacgaaaccgaaagggagtgtgtcaagcagtgcaggctgtattcaatggctatggaatt ggaagatctgaaatcaacttagcttgcgtcagcgagtgcgcggtccagtgcggccggaac ctggcgcaactcctagagcggtccttggggagacgcgggtcccagtcctgcggctcctac tggggagtgcgctggtcggaagataatccctttccagacccgccattttggctcacccta ttgctggactcgctgaagagagactacgcaggaaagccccagccacccatcaaatcagag agaaggaatccaccttcttacgctatggcaggtaagaaagtactcattgtctatgcacac caggaacccaagtctttcaacggatccttgaagaatgtggctgtagatgaactgagcagg cagggctgcaccgtcacagtgtctgatttgtatgccatgaaccttgagccgagggccaca gacaaagatatcactggtactctttctaatcctgaggttttcaattatggagtggaaacc cacgaagcctacaagcaaaggtctctggctagcgacatcactgatgagcagaaaaaggtt cgggaggctgacctagtgatatttcagttcccgctgtactggttcagcgtgccagccatc ctgaagggctggatggatagggtgctgtgccagggctttgcctttgacatcccaggattc tacgattccggtttgctccagggtaaactagcgctcctttccgtaaccacgggaggcacg gccgagatgtacacgaagacaggagtcaatggagattctcgatacttcctgtggccactc cagcatggcacattacacttctgtggatttaaagtccttgcccctcagatcagctttgct cctgaaattgcatccgaagaagaaagaaaggggatggtggctgcgtggtcccagaggctg cagaccatctggaaggaagagcccatcccctgcacagcccactggcacttcgggcaataa >gi568815592f:2910023_3119652|GENSCAN_predicted_peptide_3|200_aa MAGCRSRALPRGKAAKARREIERSARGLALLGDPVHPPQPLARTEELKVKFYRDNQGHLK GDRLCDHWKREAVDLAFMHLDEDDTGNCTLQVEVAKYQRNGKYEASGRKCANHRKAPSLR QKRPRRSPSKRRDTSELSSSNTFHPVDFEDGQRRPSRRVKFGPTRRLIVFDRHPAGEPVS WRNAGAAAHCIQTFDGRWFE >gi568815592f:2910023_3119652|GENSCAN_predicted_CDS_3|603_bp atggcgggctgcaggtcccgagccctgccccgcgggaaggcagctaaagcccggcgagaa atcgagcgcagcgcccgtgggctggcactgctgggggacccagtacatcctccgcagccg ctggcccggacagaagaacttaaggtcaagttttacagagataaccaaggacatcttaaa ggagacaggctgtgcgatcattggaagagggaagctgtggatcttgcattcatgcatttg gatgaagatgacactggaaactgcacgttgcaggttgaggtggccaagtatcaacggaat gggaaatatgaggcttcaggaaggaagtgcgcgaaccacaggaaggctccgtctctacgg cagaagcggccacgccgcagcccatccaaacgccgcgacacgagtgagttgtcctcatcg aacacatttcatcctgtggattttgaggatggccagagaagaccttcaaggagagtcaag tttggaccaactcggaggctcattgtctttgacagacacccagctggtgaacctgtgtcc tggaggaatgcaggagcagctgctcattgtattcaaacctttgatggaaggtggtttgaa tga >gi568815592f:2910023_3119652|GENSCAN_predicted_peptide_4|871_aa MVELQQLRVQEVVDSMVKSLERENIWKTQGLMFWCSASCCEDSQAFTQQVHQCIECCPVP LAQVQALVTSELEKFQDHLARCTMHCNDKAKDSIDAGSKELQVKQQLDGCVTNPEKGSGK KCAKLGQRYYGRVLELILPVASACPGSKDLLCNKPEPSNIPDSSYYTGGLISGQVLGPQD RYEWNFVLFGDGTGGGQETINKPAKKAPQNSRWRYPGYRMRVCSATKRMSHWYSTPISLL DLPGSYADQTSHRTPFSQSRIQSKAPTLFNPMQDERSEEIAEEKLEASRDALFLRAEALD AAGSQPTSLGCQVLNPVLKRPWRRGAGAGGGGSANSPRPPAGRARRGRRASCRSAATPGD PQLGRQSAAIRAGPTERGRTWLDGAATEKGAGTALPGGEKVVPFWAFLSFRMQPDMSLNV IKMKSSDFLESAELDSGGFGKVSLCFHRTQGLMIMKTVYKGPNCIEHNEALLEEAKMMNR LRHSRVVKLLGVIIEEGKYSLVMEYMEKGNLMHVLKAEMSTPLSVKGRIILEIIEGMCYL HGKGVIHKDLKPENILVDNDFHIKMWSKLNNEEHNELREVDGTAKKNGGTLYYMAPEHLN DVNAKPTEKSDVYSFAVVLWAIFANKEPYENAICEQQLIMCIKSGNRPDVDDITEYCPRE IISLMKLCWEANPEARPTFPVGTIQKQPNGRDVQGIEEKFRPFYLSQLEESVEEDVKSLK RAYLPHLVTAERLPAAPRNRRAPTCRTSEPQSAYLPHLGTAERLPAAPRNRRAPTCRTSE PQSAYLPHLGTAERLPAAPRNRRAPTCRTSEPQRAYLPHLGTAARLPAAPRNRSAPTCRT SEPQRAYLPHLGTAARLPAAPSNRSAPTCRT >gi568815592f:2910023_3119652|GENSCAN_predicted_CDS_4|2616_bp atggtggagctgcagcagctgcgggtgcaggaggtggtggactccatggtgaagagtctg gaaagggagaacatctggaagacacagggtctcatgttctggtgcagcgccagctgttgt gaggacagccaggcattcacccagcaggtgcaccagtgcatcgagtgctgccctgtgcct ctggctcaagtccaggccttggtcaccagtgagttggagaagttccaggaccacctggct cggtgcaccatgcattgcaatgacaaagccaaagattcaatagatgctgggagtaaggag cttcaggtgaagcagcagctggacggttgtgtgaccaatcctgaaaagggaagtggaaaa aagtgtgctaaattgggtcagagatattacgggagagttttagagcttattcttcctgtg gccagtgcttgtcctggaagtaaggatctcctctgtaacaagccagagccctccaacata ccagactcttcttactacacaggagggctcatctctgggcaggtgctggggccacaagac agatacgaatggaactttgtactcttcggggacggtacggggggcgggcaagaaacaata aacaagccggcaaagaaagcgccacaaaattccaggtggcgatatccaggttacagaatg agagtctgtagtgcgacaaaaagaatgagccactggtactccacacccatcagcctgctg gatctcccggggagttacgctgatcaaaccagccacagaactcccttcagccaaagccga atccagagcaaggccccaactctcttcaatcctatgcaggatgagagaagtgaggaaatt gcagaagaaaagttggaagctagcagagacgccttgtttcttcgcgcggaggccctggac gccgcaggctcccaacctacttctctgggctgtcaggttctgaacccggtcctgaagagg ccctggcgccggggggccggggcagggggaggaggcagcgcgaacagtccacgccctcca gccgggcgcgctcgacgcggacggcgggccagctgccggagcgcggcgactccaggggac ccacagctggggcgccagagcgcggccatccgggcggggccgacggagcgcggcaggact tggctggacggcgcggccacggagaagggcgcgggtacagctctgccggggggggaaaaa gtggtaccattttgggcgttcttgagcttcagaatgcaaccagacatgtccttgaatgtc attaagatgaaatccagtgacttcctggagagtgcagaactggacagcggaggctttggg aaggtgtctctgtgtttccacagaacccagggactcatgatcatgaaaacagtgtacaag gggcccaactgcattgagcacaacgaggccctcttggaggaggcgaagatgatgaacaga ctgagacacagccgggtggtgaagctcctgggcgtcatcatagaggaagggaagtactcc ctggtgatggagtacatggagaagggcaacctgatgcacgtgctgaaagccgagatgagt actccgctttctgtaaaaggaaggataattttggaaatcattgaaggaatgtgctactta catggaaaaggcgtgatacacaaggacctgaagcctgaaaatatccttgttgataatgac ttccacattaagatgtggagcaaactgaataatgaagagcacaatgagctgagggaagtg gacggcaccgctaagaagaatggcggcaccctctactacatggcgcccgagcacctgaat gacgtcaacgcaaagcccacagagaagtcggatgtgtacagctttgctgtagtactctgg gcgatatttgcaaataaggagccatatgaaaatgctatctgtgagcagcagttgataatg tgcataaaatctgggaacaggccagatgtggatgacatcactgagtactgcccaagagaa attatcagtctcatgaagctctgctgggaagcgaatccggaagctcggccgacatttcct gtgggtacaattcagaaacagccaaatggaagagatgtacaaggcattgaagaaaaattt aggcctttttatttaagtcaattagaagaaagtgtagaagaggacgtgaagagtttaaag cgcgcctacctgccgcacctagtaaccgcagagcgcctacctgccgcacctcggaaccgc agagcgcctacctgccgcacctcggaaccgcagagcgcctacctgccgcacctcggaacc gcagagcgcctacctgccgcacctcggaaccgcagagcgcctacctgccgcacctcggaa ccgcagagcgcctacctgccgcacctcggaaccgcagagcgcctacctgccgcacctcgg aaccgcagagcgcctacctgccgcacctcggaaccgcagcgcgcctacctgccgcacctc ggaaccgcagcgcgcctacctgccgcacctcggaaccgcagcgcgcctacctgccgcacc tcggaaccgcagcgcgcctacctgccgcacctcggaaccgcagcgcgcctacctgccgca cctagtaaccgcagcgcgcctacctgccgcacctag >gi568815592f:2910023_3119652|GENSCAN_predicted_peptide_5|321_aa MGPVEESWFAPSLEHPQEENEPSLQSKLQDEANYHLYGSRMDRQTKQQPRQNVAYNREEE RRRRVSHDPFAQQRPYENFQNTEGKGTAYSSAASHGNAVHQPSGLTSQPQVLYQNNGLYS SHGFGTRPLDPGTAGPRVWYRPIPSHMPSLHNIPVPETNYLGNTPTMPFSSLPPTDESIK YTIYNSTGIQIGAYNYMEIGGTSSSLLDSTNTNFKEEPAAKYQAIFDNTTSLTDKHLDPI RENLGKHWKNCARKLGFTQSQIDEIDHDYERDGLKEKVYQMLQKWVMREGIKGATVGKLA QALHQCSRIDLLSSLIYVSQN >gi568815592f:2910023_3119652|GENSCAN_predicted_CDS_5|966_bp atgggtcctgtggaggagtcctggtttgctccttccctggagcacccacaagaagagaat gagcccagcctgcagagtaaactccaagacgaagccaactaccatctttatggcagccgc atggacaggcagacgaaacagcagcccagacagaatgtggcttacaacagagaggaggaa aggagacgcagggtctcccatgacccttttgcacagcaaagaccttacgagaattttcag aatacagagggaaaaggcactgcttattccagtgcagccagtcatggtaatgcagtgcac cagccctcagggctcaccagccaacctcaagtactgtatcagaacaatggattatatagc tcacatggctttggaacaagaccactggatccaggaacagcaggtcccagagtttggtac aggccaattccaagtcatatgcctagtctgcataatatcccagtgcctgagaccaactat ctaggaaatacacccaccatgccattcagctccttgccaccaacagatgaatctataaaa tataccatatacaatagtactggcattcagattggagcctacaattatatggagattggt gggacgagttcatcactactagacagcacaaatacgaacttcaaagaagagccagctgct aagtaccaagctatctttgataataccactagtctgacggataaacacctggacccaatc agggaaaatctgggaaagcactggaaaaactgtgcccgtaaactgggcttcacacagtct cagattgatgaaattgaccatgactatgagcgagatggactgaaagaaaaggtttaccag atgctccaaaagtgggtgatgagggaaggcataaagggagccacggtggggaagctggcc caggcgctccaccagtgttccaggatcgaccttctgagcagcttgatttacgtcagccag aactaa >gi568815592f:2910023_3119652|GENSCAN_predicted_peptide_6|44_aa MVAVLGGRGVLRLRLLLSALKPGIHVPRAGPAAAFGNQPEPEYR >gi568815592f:2910023_3119652|GENSCAN_predicted_CDS_6|135_bp atggtggctgtgctgggcggccggggcgtgttgcgcctgcggctgcttctctcagcgctg aagcccgggatccacgtcccacgggccggacccgcggccgcgttcggaaatcagcctgag cctgagtaccgctaa