GENSCAN 1.0    Date run: 7-Nov-116    Time: 20:04:57

Sequence gi568815591r:20040949_20261862 : 220914 bp : 37.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4798   4915  118  0  1   84   80    84 0.614   7.63
 1.02 Term +  15358  15668  311  2  2   27   40   195 0.626   2.84
 1.03 PlyA +  15851  15856    6                               1.05

 2.00 Prom +  16771  16810   40                              -3.25
 2.01 Init +  29372  29951  580  1  1   49  -22   273 0.641   7.76
 2.02 Intr +  30175  30819  645  2  0   44   40   285 0.682  10.32
 2.03 Term +  30968  31611  644  0  2   61   41   223 0.872   8.24
 2.04 PlyA +  31753  31758    6                               1.05

 3.00 Prom +  32517  32556   40                              -3.65
 3.01 Init +  48487  48536   50  0  2   77   94    56 0.947   5.57
 3.02 Term +  49032  49587  556  0  1  -36   48   312 0.581   7.61
 3.03 PlyA +  49797  49802    6                               1.05

 4.00 Prom +  56383  56422   40                              -3.35
 4.01 Sngl +  60078  60464  387  2  0   71   39   233 0.976  12.66
 4.02 PlyA +  61148  61153    6                               1.05

 5.09 PlyA -  61759  61754    6                               1.05
 5.08 Term -  66936  66890   47  1  2  119   49    48 0.137   0.39
 5.07 Intr -  78879  78829   51  1  0   69   98    34 0.190   0.46
 5.06 Intr -  82169  82038  132  0  0   27  109    94 0.236   5.10
 5.05 Intr -  88307  88183  125  1  2   47   81   132 0.348   7.71
 5.04 Intr - 102486 102386  101  0  2   23   85    33 0.002  -5.51
 5.03 Intr - 113433 113245  189  0  0   61   94   240 0.963  20.76
 5.02 Intr - 119297 117256 2042  0  2   24   89   989 0.834  78.97
 5.01 Init - 120914 120800  115  2  1   58   80    59 0.684   2.52
 5.00 Prom - 121387 121348   40                              -7.75

 6.00 Prom + 125950 125989   40                              -3.75
 6.01 Init + 127500 127638  139  2  1   78   45   119 0.964   6.95
 6.02 Intr + 129102 129207  106  1  1   86   30   121 0.050   4.45
 6.03 Intr + 150772 150904  133  1  1  124  107    58 0.264  11.03
 6.04 Intr + 152588 152609   22  1  1   41  115    19 0.074  -3.90
 6.05 Intr + 162381 162476   96  1  0   53   66   183 0.276  11.66
 6.06 Intr + 167949 168059  111  1  0   51   80    99 0.174   4.83
 6.07 Intr + 175573 175633   61  2  1   70  111   -20 0.052  -4.63
 6.08 Intr + 178212 178284   73  0  1   80   25   105 0.505   1.89
 6.09 Intr + 178635 178731   97  2  1   94   34   128 0.635   6.66
 6.10 Intr + 178999 179279  281  2  2   72   62   128 0.743   4.87
 6.11 Intr + 184571 184696  126  1  0   43  100    58 0.332   2.46
 6.12 Intr + 192685 192799  115  0  1   72   21    68 0.014  -2.40
 6.13 Intr + 197211 197266   56  1  2   87  103    40 0.051   3.18
 6.14 Term + 210949 211116  168  0  0   82   52    79 0.393   0.60
 6.15 PlyA + 211196 211201    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 129102 129211  110  1  2   86   32   130 0.904   4.89

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:20040949_20261862|GENSCAN_predicted_peptide_1|142_aa
MAYNGPAQCLGASQKWAAATLQPHSRVNLQDFNKWYIWSDSEKAFDKIQHHFVTKILNKV
AIEGKCLKVVKAICDEPTANIILNGEKLTAFPLRTGTRQGCPISPLLFNIVLEVLARAIR
QDKERKGIQIGKEEVKVSLFIR

>gi568815591r:20040949_20261862|GENSCAN_predicted_CDS_1|429_bp
atggcttataatgggcctgctcagtgtttaggcgccagccaaaaatgggcagcagctacc
ttgcagccacactccagggtgaacttgcaagacttcaacaaatggtacatctggtcagac
tcagaaaaagcatttgataaaatccagcatcactttgtgacaaaaatcctcaacaaagta
gccatagaagggaaatgcctcaaagtagtaaaagccatatgtgacgaacccacagccaac
atcatactgaatggagaaaagctgacagcattccccctgagaactggaacaagacaagga
tgcccaatttcaccacttctattcaacatagtactggaagtcctagccagagcaatcaga
caagacaaagaaagaaagggcatccaaattggaaaggaggaagtcaaagtgtcactgttc
attcgatga

>gi568815591r:20040949_20261862|GENSCAN_predicted_peptide_2|622_aa
MGDFNTPLSTLDRSTRQKVYKDIQELNTALHQADLIDIYRTLYPKSTEYTFSEYTFFSAP
HHTYSKIDHLVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKFTQNHSTTWKLNNLLL
NNYWVHNEMKADIKMFFEMNENKDTTYQNLWDTFKAACKGKFIALNAHKRKQERSKIDTL
TSQLKELEKQEQTQIQTTIREYYKHVYANKLENLEEMDKFLDTYTLTRLNQEEVESLHRP
ITGSEIEAIINSLRTQKSPGTDGFTAKFYQSYKEELVPFLLKLFQSIEKDGILPNSFYEA
SIILIPKPGRDTTKKEDFRPISLKNIDTKILSKILANQIQQHIKKLIHHDEVGFIPVMQG
WFNICKSINIIHHINRTKDKNHMIISTDAEKTFEKIQQCFMLKTLHKLVLEFLARAIRQE
KEIKGIQLGKEEIKLFLLADDTIVCLENPLISAQNLLKLISNFSKVSRHKISVQKPQAFL
YTNNRQIESQIMSELPFTIASKKIKYLGIQLTRDVKDLFKENYKPLLNEIKEYSNKWKNI
PCSWIGRINIMKMAIWPKVTYRFSAIPIKLPMTFFTELEKTTLKFIWNQKRAHITKTILS
QKNKAAGITLPDFKLYYKSTVS

>gi568815591r:20040949_20261862|GENSCAN_predicted_CDS_2|1869_bp
atgggagactttaacaccccactgtcaacattagacagatcaacaagacagaaagtttac
aaggatatccaggaattaaacacagctctgcaccaagcggacctaatagacatctacaga
actctctaccccaagtcaacagaatatacattttcagaatatacatttttctcagcacca
catcacacttattccaaaattgaccacttagttggaagtaaagcactccttagcaaatgt
aaaagaacagaaattataacaaactgtctctcagaccacagtgcaatcaaactagaactc
aggattaagaaattcactcaaaaccactcaactacatggaaactgaacaacctgctcctg
aacaactactgggtacataacgaaatgaaggcagacataaagatgttctttgaaatgaat
gagaacaaagacacaacataccagaatctctgggacacatttaaagcagcatgtaaaggg
aaatttatagcactaaatgcccacaagagaaagcaggaaagatctaaaattgacacccta
acatcacaattaaaagaactagagaagcaagagcaaacacaaatacaaactaccatcaga
gaatactataaacacgtctatgcaaataaactagaaaatctagaagaaatggataaattc
ctggacacatacaccctcacaagactaaaccaggaagaagttgaatccctgcataggcca
ataacaggctctgaaattgaggcaataattaatagcctacgaacccaaaaaagtccagga
acagatggattcacagccaaattctaccagagttacaaagaggagctggtaccattcctt
ctgaaactattccaatcaatagaaaaagatggaatcctccctaactcattttatgaggcc
agcatcatcctgataccaaagcctggcagagacacaacaaaaaaagaggattttagacca
atatccctgaagaacatcgatacaaaaatcctcagtaaaatactggcaaaccaaatccag
cagcacatcaaaaagcttatccaccatgatgaagtgggcttcattcctgtgatgcaaggc
tggttcaacatatgcaaatcaataaacataatccatcatataaacagaaccaaagacaaa
aaccacatgattatctcaacagatgcagaaaagacctttgagaaaattcaacagtgcttc
atgctaaaaactctccataaattagtgttggaatttctggccagggcaatcaggcaggag
aaagaaataaagggtattcaattaggaaaagaggaaatcaaattgttcctgttggctgat
gacacgattgtgtgtttagaaaaccccctcatctcagcccaaaatctccttaaactgata
agcaacttcagcaaagtctcaagacacaaaatcagtgtgcaaaaaccacaagcattctta
tacaccaataacagacaaatagagagccaaatcatgagtgaactcccattcacaattgct
tcaaagaaaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag
gagaactacaaaccactgctcaatgaaataaaagagtactcaaacaaatggaagaacatt
ccatgctcatggataggaagaatcaatatcatgaaaatggccatatggcccaaggtaact
tatagattcagtgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa
actactttaaagttcatatggaaccaaaaaagggcccacatcaccaagacaatcctaagc
caaaagaataaggctgcaggcatcacgctacctgacttcaaactatactacaagtctaca
gtatcctaa

>gi568815591r:20040949_20261862|GENSCAN_predicted_peptide_3|201_aa
MTPQKPHPKVNGLKDKRPNLQWIGVPERDGENGIKLENTLQDIIQENFPNLARQANIQTQ
EIQRTPVGYSMRRSTPRHIIVRFSKVEMKEKMLRAVREKVQVTYKGKPIRLTMNLSAEAL
QARRDLGPILHILKEKNFQPRISYLAKVSFISEGEDKQMLKDFVTSRPALQELLKVALNM
ERKNHIRPLQKHIEIHRSTTL

>gi568815591r:20040949_20261862|GENSCAN_predicted_CDS_3|606_bp
atgaccccacagaaaccccatccgaaggtcaacggcctcaaagataaaagaccaaatcta
caatggattggtgtacctgaaagagatggggagaatggaatcaagttggaaaacacactt
caagatatcatccaggagaacttccccaatctagcaagacaggccaacattcaaactcag
gaaatccagagaaccccagtaggatactccatgagaagatctaccccaagacacataatt
gtcagattctccaaggttgaaatgaaagaaaaaatgctaagggcagtcagagagaaagtc
caggtcacctacaaagggaagcccatcagactaacaatgaacctctcagcagaagcccta
caagcaagaagagatttggggccaatattacacattcttaaagaaaagaatttccaaccc
agaatttcctatctggccaaagtaagctttataagtgaaggagaagacaagcaaatgctg
aaggattttgtcacctccaggcctgccttgcaagagctcctgaaggtagcactaaacatg
gaaaggaaaaaccatatccggccactacaaaaacacattgaaatacacagatcaacgaca
ctatga

>gi568815591r:20040949_20261862|GENSCAN_predicted_peptide_4|128_aa
MGNDFMSKTPKAMATKAKIDKWDLIKLNSFCTAKETTIRVNKQPTEWEKIFAIYSSDKGL
ISRIYNELKQIYKKKNKQPHQKVGEGYEQTLLKRRHLCSQKTYEKMLIITGHQRNANQNH
NEIPAHTS

>gi568815591r:20040949_20261862|GENSCAN_predicted_CDS_4|387_bp
atgggcaacgacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaacagcttctgcacagccaaagaaactaccatcagagtg
aacaagcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggcta
atatccagaatctacaatgaactcaaacaaatctacaagaaaaaaaacaaacagccccat
caaaaagtgggtgaaggatatgaacagacacttctgaaaagaagacatttatgcagccaa
aaaacatatgaaaaaatgctcatcatcactggccatcagcgaaatgcaaatcaaaaccac
aatgagataccagctcacaccagttag

>gi568815591r:20040949_20261862|GENSCAN_predicted_peptide_5|933_aa
MLITERKHFRSGRIAQSMSEANLIDMEAGKLSKSCNITECQDPDLLHNWPDAFTLRGNNA
SKVANPFWNQLSASNPFLDDITQLRNNRKRNNISILKEDPFLFCREIENGNSFDSSGDEL
DVHQLLRQTSSRNSGRSKSVSELLDILDDTAHAHQSIHNSDQILLHDLEWLKNDREAYKM
AWLSQRQLARSCLDLNTISQSPGWAQTQLAEVTIACKVNHQGGSVQLPESDITVHVPQGH
VAVGEFQEVSLRAFLDPPHMLNHDLSCTVSPLLEIMLGNLNTMEALLLEMKIGAEVRKDP
FSQVMTEMVCLHSLGKEGPFKVLSNCYIYKDTIQVKLIDLSQVMYLVVAAQAKALPSPAA
TIWDYIHKTTSIGIYGPKYIHPSFTVVLTVCGHNYMPGQLTISDIKKGGKNISPVVFQLW
GKQSFLLDKPQDLSISIFSCDPDFEVKTEGERKEIKQKQLEAGEVVHQQFLFSLVEHREM
HLFDFCVQVEPPNGEPVAQFSITTPDPTPNLKRLSNLPGYLQKKEEIKSAPLSPKILVKY
PTFQDKTLNFSNYGVTLKAVLRQSKIDYFLEYFKGDTIALLGEGKVKAIGQSKVKEWYVG
VLRGKIGLVHCKNVKVISKEQVMFMSDSVFTTRNLLEQIVLPLKKLTYIYSVVLTLVSEK
VYDWKVLADVLGYSHLSLEDFDQIQADKESEKVSYVIKKLKEDCHTERNTRKFLYELIVA
LLKMDCQELVARLIQEAAVLTSAVKLGKGWRELAEKLVRLTKQQMEAYEIPHRGNTGDVA
VEPGRQSQTPSPKQNKAKQQQQQQIAKCEDVMWCSAVFGNPPPDSPEHDSDFTWIQILQL
STNPSISIYGALIGVNPASVMVWEGSEVIRQRRSSAAKLRRSGKESESLGPEFQGLWEWL
PVANKYHDDIAVTKPGTHTSGTIELVAEGEEAL

>gi568815591r:20040949_20261862|GENSCAN_predicted_CDS_5|2802_bp
atgctaatcactgaaagaaaacattttcggtcaggaagaattgcacaaagtatgtctgaa
gcaaatttgattgacatggaagctggaaaactctcaaaaagttgcaatattacagaatgc
caggacccagacttgcttcacaattggccggatgctttcacccttcgtggtaataatgct
tccaaagttgcaaatccattctggaatcaactgtctgcttctaacccatttttggatgac
ataactcaactaagaaataacaggaagagaaataatatttccatcttaaaggaagatcct
tttcttttctgtagagaaatagaaaatggaaattcttttgattcctccggtgatgaactt
gatgtgcatcagttacttaggcagacttcctcaagaaattctggaagatctaaaagtgtt
tcagaacttctggacattttagacgacacagcacatgcccatcagagtatacataactct
gaccagatcctactacacgacttagagtggcttaaaaatgatcgggaggcttataaaatg
gcttggttaagtcaacgccagctggcccgctcctgccttgatttgaatacaattagtcag
agccctggatgggcccagacacaacttgcggaggtcaccatagcttgcaaagtaaaccat
caaggagggtcagtacaattacctgaatcagacatcactgttcatgtgccccaaggtcat
gtggctgtgggagaattccaagaggtgtctctaagggctttccttgatccgccacacatg
cttaaccatgatctttcgtgcactgtgagcccgttgttggaaatcatgttaggcaacctc
aatacaatggaagcccttttgctggagatgaaaattggggctgaagtaagaaaggatcct
ttcagccaagtcatgacagaaatggtgtgtttacacagcttgggtaaagaaggccctttt
aaagttttaagcaactgctacatttataaagacaccatccaagtcaagctaatcgacttg
agtcaggtaatgtatctagtggttgctgcacaagctaaagctcttccgtcaccagctgcc
accatttgggattatatccacaaaaccacctcaattggaatttatggacccaaatatatc
catcccagttttactgttgttttaacagtttgtggacacaattatatgccaggacagctt
acaatttctgatattaagaagggtggaaaaaacatatctccagttgtgtttcagctctgg
gggaagcagtcatttttacttgacaagccacaagatttaagtatttctattttttcctgt
gatcctgattttgaagtaaagacagaaggagaaaggaaagaaattaaacaaaagcagttg
gaagcaggtgaagtagttcatcaacaatttttattttctttagttgagcacagagagatg
cacttgtttgatttttgtgttcaagtggagcctcccaatggtgaaccagttgcacagttc
tctatcactactcctgatccaaccccaaacctaaaaagactctcgaatctgccaggctat
ttgcagaagaaggaggaaatcaagtctgctcctttatcaccaaaaattcttgttaaatat
cctacatttcaagataaaacattgaactttagcaactatggggtaaccctgaaggcagtg
ctaagacaaagcaagattgattacttccttgaatatttcaaaggggacacaatagctctc
ctcggggaaggtaaggtaaaagctattggtcagtccaaagtgaaagaatggtatgtagga
gtcctcagaggtaagattggacttgtacactgcaaaaatgtcaaggtgatttcaaaggag
caagtaatgtttatgtcagatagtgtctttacaaccagaaatcttcttgaacagattgtc
ctgcctttaaaaaaattgacttatatctactcagttgtattaaccttggtgtcagaaaaa
gtttatgattggaaagttttagctgatgtcctgggttactcacatctgtccctggaagat
tttgatcaaattcaagcagacaaagaatcagagaaagtttcttatgttataaagaagtta
aaggaagattgccacacagagagaaatacaaggaagtttctgtatgaacttattgtggct
cttctgaaaatggattgccaagagttagtcgcacgtctcatccaagaagctgctgttctg
acttcagctgtcaagcttggaaaaggctggagggaactagctgaaaagttagtacgactc
acaaagcaacaaatggaggcatatgaaattcctcatcgaggaaacactggagatgttgct
gttgagcctgggcgacagagccagactccgtctccaaaacaaaacaaagcaaagcaacaa
caacaacaacaaattgccaaatgtgaggacgttatgtggtgctctgcggtctttggaaat
cctcctccagattccccagagcatgacagtgacttcacgtggatacagattttgcagtta
tcaactaacccatcaattagcatttatggagcactaattggtgtcaacccagcatctgtg
atggtctgggaaggttccgaggtgattaggcagcgtcggtcttcagctgctaagctgaga
agatctgggaaggagtcagagagccttgggccagagttccaggggctctgggagtggctg
ccagtggctaataagtaccatgatgatatagctgtaacaaagccagggactcacacttca
ggaacaatagaattggtggcagaaggagaagaggctttatga

>gi568815591r:20040949_20261862|GENSCAN_predicted_peptide_6|527_aa
MISYKLHSDHRTQRSHLTAAPAIPGIKRGEEEHVTTGDSILGPTFYKNFTSAMIPAKRVT
GLITAEALECNRKNAVFENRQTTSHWQDLTKIRLAIKIWKGNLCGIRNLQHIKEEHRTEI
DAEGKELHPVIDNATAATEGSEVGSFDPPFENQQIFIAAAADDDDAASPPVMAKRDQGTA
QVIAPETASPKPWWLTHGVEPAAPGNYHSKYHLCEFDYSRYLSLYLQKGKALNLAIEPLT
VPEMKALLSATVEEVNLLDPADSYNEFCVDHQTKGLAGEDKGEAKHLEWQSDMAIWDQEL
GTADTKKQGQKRTHSCGEQRQLCKQPAVPDVTVVAVGCRVQGSWRWSCKLWFLGLSRNKE
PIQWHCFGGCSGFREISGELYTGNYKLFPESTHILPAHKSFAKTKHMVTPTHSGPSRFTH
RALTPITLKCIAKPLWAVSGQTLAPMGPVRLDLGRNEARLQSVLEGISEAAHTLVPSSPS
VGHQSPGSSASGLWDLHQGIPGGCQAFSHRLKGALSASLILRLSDLD

>gi568815591r:20040949_20261862|GENSCAN_predicted_CDS_6|1584_bp
atgattagttacaagcttcacagtgatcatagaacacaaagaagccacttaacagcagca
ccagcaatacctggaataaagaggggtgaagaggagcatgtaactactggggacagtatc
ctcggtcccaccttttacaaaaattttaccagtgccatgattcctgccaaaagagtgact
ggactcatcacagcagaggcactggagtgcaatagaaagaacgcagtctttgaaaataga
cagacgacctctcactggcaagatctaactaaaatacggctggcaataaaaatttggaaa
ggtaatctgtgcggtatcagaaacctacaacacataaaagaagaacacaggacagaaatc
gacgctgaaggcaaggagttgcatcctgttatagacaatgctactgctgctactgagggc
tccgaagttggaagctttgaccctcctttcgagaaccagcaaatatttattgctgctgct
gctgatgatgatgatgctgcttcacctccagtcatggctaaaagggatcaaggtacagct
caggtcattgctccagagactgcaagccccaagccttggtggcttacacatggtgttgag
cctgcagcccctggcaactaccattctaagtatcatttatgtgagtttgactactccaga
tacctcagcctttatttgcagaaaggcaaagcattaaatctagcaattgaacccttgaca
gtacccgaaatgaaagccctactgtcggccactgtggaggaagtgaaccttctagatcct
gctgactcatacaatgagttttgtgtggaccatcagacaaaagggctggctggtgaagat
aagggagaggcaaagcacttagaatggcaatcagacatggctatctgggaccaggaatta
ggaactgctgacacaaagaagcagggacagaagagaactcactcttgtggtgagcagagg
cagctgtgcaaacaacctgctgttccggatgtgactgtggtggccgtggggtgcagagtc
cagggctcatggcgttggtcatgcaaattgtggttcctgggccttagcagaaacaaggaa
cctattcagtggcattgctttgggggctgctctggcttcagggagatctctggtgaactt
tacacaggcaattataaactcttccctgaatcaacacacatccttcctgctcacaagtct
ttcgccaaaactaaacacatggtcacacccacccatagtgggccatcaaggttcacccac
agggcattaactcccataacactgaagtgcattgccaagcccctttgggcagtcagtgga
caaaccttggcccccatgggtcctgttagattggacctaggtagaaatgaagccagattg
cagagcgttctggagggaatatctgaagcagcacatacacttgttccctcttctccttca
gttggacatcagtctccaggttcttcagcctctggactctgggacttgcaccaggggatt
cctgggggctgtcaggccttcagccacagattgaagggtgcactgtcagcttccctgatt
ttgaggctttcagacttggactga