GENSCAN 1.0	Date run:  3-Nov-116	Time: 01:24:18

Sequence gi568815578f:32209778_32431316 : 221539 bp : 44.10% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    294    415  122  1  2   98   92    95 0.518  11.11
 1.02 Intr +   5492   5674  183  1  0  102   98   229 0.981  25.28
 1.03 Intr +   6832   6944  113  0  2  118   65    -6 0.075  -0.72
 1.04 Intr +  18486  18678  193  0  1  108   28   150 0.870  10.49
 1.05 Intr +  21042  21284  243  2  0   95   95   289 0.960  27.99
 1.06 Term +  24696  24884  189  2  0  130   37   242 0.999  20.75
 1.07 PlyA +  27539  27544    6                               1.05

 2.00 Prom +  28827  28866   40                              -5.26
 2.01 Init +  38340  38441  102  2  0   21   70   151 0.086   4.88
 2.02 Intr +  46708  46729   22  0  1  119   67     4 0.010  -1.48
 2.03 Intr +  55741  55805   65  2  2  109   67    44 0.127   2.84
 2.04 Intr +  62618  62706   89  1  2   52  100    51 0.159   1.47
 2.05 Intr +  67898  67988   91  2  1   41  109    94 0.435   6.80
 2.06 Term +  71219  71242   24  1  0  111   43    -5 0.084  -4.48
 2.07 PlyA +  73258  73263    6                               1.05

 3.00 Prom +  81115  81154   40                              -3.36
 3.01 Sngl + 100001 101482 1482  1  0   53   41  1667 0.910 154.01
 3.02 PlyA + 101759 101764    6                              -0.45

 4.00 Prom + 101956 101995   40                              -8.06
 4.01 Init + 102213 102261   49  2  1   86   89    63 0.942   5.31
 4.02 Intr + 106370 106542  173  0  2   99   93   112 0.951  12.46
 4.03 Intr + 106750 106872  123  0  0   52   86   144 0.997  11.38
 4.04 Intr + 106979 107097  119  1  2  125  108   153 0.950  20.26
 4.05 Intr + 116994 117107  114  0  0  104   83    12 0.719   1.86
 4.06 Intr + 117779 117884  106  2  1   35   76   110 0.468   4.72
 4.07 Intr + 120364 120542  179  0  2   68  109   206 0.920  19.42
 4.08 Intr + 148258 148348   91  1  1   60   63    31 0.166  -2.20
 4.09 Intr + 148927 149055  129  0  0   68   81   111 0.342   9.29
 4.10 Intr + 156607 156689   83  0  2   71   68    84 0.138   3.14
 4.11 Intr + 159238 159346  109  1  1   86  115   -40 0.078  -1.21
 4.12 Intr + 218351 218471  121  1  1   78   91    70 0.489   6.37
 4.13 Intr + 218548 218645   98  2  2  109  110    -8 0.954   3.23
 4.14 Intr + 219561 219654   94  2  1   76   91    46 0.951   3.24
 4.15 Intr + 220124 220276  153  0  0   75  109   243 0.999  25.14

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  13359  13453   95  2  2   81   84    32 0.810   1.96
S.002 Term - 166418 166318  101  0  2   77   42    79 0.895   0.49

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:32209778_32431316|GENSCAN_predicted_peptide_1|347_aa
XRFGNQADHFLGSLAFAKLLNRTLAVPPWIEYQHHKPPFTNLHVSYQKYFKLEPLQAYHR
VISLEDFMEKLAPTHWPPEKRVAYCFEVAAQRSPDKKTCPMKEGNPFGPFWDQFHVSFNK
SELFTGISFSASYREQWSQRFSPKEHPVLALPGAPAQFPVLEEHRPLQKYMVWSDEMVKT
GEAQIHAHLVRPYVGIHLRIGSDWKNACAMLKDGTAGSHFMASPQCVGYSRSTAAPLTMT
MCLPDLKEIQRAVKLWVRSLDAQSVYVATDSESYVPELQQLFKGKVKVVSLKPEVAQVDL
YILGQADHFIGNCVSSFTAFVKRERDLQGRPSSFFGMDRPPKLRDEF
>gi568815578f:32209778_32431316|GENSCAN_predicted_CDS_1|1044_bp
nggcgctttgggaaccaggccgatcacttcttgggctctctggcatttgcaaagctgcta
aaccgtaccttggctgtccctccttggattgagtaccagcatcacaagcctcctttcacc
aacctccatgtgtcctaccagaagtacttcaagctggagcccctccaggcttaccatcgg
gtcatcagcttggaggatttcatggagaagctggcacccacccactggccccctgagaag
cgggtggcatactgctttgaggtggcagcccagcgaagcccagataagaagacgtgcccc
atgaaggaaggaaacccctttggcccattctgggatcagtttcatgtgagtttcaacaag
tcggagctttttacaggcatttccttcagtgcttcctacagagaacaatggagccagaga
ttttctccaaaggaacatccggtgcttgccctgccaggagccccagcccagttccccgtc
ctagaggaacacaggccactacagaagtacatggtatggtcagacgaaatggtgaagacg
ggagaggcccagattcatgcccaccttgtccggccctatgtgggcattcatctgcgcatt
ggctctgactggaagaacgcctgtgccatgctgaaggacgggactgcaggctcgcacttc
atggcctctccgcagtgtgtgggctacagccgcagcacagcggcccccctcacgatgact
atgtgcctgcctgacctgaaggagatccagagggctgtgaagctctgggtgaggtcgctg
gatgcccagtcggtctacgttgctactgattccgagagttatgtgcctgagctccaacag
ctcttcaaagggaaggtgaaggtggtgagcctgaagcctgaggtggcccaggtcgacctg
tacatcctcggccaagccgaccactttattggcaactgtgtctcctccttcactgccttt
gtgaagcgggagcgggacctccaggggaggccgtcttctttcttcggcatggacaggccc
cctaagctgcgggacgagttctga
>gi568815578f:32209778_32431316|GENSCAN_predicted_peptide_2|130_aa
MAGKLGLTIGRRPLFLSMGLLEGLCNIVAVYPQRYLEPNPLAPSNYHSIFYLHEFDYSKY
FIQVGDGHVIKDQPIKAWVIEVETVGNPYGEGSGEWLSQGFAAPAAAAAAAAAAAARFRL
GPQTSVRCLD
>gi568815578f:32209778_32431316|GENSCAN_predicted_CDS_2|393_bp
atggctggcaagttggggctgactattggccggaggcctctgttcctctccatggggttg
ctggaaggtctttgcaacatcgtggctgtctacccccagagatacctggagccaaaccct
ttagcacctagcaactaccactctattttctatctccacgaatttgactactctaagtac
ttcatacaggtaggagatgggcatgtgatcaaggatcaaccaattaaggcttgggttata
gaggtcgagactgtggggaatccttatggagagggaagcggggaatggctgagccagggg
ttcgccgcccccgccgccgccgccgccgccgccgccgccgccgccgcccgctttcggctc
gggcctcagacttctgttaggtgcctggattag
>gi568815578f:32209778_32431316|GENSCAN_predicted_peptide_3|493_aa
MSKLKSSESVRVVVRCRPMNGKEKAASYDKVVDVDVKLGQVSVKNPKGTAHEMPKTFTFD
AVYDWNAKQFELYDETFRPLVDSVLQGFNGTIFAYGQTGTGKTYTMEGIRGDPEKRGVIP
NSFDHIFTHISRSQNQQYLVRASYLEIYQEEIRDLLSKDQTKRLELKERPDTGVYVKDLS
SFVTKSVKEIEHVMNVGNQNRSVGATNMNEHSSRSHAIFVITIECSEVGLDGENHIRVGK
LNLVDLAGSERQAKTGAQGERLKEATKINLSLSALGNVISALVDGKSTHIPYRDSKLTRL
LQDSLGGNAKTVMVANVGPASYNVEETLTTLRYANRAKNIKNKPRVNEDPKDALLREFQE
EIARLKAQLEKRSIGRRKRREKRREGGGSGGGGEEEEEEGEEGEEEGDDKDDYWREQQEK
LEIEKRAIVEDHSLVAEEKMRLLKEKEKKMEDLRREKDAAEMLGAKIKVPYPYPSLGPCP
VTAFAFIKQQQKT
>gi568815578f:32209778_32431316|GENSCAN_predicted_CDS_3|1482_bp
atgtcaaagttgaaaagctcagagtcagtcagggtggtggttcgctgtcggcccatgaat
ggcaaggaaaaggctgcttcgtatgacaaagtggtggatgtggatgttaagctggggcag
gtgtctgtgaagaaccccaaagggacggcccatgaaatgcccaagaccttcacctttgat
gccgtctatgactggaatgccaagcagtttgaactgtacgatgagacgttccgaccactt
gttgactctgtcctgcaaggtttcaatggaaccatttttgcctatggacaaactgggaca
ggaaaaacctacaccatggaaggaatccgtggtgaccctgaaaaaagaggagtcattcct
aactcatttgaccatatcttcacccacatctctcgatcccagaatcaacaatacctggtc
agggcttcttacttagagatctaccaggaggagatccgagatttgctctcaaaggatcag
accaaaaggcttgagctcaaagagaggcctgacacaggagtgtatgtgaaagacctgtct
tcctttgtcaccaagagtgtgaaggagatagagcatgtgatgaatgtggggaaccagaac
cgttctgtcggtgctaccaacatgaacgagcacagctcgcgttctcatgcaattttcgtt
atcactattgagtgcagcgaggtgggcctcgatggtgaaaaccacatccgtgtaggaaaa
ttgaaccttgtagatcttgctggcagcgaacggcaagccaagaccggcgcacaaggggag
agattaaaagaagctaccaagatcaacctctccctttccgctttgggtaatgtcatctct
gctctagtggacggcaaaagcactcacattccatatcgggactcaaagcttaccaggctc
ctccaagattcccttggtggcaatgccaagactgtgatggtggccaacgtggggcctgcc
tcttacaacgtagaagagactctgaccactctgcgatatgccaaccgtgccaaaaacatt
aagaacaaaccaagggtcaatgaggaccccaaggatgccctccttcgagaattccaggaa
gagattgctcggctcaaggcccagctggaaaaacggtccattggtaggaggaagaggcga
gagaagcggagggaaggtggtggcagtggtgggggtggggaagaggaggaggaggaggga
gaagagggtgaggaggaaggggatgataaggatgattactggcgggaacagcaagaaaaa
ctggagattgagaagcgggccattgtagaggatcacagcttggttgcagaggagaagatg
aggctgctgaaggagaaagagaaaaagatggaggacctgcggcgggagaaggatgctgcc
gagatgctgggcgccaagatcaaggtaccatacccgtacccttccttaggcccttgccct
gtcactgcttttgctttcatcaaacaacaacaaaaaacataa
>gi568815578f:32209778_32431316|GENSCAN_predicted_peptide_4|581_aa
MGFHYVGLAGLELLTSGSPVRNYVLRETVHEMDSTLFLTQAMESKLLVGGKNIVDHTNEQ
QKILEQKRQEIAEQKRREREIQQQMESRDEETLELKETYSSLQQEVDIKTKKLKKLFSKL
QAVKAEIHDLQEEHIKERQELEQTQNELTRELKLKHLIIENFIPLEEKSKIMNRAFFDEE
EDHWKLHPITRLENQQMMKRPVSAVGYKRPLSQHARMSMMIRPEARYRAENIVLLELDMP
SRTTRDYEGPAIAPKVQAALDAALQDEDEIQVDASSFESTANKKSKARIPQAPSQFSHLA
RLHPAPSPNSDPSPFLAEPRATAPARPARRSRVELPPPPGRRMKDKQKKKKERTWAEAAR
LVLENYSDAPMTPKQILQVIEAEGLKEMSGTSPLACLNAMLHSNSRGGEGLFYKLPGRIS
LFTLKKDALQWSRHPATVEGEEPEDTADVESCGSNEASTVSGENDVSLDETSSNASCSTE
SQSRPLSNPRDSYRASSQANKQKKKTGVMLPRVVLTPLKVNGAHVESASGFSGCHADGES
GSPSSSSSGSLALGSAAIRGQAEVTQDPAPLLRGFRKPATX
>gi568815578f:32209778_32431316|GENSCAN_predicted_CDS_4|1743_bp
atggggtttcactatgttggcctggctggtctcgaactcctgacctcaggctcccctgtg
aggaattatgttctgagagaaaccgttcatgagatggattctactttgtttctcactcag
gccatggagagtaagttgcttgttggaggaaaaaatatagtagatcatacgaatgaacag
cagaaaatcctggagcagaaacgacaggaaattgcagagcagaaacgtcgagaaagagaa
atccagcaacagatggaaagtcgagatgaggagaccttggaacttaaagagacatacagc
tcattgcagcaagaggtggacatcaagaccaaaaaactcaaaaagctcttctccaagctt
caggcagtgaaggctgagatccatgacctccaagaagaacacatcaaggagcgccaagag
ctagagcagactcagaatgagctcaccagggagctgaaactcaagcatcttattatagaa
aactttatccctctggaagaaaaaagtaaaattatgaatagagccttctttgatgaagag
gaagatcattggaaactacatcctataaccagactggagaaccagcagatgatgaagcgg
ccagtctcagccgtgggatataagagaccattgagccagcacgcaagaatgtccatgatg
attcgtccagaggcccgatatagggcagaaaacattgtgctgttagagctggacatgccc
agccggaccaccagagactatgagggtccagccattgcccccaaggtccaggctgcattg
gatgcggctctgcaggatgaagatgagatacaggtggatgcatcatcatttgaaagcact
gcaaataagaaatccaaggccagaatcccgcaggcccccagccagttctcgcacctcgcg
agactccacccagccccttccccgaacagcgacccgtccccgttcctagctgagccccgc
gccaccgccccagcccgcccagcccggaggtcccgcgtggagctgccgccgccgccgggg
agaaggatgaaggacaaacagaagaagaagaaggagcgcacgtgggccgaggccgcgcgc
ctggtattagaaaactactcggatgctccaatgacaccaaaacagattctgcaggtcata
gaggcagaaggactaaaggaaatgagtgggacttcccctctcgcatgcctcaatgctatg
ctacattccaattcaagaggaggagaggggttgttttataaactgcctggccgaatcagc
cttttcacgctcaagaaggatgccctgcagtggtctcgccatccagctacagtggaggga
gaggagccagaggacacggctgatgtggagagctgtgggtctaatgaagccagcactgtg
agtggtgaaaacgatgtatctcttgatgaaacatcttcgaacgcatcctgttctacagaa
tctcagagtcgacctctttccaatcccagggacagctacagagcttcctcacaggcgaac
aaacaaaagaaaaagactggggtgatgctgcctcgagttgtcctgactcctctgaaggta
aacggggcccacgtggaatctgcatcagggttctcgggctgccacgccgatggcgagagc
ggcagcccgtccagcagcagcagcggctctctggccctgggcagcgctgctattcgtggc
caggccgaggtcacccaggaccctgccccgctcctgagaggcttccggaagccagccaca
gnn