GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:22:00

Sequence gi568815585r:100439904_100640248 : 200345 bp : 44.01% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   9349   9402   54  0  0   97  103    22 0.397   3.78
 1.02 Term +  16741  16920  180  0  0   27   48   202 0.994   7.71
 1.03 PlyA +  20375  20380    6                               1.05

 2.00 Prom +  22394  22433   40                              -5.06
 2.01 Init +  23683  23754   72  0  0   33  113    66 0.481   4.67
 2.02 Term +  30090  30296  207  2  0   -6   41   230 0.485   6.24
 2.03 PlyA +  30727  30732    6                               1.05

 3.03 PlyA -  31348  31343    6                               1.05
 3.02 Term -  56753  56574  180  2  0   96   48   124 0.990   6.81
 3.01 Init -  59113  59030   84  1  0   50   86    86 0.799   5.42
 3.00 Prom -  70132  70093   40                              -3.16

 4.00 Prom +  70202  70241   40                              -5.16
 4.01 Init +  72942  72980   39  2  0   22   91    46 0.339  -1.46
 4.02 Intr +  75524  75664  141  1  0   60   91    88 0.501   6.85
 4.03 Intr +  79773  79820   48  2  0   77   77    40 0.449   0.68
 4.04 Intr +  87772  87849   78  0  0  131  113    53 0.690  11.85
 4.05 Term +  90195  90263   69  2  0   53   49    69 0.306  -2.56
 4.06 PlyA +  90509  90514    6                               1.05

 5.08 PlyA -  90668  90663    6                               1.05
 5.07 Term -  92698  92227  472  0  1  139   49   861 0.997  81.70
 5.06 Intr -  93871  93660  212  1  2   63   37   100 0.668   0.11
 5.05 Intr -  95012  94832  181  1  1   93   47    50 0.596   1.27
 5.04 Intr - 100380 100002  379  1  1    1   75   389 0.009  22.92
 5.03 Intr - 104144 104029  116  1  2   49   69   104 0.058   4.69
 5.02 Intr - 113640 113526  115  1  1  114   74    96 0.054  10.31
 5.01 Init - 129613 129535   79  1  1   61   77    77 0.007   3.15
 5.00 Prom - 132923 132884   40                              -3.76

 6.16 PlyA - 133072 133067    6                               1.05
 6.15 Term - 148784 148508  277  1  1   90   48   102 0.285   1.33
 6.14 Intr - 157241 157163   79  0  1   90   66    39 0.007   0.51
 6.13 Intr - 166524 166455   70  0  1   95   86    27 0.288   1.95
 6.12 Intr - 172607 172495  113  0  2  109  100    56 0.702   9.00
 6.11 Intr - 174527 174413  115  2  1   69   92   119 0.956  10.42
 6.10 Intr - 184598 184506   93  2  0  -14   86   160 0.973   5.76
 6.09 Intr - 185773 185632  142  0  1   75  107    74 0.999   8.36
 6.08 Intr - 185989 185882  108  0  0   90   72    75 0.978   5.50
 6.07 Intr - 186247 186168   80  1  2  110   86   184 0.976  18.75
 6.06 Intr - 195033 194902  132  0  0   63   77    75 0.913   4.74
 6.05 Intr - 195292 195121  172  0  1   57   64   156 0.767  10.05
 6.04 Intr - 196831 196629  203  1  2   54  116   109 0.985   8.38
 6.03 Intr - 197799 197635  165  0  0   74   96   185 0.868  18.06
 6.02 Intr - 198119 198027   93  2  0  104  105    91 0.976  12.56
 6.01 Intr - 198841 198778   64  0  1   76   88    52 0.726   2.72

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  66331  66219  113  2  2   66   48   112 0.923   3.72
S.002 Init - 113670 113641   30  0  0  114   97    18 0.833   3.45

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:100439904_100640248|GENSCAN_predicted_peptide_1|77_aa
CLSREAGGNMSIQFLGTVVAEAEREKAAYVSVFFYALIRKIKDFKTFTISSTAIYYELQR
GTRSYGRNMKVDKERDR

>gi568815585r:100439904_100640248|GENSCAN_predicted_CDS_1|234_bp
tgtctttctcgagaagcaggtggaaacatgagcattcagtttcttggtacagtggtagcc
gaagctgagagagagaaggcagcatacgtcagcgttttcttctatgcacttataagaaag
atcaaagactttaagacttttactatttcttctaccgctatctactacgagcttcaaaga
ggaaccaggagttacgggaggaacatgaaagtggacaaggagcgtgaccgttga

>gi568815585r:100439904_100640248|GENSCAN_predicted_peptide_2|92_aa
MPRDIYDIREISGLESKKKIKSEHVKNKTIPLTDNTVVAEHLGKYFQETSWFLRRFHLSV
AHHATKNRVGFLEMGTPGYRGEHVNQLIRQLN

>gi568815585r:100439904_100640248|GENSCAN_predicted_CDS_2|279_bp
atgccaagagacatctatgatataagggaaatttctggtctggagagcaagaagaagatt
aagagtgagcacgtcaagaataagaccatccctctgacagacaacacagtggttgcggag
cacctggggaagtatttccaggagacttcatggttcttgcgtcgtttccacctctcagtg
gcccatcatgctaccaaaaatagagtgggcttcctcgagatgggcacacctggctatcgg
ggtgaacacgtcaatcagctcatccgccagctgaactag

>gi568815585r:100439904_100640248|GENSCAN_predicted_peptide_3|87_aa
MRVEVLSSTAKFSEPENQINALLKGFDKINGKRKVVTSEWRKGQTPPEPNDQSDAINSGT
KRNVNMRLPRGDNEDNAAAPPLRSCQK

>gi568815585r:100439904_100640248|GENSCAN_predicted_CDS_3|264_bp
atgcgggtggaagtcttgtcttccactgccaagttctcagaaccagagaaccaaatcaat
gccttgttaaaaggttttgacaagataaatggaaaaaggaaagtagtaacttcagagtgg
agaaagggacaaacaccacctgaaccaaatgatcaaagtgatgccattaacagtgggaca
aagagaaacgtgaacatgcgcctcccgagaggggacaacgaggacaacgcagcagcacca
ccgttacgttcctgccaaaaatga

>gi568815585r:100439904_100640248|GENSCAN_predicted_peptide_4|124_aa
MPTVKPSLYPTVQYKVNILTRLAAELNKFMLEKVTEDTSSVLRSPMPGVVVAVSVKPGDA
PHVTELIADHTELANLVAEGQEICVIEAMKMQNSMTAGKTGTVKSVHCQAGDTVGEGDLL
VELE

>gi568815585r:100439904_100640248|GENSCAN_predicted_CDS_4|375_bp
atgcccacggtgaaaccgagcttgtaccccacggtccagtacaaggtgaatatcttaacc
agacttgccgcagaattgaacaaatttatgctggaaaaagtgactgaggacacaagcagt
gttctgcgttccccgatgcccggagtggtggtggccgtctctgtcaagcctggagacgcg
cctcatgtaactgagcttatagccgaccatacagaactggcaaacttggtagcagaaggt
caagaaatttgtgtgattgaagccatgaaaatgcagaatagtatgacagctgggaaaact
ggcacggtgaaatctgtgcactgtcaagctggagacacagttggagaaggggatctgctc
gtggagctggaatga

>gi568815585r:100439904_100640248|GENSCAN_predicted_peptide_5|517_aa
MQPRDMVPPCLPVAAAPAVAKRVQCKGKLVLQCVPLSGRSQVVLLMVNAIVLLPLFILVA
IEYPRMAAFAQGWHSVGLLVVTEGLEQQLVSLKPVYSRDHHALQKPTLSPVRASKMTKKR
RNNGRAKKGRGHVQPIRCTNCARCVPKDNAIKKFVIRNIVEAAAVRDISEASVFDAYVLP
KLYVKLHYCVSCAIHSKVVRNRSREARKDRTPPPRFRPAGAAPRPPPKPMSGAGRGRSHL
DEGREGAGDEVGELSLQQEIDKDPLGGALGAGSRGTKTTGMQAGIGGGEQSVAQTQASGK
YLPSRWSQDRCGESREAPLPSKTLEGETPEGVGAELRCVRLPGPEWNSVLKPGFSLQLDS
SSARMALVFVYGTLKRGQPNHRVLRDGAHGSAAFRARGRTLEPYPLVIAGEHNIPWLLHL
PGSGRLVEGEVYAVDERMLRFLDDFESCPALYQRTVLRVQLLEDRAPGAEEPPAPTAVQC
FVYSRATFPPEWAQLPHHDSYDSEGPHGLRYNPRENR

>gi568815585r:100439904_100640248|GENSCAN_predicted_CDS_5|1554_bp
atgcagcctcgggacatggtgcccccctgcctcccagttgctgcagctccagctgtggct
aaaagggtccagtgtaaaggtaaacttgtcctgcagtgtgtgcccttgtctggccgctct
caggtggtgctgctcatggtgaatgccatcgtgttgctgcccctattcattcttgttgcc
atagagtatcccagaatggctgcgtttgctcagggctggcacagtgtggggctgctggtg
gtcacagaaggactagagcagcagttggtgtccttgaaaccagtgtacagccgtgatcac
catgcccttcaaaagcccactctctctccggtccgtgcctccaagatgacaaagaaaaga
aggaacaatggtcgtgccaaaaagggccgcggccacgtgcagcctattcgctgcactaac
tgtgcccgatgcgtgcccaaggacaacgccattaagaaattcgtcattcgaaacatagtg
gaggccgcagcagtcagggacatttctgaagcgagcgtcttcgatgcctatgtgcttccc
aagctgtatgtgaagctacattactgtgtgagttgtgcaattcacagcaaagtagtcagg
aatcgatctcgtgaagcccgcaaggaccgaacacccccacctcgatttagacctgcgggt
gctgccccacgtcccccaccaaagcccatgtcaggcgctggtcggggcaggagccacctg
gatgaggggagagagggtgcaggagatgaggtgggagagttaagtctgcagcaggagatt
gacaaggaccctctgggtggtgcactgggggcaggatccaggggaacaaagacaacaggg
atgcaggcaggcattggaggaggagagcagtctgtggcccagacccaggcaagtggtaaa
tatttaccttcacgctggagccaagatcgctgcggggagtcccgtgaagcaccactgccc
tctaagaccttggaaggggaaacaccagaaggtgtgggtgctgagctccgctgcgtcaga
ctgccaggacctgagtggaactcagtgctgaaacctgggttctcactgcagctggatagc
agctctgcccggatggccctagtcttcgtgtacggcaccctgaagcggggtcagcccaac
cacagggtcctgcgggacggcgcccacggctccgcagcctttcgggcgcgcggccgcacg
ctggagccctacccgttggtgatcgcgggggagcacaacatcccgtggctgctgcacctg
cccggctcggggcgcctcgtggagggcgaggtctacgcggtagacgagcggatgctgcgc
tttctggatgacttcgagagttgcccggccctgtaccagcgcacggtgctgcgggtacag
ctgctggaggaccgggccccgggcgcagaggagccgccagcgcccaccgcggtgcagtgc
ttcgtgtacagcagggccaccttcccgccggagtgggcccagctcccgcaccatgacagc
tacgactccgaggggccgcacgggctgcgctacaacccccgggagaacagataa

>gi568815585r:100439904_100640248|GENSCAN_predicted_peptide_6|635_aa
XQLEGETKWECVPKFYNQTSKMGLNAVFDILVIGKFNVLEIVQKVLHKDKSLENLGMLRN
GGLLFRMTLLTSGGAGMLYVRWRIMGTGPPAFTEVDNPASFADSMLVRAVNYNYYYSLNA
WLLLCPWWLCFDWSMGCIPLIKSISDWRVIALAALWFCLIGLICQALCSEDGHKRRILTL
GLGFLVIPFLPASNLFFRVGFVVAERVLYLPSVGYCVLLTFGFGALSKHTKKKKLIAAVV
LGILFINTLRCVLRSGEWRSEEQLFRSALSVCPLNAKVHYNIGKNLADKGNQTAAIRYYR
EAVRLNPKYVHAMNNLGNILKERNELQEAEELLSLAVQIQPDFAAAWMNLGIVQNSLKRF
EAAEQSYRTAIKHRRKYPDCYYNLGRLLLVDGFQGYFMELERLAAQALRAGGPNGMATYA
DLNRHVDALNAWRNATVLKPEHSLAWNNMIILLDNTGNLAQAEAVGREALELIPNDHSLM
FSLANVLGKSQKYKESEALFLKAIKANPNAASYHGNLDYSWEIKTSACRVRVLTKALPKT
TSSRRGVLAPAGTWGVAPCAASGSRPQHSASPRVREEAWGAAGRRGQPEGGDGGPRNASS
IAPRGGHGRDCQNGQRAVKARNRLTHGRCFPVKEA

>gi568815585r:100439904_100640248|GENSCAN_predicted_CDS_6|1908_bp
nnccagctggaaggggaaaccaagtgggaatgtgtgcccaagttttacaaccaaacatcc
aaaatgggtttaaatgcggtatttgacatcttggtgataggcaaattcaatgttctggaa
attgtccagaaggtactacataaggacaagtcattagagaatctcggcatgctcaggaac
gggggcctcctcttcagaatgaccctgctcacctctggaggggctgggatgctctacgtg
cgctggaggatcatgggcacgggcccgccggccttcaccgaggtggacaacccggcctcc
tttgctgacagcatgctggtgagggccgtaaactacaattactactattcattgaatgcc
tggctgctgctgtgtccctggtggctgtgttttgattggtcaatgggctgcatccccctc
attaagtccatcagcgactggagggtaattgcacttgcagcactctggttctgcctaatt
ggcctgatatgccaagccctgtgctctgaagacggccacaagagaaggatccttactctg
ggcctgggatttctcgttatcccatttctccccgcgagtaacctgttcttccgagtgggc
ttcgtggtcgcagagcgtgtcctctacctccccagcgttgggtactgtgtgctgctgact
tttggattcggagccctgagcaaacataccaagaaaaagaaactcattgccgctgtcgtg
ctgggaatcttattcatcaacacgctgagatgtgtgctgcgcagcggcgagtggcggagt
gaggaacagcttttcagaagtgctctgtctgtgtgtcccctcaatgctaaggttcactac
aacattggcaaaaacctggctgataaaggcaaccagacagctgccatcagatactaccgg
gaagctgtaagattaaatcccaagtatgttcatgccatgaataatcttggaaatatctta
aaagaaaggaatgagctacaggaagctgaggagctgctgtctttggctgttcaaatacag
ccagactttgccgctgcgtggatgaatctaggcatagtgcagaatagcctgaaacggttt
gaagcagcagagcaaagttaccggacagcaattaaacacagaaggaaatacccagactgt
tactacaacctcgggcgtctgctgcttgtcgatggctttcaaggttacttcatggagctg
gagcgcttggctgcgcaggccctgagggctggcggtcccaatggcatggccacgtatgca
gatctcaatcgccacgtggatgccttgaatgcgtggagaaatgccaccgtgctgaaacca
gagcacagcctggcctggaacaacatgattatactcctcgacaatacaggtaatttagcc
caagctgaagcagttggaagagaggcactggaattaatacctaatgatcactctctcatg
ttctcgttggcaaacgtgctggggaaatcccagaaatacaaggaatctgaagctttattc
ctcaaggcaattaaagcaaatccaaatgctgcaagttaccatggtaatttggattattct
tgggaaattaaaacgtcagcatgtcgagtcagagtcttaacaaaggctctgccaaagacg
acatccagcaggcggggtgtcctggcgcccgcgggaacctggggcgtcgctccctgcgcg
gcgtccgggtcgaggccgcagcactcggcgtccccgcgggtgagggaggaggcctggggc
gccgccggccggcgaggccagccggagggaggggacggtggaccccgaaacgcgtcctcc
attgccccgcgcgggggccacgggcgcgattgccagaacgggcagagggcggtaaaagcc
aggaaccgcctcacgcacgggcgctgcttccctgtaaaggaagcgtga