GENSCAN 1.0	Date run: 5-Nov-116	Time: 01:00:36

Sequence gi568815594f:68276323_68476501 : 200179 bp : 37.66% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2269   2409  141  0  0    9   33   165 0.510   2.73
 1.02 Intr +   6494   6553   60  1  0   62   79    93 0.087   3.81
 1.03 Intr +  24349  24639  291  0  0  -15   61   214 0.626   4.81
 1.04 Intr +  25988  26137  150  1  0  119   92    84 0.983  11.34
 1.05 Intr +  28365  28529  165  2  0  -12  105   124 0.719   3.34
 1.06 Term +  33111  33122   12  2  0  106   45     7 0.289  -4.57
 1.07 PlyA +  35072  35077    6                               1.05

 2.18 PlyA -  35322  35317    6                               1.05
 2.17 Term -  38001  37777  225  0  0  119   39   253 0.982  19.30
 2.16 Intr -  40126  39992  135  1  0   75   86    82 0.765   6.54
 2.15 Intr -  42540  42504   37  2  1   47  111    43 0.759  -0.15
 2.14 Intr -  46593  46427  167  0  2   95   72   151 0.991  12.04
 2.13 Intr -  53198  53068  131  0  2   78   59    34 0.543  -1.01
 2.12 Intr -  53988  53880  109  0  1   56  116    32 0.635   1.74
 2.11 Intr -  55875  55781   95  1  2  119   62    53 0.977   4.56
 2.10 Intr -  56525  56472   54  0  0   36   64    98 0.538   0.23
 2.09 Intr -  57075  56986   90  1  0   80   66    59 0.793   1.95
 2.08 Intr -  60218  60126   93  0  0   54  110    43 0.807   2.12
 2.07 Intr -  61128  60705  424  0  1   73   79   887 0.986  78.81
 2.06 Intr -  61578  61250  329  1  2   59   52   423 0.979  30.29
 2.05 Intr -  62062  61961  102  2  0   80   75   142 0.992  11.33
 2.04 Intr -  73535  73404  132  0  0   27   96   215 0.989  15.90
 2.03 Intr -  96210  96133   78  1  0   85   58    50 0.001   0.30
 2.02 Intr - 100146  99931  216  1  0   75   -6   203 0.012   7.15
 2.01 Init - 112227 112206   22  0  1  100  101    29 0.442   5.30
 2.00 Prom - 114777 114738   40                              -5.05

 3.00 Prom + 119563 119602   40                              -3.15
 3.01 Init + 124491 124585   95  2  2   56   64    64 0.550   0.70
 3.02 Term + 130143 130356  214  0  1   76   38   243 0.961  13.82
 3.03 PlyA + 131556 131561    6                               1.05

 4.00 Prom + 134915 134954   40                              -4.95
 4.01 Init + 144997 145289  293  0  2   39   36   286 0.573  15.17
 4.02 Intr + 147738 147835   98  0  2   46   40   107 0.041   0.43
 4.03 Term + 148134 148708  575  1  2   21   42   257 0.548   8.03
 4.04 PlyA + 148769 148774    6                               1.05

 5.00 Prom + 148886 148925   40                              -5.65
 5.01 Init + 148940 149355  416  1  2   71   38   171 0.451   6.39
 5.02 Intr + 159807 159901   95  0  2  105   95   -23 0.458  -1.21
 5.03 Intr + 171108 171201   94  1  1   67  109    43 0.010   2.40
 5.04 Intr + 185499 185623  125  0  2  106   92    44 0.971   5.91
 5.05 Intr + 190309 190430  122  2  2  115  103    40 0.997   7.49
 5.06 Intr + 192557 192624   68  1  2   70   91    25 0.898  -2.22
 5.07 Intr + 195138 195301  164  0  2   88  106    40 0.777   4.50
 5.08 Intr + 198401 198439   39  0  0  108   87    18 0.614   0.98
 5.09 Intr + 199939 200116  178  2  1  129  111    90 0.779  13.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:68276323_68476501|GENSCAN_predicted_peptide_1|272_aa
LLVAEEASGNLQSRWKVKKKQAPSSQGSKREKRRGGELPNTLKPSDLISEEEAPKPTGEV
SEGYVKKNILWEYSGYTEKFFKDVVLNKKLMTNLQESRSDVVHANAIGPFGELLAELLKI
SFVYSLHFSPGYTFEKYSGGFLLPPSYGAVILSELSGSMTFMETENALQLSEIMGKAEMW
LIRNYWYLEFPRPLLPNFEFVVRLYCKPVNPLPKFGSTFPDTNFYVAHNSVGTVRCLLQA
GSLTLPHTSIDPRELSTVKAHQPSWDFEELLP

>gi568815594f:68276323_68476501|GENSCAN_predicted_CDS_1|819_bp
ttactcgtggctgaggaagcctcaggaaacttacaatcccggtggaaggtgaagaagaag
caagcaccttcttcacaaggcagcaagagagagaaaaggagagggggtgaactgccaaac
actttaaaaccatcagatcttatctcagaggaagaagcacctaagcccactggggaggta
agtgaaggctacgtgaaaaagaatattctttgggaatattctggttatactgagaagttc
tttaaagatgtagttttgaacaagaaacttatgacaaacctacaagaatcaaggtctgat
gtcgttcatgcaaatgccattggtccctttggagagctgctggctgagctattaaaaata
tcctttgtgtacagtctccacttctctcctggctacacatttgagaaatacagtggagga
tttctacttccaccttcctatggagctgttattctgtcagaattaagtggttcgatgaca
ttcatggagacagaaaatgcactacaattatctgagataatgggaaaagctgaaatgtgg
ctcattcgaaactactggtatttggaatttcctcgcccactcttacctaattttgaattt
gttgtaagactctactgcaaacctgtcaaccccctgcctaagtttgggagtacgtttcct
gacacaaatttctatgtggctcacaatagcgtcggaactgtgagatgcctacttcaagct
ggctccttgactcttccacacacttcgattgaccctcgggaactgagtacagtgaaagct
catcaaccttcatgggattttgaggagcttttaccttga

>gi568815594f:68276323_68476501|GENSCAN_predicted_peptide_2|812_aa
MERQCLKYAALGTLGTAHRTAGAAAFLAGGAFALFALAGAGAGAGVTGGTGAVGIHGELK
RRLESGQVVHGLAGTWRRRGCHYTHFRPVSLKVSRRRFDMQTKVRYGLTDGPVLLPRLRR
PRESRLSGGGGGSGGNRRGAMAADSREEKDGELNVLDDILTEVPEQDDELYNPESEQDKN
EKKGSKRKSDRMESTDTKRQKPSVHSRQLVSKPLSSSVSNNKRIVSTKGKSATEYKNEEY
QRSERNKRLDADRKIRLSSSASREPYKNQPEKTCVRKRDPERRAKSPTPDGSERIGLEVD
RRASRSSQSSKEEVNSEEYGSDHETGSSGSSDEQGNNTENEEEGVEEDVEEDEEVEEDAE
EDEEVDEDGEEEEEEEEEEEEEEEEEEEEYEQDERDQKEEGNDYDTRSEASDSGSESVSF
TDGSVRSGSGTDGSVDWLRQGRQEGWGVRYKMEMKTINKILVEKSDEKKKERKRARGISP
IVFDRSGSSASESYAGSEKKHEKLSSSVRAVRKDQTSKLKYVLQDARFFLIKSNNHENVS
LAKAKGVWSTLPVNEKKLNLAFRSARSVILIFSVRESGKFQGPYCTLKYCTQEAHYLSDL
YVAYTNVLPVKLILVKKCLKGDGYFEIELECGTQLCLLFPPDESIDLYQVIHKMRHKRRM
HSQPRSRGRPSRREPVRDVGRVFKGSTIPGSGQPPYPGMEQPPHHPYYQHHAPPPQAHPP
YSGHHPVPHEARYRDKRVHDYDMRVDDFLRRTQAVVSGRRSRPRERDRERERDRPRDNRR
DRERDRGRDRERERERLCDRDRDRGERGRYRR

>gi568815594f:68276323_68476501|GENSCAN_predicted_CDS_2|2439_bp
atggagcgacaatgtctgaaatatgcagccctgggcacacttggcacagcccacaggaca
gcaggagcagcagcttttcttgcaggaggtgcatttgcactctttgcacttgcaggagcc
ggcgcaggtgcaggagtcactggcggcacaggagcagttgggatccatggcgagctgaag
aggcggctggagtcgggacaggttgtacacgggctcgctgggacttggaggaggcgtgga
tgccactacacccacttcagacccgtgagcctcaaagtgtctaggaggcggtttgatatg
cagactaaagtgagatacggactgacggacgggcccgtgcttctgccgcggctgcggcgc
ccgcgcgagtcgcgtctaagcggcggcggcggtggcagcggcggaaaccgaaggggagcc
atggcggctgacagtcgggaggagaaagatggagaacttaatgttctggatgatatttta
actgaagtaccagaacaagatgatgaactgtataatccagagagtgaacaagataaaaat
gagaaaaagggatcaaaaagaaaaagtgatcgaatggaatctactgataccaaacgacaa
aagccttctgtccattctagacaactggtttctaagccactgagctcatctgttagcaat
aacaaaagaatagttagtacaaaaggaaagtcagccacagagtataaaaatgaggaatat
caaagatctgaaagaaacaagcgtctagatgctgatcggaaaattcgtctatcaagtagt
gcctccagagaaccttataagaatcaacctgaaaaaacctgtgtccggaaaagggatcct
gaaaggagggccaaatctcctacgccagatggttctgagagaattgggcttgaagtggat
agacgtgcaagcagatccagccagtcttctaaggaagaagtgaactctgaagaatatggc
tctgaccatgagactggcagcagtggttcttctgatgagcaagggaacaacactgagaat
gaggaggaaggagtggaagaagatgtggaggaagatgaagaagtagaagaagatgcagaa
gaagatgaagaggtggatgaagatggagaggaggaggaggaagaggaggaggaggaagag
gaggaggaggaggaggaagaagaagaatatgaacaggatgagagagaccagaaagaggag
ggaaatgattatgacactcgaagtgaggccagtgactctggttctgaatctgtttccttc
acagatgggtctgtcagatctggttcaggcacagatggatcagttgactggctgagacag
ggtcgacaagagggctggggtgtaaggtataagatggagatgaagactataaataaaatt
cttgtggaaaaatcagatgagaaaaagaaggaaaggaagagagctagaggcatatctcca
attgtttttgatagaagtggaagctctgcatcagagtcatatgcaggttcagaaaagaag
catgagaaattatcatcttccgttcgtgctgtccgaaaagatcaaaccagtaaactcaaa
tatgtgcttcaagatgcaagatttttcctcataaagagtaacaaccatgagaatgtgtct
cttgccaaagcgaagggtgtatggtccacgctccctgtaaatgagaagaaattaaatctt
gcatttagatctgcaaggagtgttatcttaatattttctgtcagagagagtggaaaattt
caaggtccttactgcacactgaagtattgtacacaagaagctcattatttaagtgatctt
tacgtggcatacacaaatgtattacctgtaaaattaattttagttaaaaagtgtctgaaa
ggagatggttactttgaaattgaacttgaatgtggaacccagctttgtcttctgtttccc
cccgatgaaagtattgacttgtatcaggtcattcataaaatgcgtcacaagagaagaatg
cattctcagccccgatcacgaggacgtccatcccgtcgagaaccagtccgggatgtggga
agggtatttaaaggatccacgataccaggaagtggacagcccccttacccaggaatggaa
caacctccacaccatccttactatcagcaccatgctccacctcctcaagctcatccccct
tactcaggacatcatccagtaccacatgaagcaagatacagagataaacgagtacatgat
tatgatatgagggtggatgatttccttcgtcgcacacaagctgttgtcagtggccggaga
agtagaccccgtgaaagagaccgggaacgagagcgagaccgccctagagataacagacga
gacagagagcgagatagaggacgtgatagagaaagagaaagagagcgattatgtgatcga
gacagagaccgaggggagagaggtcgatatagaagataa

>gi568815594f:68276323_68476501|GENSCAN_predicted_peptide_3|102_aa
MPFPEVTEHPSPAKVSSLLQEMTLPGSYAKSILFLIGFDQLYPTRSNPKEVPNYGTRPLK
WLNSQKNDDNNNSNNNYKNGGVVAEKYTSKRKKKRKGFDFDY

>gi568815594f:68276323_68476501|GENSCAN_predicted_CDS_3|309_bp
atgcctttccctgaagtgacagaacatccttcacctgccaaggtcagcagtttgctacag
gagatgacacttccaggatcatatgccaagtctattctttttctcattgggtttgatcaa
ctctatccaactagatcaaatccaaaggaagttccaaattatggaacaaggcctctgaag
tggctaaattcccagaaaaacgacgacaacaacaacagcaacaacaactacaaaaacggt
ggtgtggtagcagagaaatataccagcaaaaggaaaaaaaagaggaaaggctttgatttt
gactactaa

>gi568815594f:68276323_68476501|GENSCAN_predicted_peptide_4|321_aa
MENDFDELKEDGFRRLVITNFSKLQEDVQTHCKEAKNLEKRLDKWLTRINSVEKSLKDLM
ELKTMAREVCDACTSFSSRFDQLEEGVSVIEDQMNEMQTKDNNHMIISIDAEKAFDKIQQ
PFMLKTLNKLAISNFSKFSGYKFNVQKSQGFLYTNNRQTERQVMSELPFTIASKTIKYQG
IQLTRDVKDLFKENYKPLFNEIKEDTNKWKKIPCSWIGRINILKMAILPKVIYRFNAIPI
KLPMTFFTELEKPTLKFMWNQKRAHIAKSVLSQKNKAGGIILPDFKLYYKATVTKTAWYW
YQNRDIDQWNRIEPPEIIPHI

>gi568815594f:68276323_68476501|GENSCAN_predicted_CDS_4|966_bp
atggagaatgactttgacgagttgaaagaagacggcttcagacgattggtaataacaaac
ttctccaagctacaggaggatgttcaaacccattgcaaagaagctaaaaaccttgaaaaa
agactagacaaatggctaactagaataaacagtgtagagaagtccttaaaggacctgatg
gagctgaaaaccatggcacgagaagtatgtgacgcatgcacaagcttcagtagccgattc
gatcaactggaagaaggggtatcagtgattgaagatcaaatgaatgaaatgcaaaccaaa
gacaacaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacag
cccttcatgctaaaaactctcaataaactagcgataagcaacttcagcaaattctcgggt
tacaaattcaatgtgcaaaaatcacaaggattcctatacaccaataatagacaaacagag
aggcaagtcatgagtgaactcccattcacaattgcttcaaagacaataaaataccaagga
atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgttcaac
gaaataaaagaggacacaaacaaatggaagaaaattccatgctcatggataggaagaatc
aatattttgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatc
aagctaccaatgactttcttcacagaattggaaaaacctactttaaagttcatgtggaac
caaaaaagagcccacattgccaagtcagtcctaagccaaaagaacaaagctggaggcatc
atactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg
taccaaaacagagatatagaccaatggaacagaatagagcccccggaaataataccacac
atctag

>gi568815594f:68276323_68476501|GENSCAN_predicted_peptide_5|434_aa
MGNDFMIKAPKARATKAKIDKWDLIKLKGFCTAKETTIRVNRQPTEWEEIFAIYPSDKGL
ISRIYKELKQIYKKKNNSIKKWAKDMNRQFSKEDIYAANGHMKKCSSSLAIREMEIKTTM
RYHLTPVRTAIIKKSRNKRPPQLVCPILGLILTFQSWPGLAQYSARGSWTSHKVIDRAGF
THLDVDLDLHRTLHCWLAMMYRPDVVRARKRVCWEPWVIGLVIFISLIVLAVCIGLTVHY
VRYNQKKTYNYYSTLSFTTDKLYAEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVK
SQVIKFSQQKHGVLAHMLLICRFHSTEDPETVDKIVQLVLHEKLQDAVGPPKVDPHSVKI
KKINKTETDSYLNHCCGTRRSKTLGQSLRIVGGTEVEEGEWPWQASLQWDGSHRCGATLI
NATWLVSAAHCFTT

>gi568815594f:68276323_68476501|GENSCAN_predicted_CDS_5|1302_bp
atgggcaatgacttcatgattaaagcaccaaaagcaagggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagggcttctgcacagcaaaagaaactaccatcagagtg
aacaggcaacctacagaatgggaggaaatttttgcaatctacccatctgacaaagggcta
atatccagaatctacaaagaacttaaacaaatttacaagaaaaaaaacaactccatcaaa
aaatgggcaaaggatatgaacagacaattctcaaaagaagacatttatgcagccaacgga
cacatgaaaaagtgctcatcgtcactggccatcagagaaatggaaatcaaaaccacaatg
agataccatctcacaccagttagaacagcgatcattaaaaagtcaagaaacaagagacct
cctcagctcgtatgccctatcttaggattaatactcactttccagtcctggccaggcctg
gcccagtactctgcaagaggatcttggacctcccacaaagtcattgatagagctggattc
acacacttggatgtagacctcgaccttcacaggactcttcattgctggttggcaatgatg
tatcggccagatgtggtgagggctaggaaaagagtttgttgggaaccctgggttatcggc
ctcgtcatcttcatatccctgattgtcctggcagtgtgcattggactcactgttcattat
gtgagatataatcaaaagaagacctacaattactatagcacattgtcatttacaactgac
aaactatatgctgagtttggcagagaggcttctaacaattttacagaaatgagccagaga
cttgaatcaatggtgaaaaatgcattttataaatctccattaagggaagaatttgtcaag
tctcaggttatcaagttcagtcaacagaagcatggagtgttggctcatatgctgttgatt
tgtagatttcactctactgaggatcctgaaactgtagataaaattgttcaacttgtttta
catgaaaagctgcaagatgctgtaggaccccctaaagtagatcctcactcagttaaaatt
aaaaaaatcaacaagacagaaacagacagctatctaaaccattgctgcggaacacgaaga
agtaaaactctaggtcagagtctcaggatcgttggtgggacagaagtagaagagggtgaa
tggccctggcaggctagcctgcagtgggatgggagtcatcgctgtggagcaaccttaatt
aatgccacatggcttgtgagtgctgctcactgttttacaacn