GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:27:20 Sequence gi568815575r:120526213_120727166 : 200954 bp : 40.24% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.24 PlyA - 434 429 6 -1.75 1.23 Term - 644 549 96 2 0 77 39 128 0.943 3.79 1.22 Intr - 4042 3890 153 1 0 33 100 117 0.953 6.75 1.21 Intr - 6382 6210 173 2 2 97 20 131 0.992 5.94 1.20 Intr - 8374 8269 106 1 1 101 58 69 0.996 4.07 1.19 Intr - 9731 9618 114 2 0 77 107 15 0.801 2.02 1.18 Intr - 10822 10715 108 1 0 62 94 62 0.927 3.76 1.17 Intr - 11263 11202 62 2 2 118 60 48 0.421 2.53 1.16 Intr - 12558 12448 111 1 0 66 78 103 0.989 6.53 1.15 Intr - 13160 13056 105 0 0 82 115 87 0.998 10.07 1.14 Intr - 14350 14158 193 1 1 53 77 166 0.998 10.14 1.13 Intr - 15508 15390 119 2 2 48 62 105 0.993 3.16 1.12 Intr - 16821 16754 68 2 2 124 86 32 0.993 4.23 1.11 Intr - 17597 17515 83 2 2 44 100 81 0.998 2.42 1.10 Intr - 17991 17902 90 0 0 118 53 107 0.999 9.47 1.09 Intr - 18431 18269 163 1 1 70 108 103 0.336 9.56 1.08 Intr - 28741 28665 77 1 2 70 49 41 0.080 -4.21 1.07 Intr - 31827 31712 116 1 2 58 91 128 0.966 9.35 1.06 Intr - 34413 33871 543 1 0 110 86 412 0.992 35.14 1.05 Intr - 34510 34470 41 0 2 130 61 47 0.821 3.15 1.04 Intr - 34786 34626 161 1 2 104 -2 134 0.585 3.86 1.03 Intr - 40608 40479 130 2 1 86 41 55 0.279 0.38 1.02 Intr - 42349 42253 97 2 1 16 105 31 0.066 -4.25 1.01 Init - 48465 48339 127 0 1 53 89 115 0.674 8.47 1.00 Prom - 48520 48481 40 -8.95 2.00 Prom + 49004 49043 40 -9.45 2.01 Init + 49676 49731 56 1 2 70 81 8 0.153 -0.69 2.02 Term + 55670 55994 325 2 1 16 52 910 0.981 73.45 2.03 PlyA + 56136 56141 6 1.05 3.04 PlyA - 59547 59542 6 1.05 3.03 Term - 77934 77798 137 1 2 -44 48 265 0.041 6.50 3.02 Intr - 88500 88429 72 1 0 81 100 50 0.034 3.96 3.01 Init - 91447 91384 64 1 1 53 37 43 0.035 -2.94 3.00 Prom - 91943 91904 40 -5.05 4.05 PlyA - 92491 92486 6 1.05 4.04 Term - 100959 99998 962 1 2 98 54 435 0.065 32.33 4.03 Intr - 103588 103506 83 0 2 44 15 33 0.030 -10.04 4.02 Intr - 103834 103705 130 2 1 71 100 128 0.175 11.13 4.01 Init - 110518 110449 70 1 1 90 74 18 0.067 1.89 4.00 Prom - 121296 121257 40 -3.45 5.02 PlyA - 121784 121779 6 1.05 5.01 Sngl - 130028 129609 420 2 0 36 48 214 0.924 8.25 5.00 Prom - 133772 133733 40 -0.65 6.03 PlyA - 134102 134097 6 1.05 6.02 Term - 134574 134466 109 2 1 88 47 64 0.156 -0.70 6.01 Init - 149708 149509 200 2 2 72 9 206 0.529 9.52 6.00 Prom - 155185 155146 40 -5.95 7.07 PlyA - 155437 155432 6 1.05 7.06 Term - 157979 157631 349 1 1 73 43 123 0.076 -0.73 7.05 Intr - 170285 170148 138 1 0 25 93 85 0.230 1.36 7.04 Intr - 170583 170393 191 0 2 70 76 143 0.443 8.76 7.03 Intr - 170816 170759 58 1 1 90 61 32 0.381 -1.33 7.02 Intr - 178913 178730 184 0 1 58 91 138 0.011 9.02 7.01 Init - 189889 189583 307 1 1 85 37 98 0.031 1.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 173037 173645 609 2 0 88 54 224 0.960 14.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:120526213_120727166|GENSCAN_predicted_peptide_1|1011_aa MGEDCWKSEFSVFYRRPKGRMMSQSSGSGDGNDDEATTSKDGGIGERNNAVGGNKGHLVI EAGKIHDLVIETSLRWFTGMTDKISELRNEVLEWLMYQVSNFRRCKINNFAKNMKFGWGE GEKEEELQPHRLAESLLQKGGERDRKAGEGDGTAVYLSASSFALPPRSAHSCIATFSPTR TQAYKRFSSPSPSAAAAAQEVRSATDGNTSTTPPTSAKKRKLNSSSSSSSNSSNEREDFD STSSSSSTPPLQPRDSASPSTSSFCLGVSVAASSHVPIQKKLRFEDTLEFVGFDAKMAEE SSSSSSSSSPTAATSQQQQLKNKSILISSVASVHHANGLAKSSTTVSSFANSKPGSAKKL VIKNFKDKPKLPENYTDETWQKLKEAVEAIQNSTSIKYNLEELYQKDYFEMFKFKQEDKG GKAISYNLDFRDMGLELFRAHIISDQKVQNKTIDGILLLIERERNGEAIDRSLLRSLLSM LSDLQIYQDSFEQRFLEETNRLYAAEGQKLMQEREVPEYLHHVNKRLEEEADRLITYLDQ TTQKSLIATVEKQLLGEHLTAILQKGLNNLLDENRIQDLSLLYQLFSRVRGGVQVLLQQW IEYIKAFGSTIVINPEKDKTMVQELLDFKDKVDHIIDICFLKNEKFINAMKEAFETFINK RPNKPAELIAKYVDSKLRAGNKEATDEELEKMLDKIMIIFRFIYGKDVFEAFYKKDLAKR LLVGKSASVDAEKSMLSKLKHGHSLQTGSGASEKLETWAGWGYMQNQNVPGNIELTVNIL TMGYWPTYVPMEVHLPPEMVKLQEIFKTFYLGKHSGRKLQWQSTLGHCVLKAEFKEGKKE LQVSLFQTLVLLMFNEGEEFSLEEIKQATGIEDGELRRTLQSLACGKARVLAKNPKGKDI EDGDKFICNDDFKHKLFRIKINQIQMKETVEEQASTTERVFQDRQYQIDAAIVRIMKMRK TLSHNLLVSEVYNQLKFPVKPADLKKRIESLIDRDYMERDKENPNQYNYIA >gi568815575r:120526213_120727166|GENSCAN_predicted_CDS_1|3036_bp atgggagaggactgctggaaaagtgaattctctgttttctatcgtagacccaaaggacgg atgatgtcacagtcatctggatcaggagatgggaatgatgatgaggctactacctctaaa gacggtggaattggagagagaaataatgctgttggtgggaataaaggacatttagtaata gaagcaggaaaaatacatgacttggtaatagaaacttccctgagatggttcacaggtatg acagataaaatctctgagctaaggaacgaagtgcttgaatggttgatgtaccaagtcagt aacttcaggcgatgtaagataaacaattttgctaaaaatatgaaatttggctggggagag ggagaaaaggaggaggagctgcagcctcacagactcgctgagtcgctcctgcagaaaggg ggggagagagatcgaaaagcaggggagggggacggcacggccgtttacctgtctgcctcc tcattcgctctcccccctcgttctgctcactcctgcattgctaccttctctcctacacgc acgcaggcatataaacgtttttcttcccccagtccctcagctgctgctgctgctcaggag gtcagatctgccactgatggtaataccagcaccactccgcccacctctgccaagaagaga aagttaaacagcagcagcagtagcagcagtaacagtagtaacgagagagaagactttgat tccacctcttcctcctcttccactcctcctttacaacccagggattcggcatccccttca acctcgtccttctgcctgggggtttcagtggctgcttccagccacgtaccgatacagaag aagctgcgttttgaagacaccctggagtttgtagggtttgatgcgaagatggctgaggaa tcctcctcctcctcctcctcatcttcaccaactgctgcaacatctcagcagcagcaactt aaaaataagagtatattaatctcttctgtggcttcggtgcatcatgcaaacggcctagcc aaatcttctaccaccgtctctagctttgctaacagcaaacctggctctgctaagaagtta gtgatcaagaactttaaagataagcctaaattaccagaaaactacacagatgaaacctgg caaaaactgaaagaagcagtggaagctattcagaatagtacttcaattaagtacaattta gaagaactctaccagaaagattattttgagatgttcaagttcaaacaggaggataaagga ggtaaagctatttcatataatctggacttcagggacatgggactggagttatttagggct catattataagtgatcagaaagtgcagaataagacaattgatggcattcttctcttgatt gagagggaaaggaatggtgaagcaattgatagaagtttacttcgaagccttttaagcatg ctgtctgatttgcaaatttatcaagattcttttgaacaacgatttttggaagaaactaac cggctctatgcagctgaaggccaaaaattaatgcaagaaagagaggttcctgaatatcta catcatgttaacaaacgtctagaagaagaagcagacagacttattacttacttagatcag accacccagaagtcattaattgctactgtagaaaaacaacttctaggtgaacacttaaca gcaattcttcagaaaggtttaaataacctccttgatgaaaaccgaattcaagatttgtct cttctgtatcagctcttcagtagagttcgaggtggagttcaggttcttttgcagcagtgg atcgaatatatcaaggcatttggcagcactattgtaattaatcctgaaaaagataaaacc atggttcaagaattgctggattttaaagataaggttgaccatataattgatatctgcttt ctgaagaatgagaaatttatcaatgccatgaaagaagcatttgaaacgttcattaacaaa agaccaaataaaccagctgaacttatagctaagtatgtagattcaaaacttcgtgcaggc aacaaagaagctacagatgaagaacttgagaaaatgttggataaaattatgatcatattt agatttatctatggcaaggatgtttttgaggccttctataagaaagatttagccaagcgc ctgttagtcggaaagagtgcatctgtagatgctgaaaaatcaatgctgtccaaacttaaa catggtcactccctgcagacaggcagtggagcaagtgaaaaactggagacttgggctggc tgggggtatatgcagaatcagaatgttccgggaaatattgagttaactgtgaatatcctg acaatgggctattggccgacatatgtgcctatggaagttcatttaccaccagagatggta aaacttcaggagattttcaagacattttacctaggcaaacatagtggcaggaaacttcag tggcagtcaaccctaggacactgtgtgttaaaagcagaatttaaagagggtaaaaaggaa ctccaggtctctctttttcaaacactggtgctgctaatgtttaatgagggagaggagttc agtttagaagagatcaagcaggcaactggaatagaggatggagagttaaggagaacactg cagtcattagcctgtggcaaagctagagttctggcgaaaaatccaaagggcaaagacatt gaagatggtgacaagttcatttgtaatgatgatttcaaacataaacttttcaggataaag atcaatcaaatccagatgaaagaaacggttgaagaacaagcaagcactacagaaagagta tttcaagacagacagtatcaaattgatgctgcaattgttcgaattatgaagatgagaaag acacttagccacaatctccttgtttcagaagtgtacaaccagttgaaatttccagtaaag cctgctgatcttaagaagagaatagaatctttaattgaccgggactacatggaaagagat aaagaaaatccaaaccagtacaactatattgcatag >gi568815575r:120526213_120727166|GENSCAN_predicted_peptide_2|126_aa MGRVRGIRFSTWSLLSGKSKKEKKKKKKKKKKKKKKKKKKKKKKKKKRKKRRREEEEKKK KKKKKKKKKKKKKKRRRRRKEEEEEEEEEEEEEEEEEEEEEEEEEEEILSNQKPSLEEAL SPCLSE >gi568815575r:120526213_120727166|GENSCAN_predicted_CDS_2|381_bp atggggagggtgaggggcatcaggttctccacgtggtctctgctgagcgggaaaagtaag aaagagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaagaagaagaagaagaagaggaagaaaagaagaagagaagaagaagagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaagaaaagaagaagaagaagaaaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaggaagaagaaatcctgagtaatcagaagccttccctggaagaggccctt tctccttgcctctctgagtga >gi568815575r:120526213_120727166|GENSCAN_predicted_peptide_3|90_aa MNFKGVIDYAFEVSYIHILAEKHKELQASNRVPPPPNRETSEFEARRPALTVGAVVVIRK TRASRALRLSVKSDYFRSSGLHVGAFAGSF >gi568815575r:120526213_120727166|GENSCAN_predicted_CDS_3|273_bp atgaactttaagggagtaattgattatgcttttgaagtctcctatattcacatattggct gaaaaacataaggagctacaggcctcaaatagggtgcctcctcccccaaatagggagaca tctgaatttgaagccagaaggccggcactgacagtcggagctgtagtcgttatccggaaa acgagagccagccgcgctttacggctctcagtcaaatctgactacttccgctcctctgga ctccacgtaggcgcttttgccggctccttttag >gi568815575r:120526213_120727166|GENSCAN_predicted_peptide_4|414_aa MPCFPFTFHYDCTFPEASLAMQNSRPTREHRLQLEQPGQERNGVVRQRERKPVRGCAFLS PSRSRRGCSGRIVSTGDRESSSWGRSLGPSEGYRGKMLSESSSFLKGVMLGSIFCALITM LGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDERMELSKSFRVYCIILVKPKDVSLWA AVKETWTKHCDKAEFFSSENVKVFESINMDTNDMWLMMRKAYKYAFDKYRDQYNWFFLAR PTTFAIIENLKYFLLKKDPSQPFYLGHTIKSGDLEYVGMEGGIVLSVESMKRLNSLLNIP EKCPEQGGMIWKISEDKQLAVCLKYAGVFAENAEDADGKDVFNTKSVGLSIKEAMTYHPN QVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFGHIFNDALVFLPPNGSDND >gi568815575r:120526213_120727166|GENSCAN_predicted_CDS_4|1245_bp atgccttgcttccccttcaccttccactatgattgtacgtttcctgaggcttctctagcc atgcagaactcgagaccaacgagagaacaccgcctgcagctagaacagcctggtcaggag cgtaacggagtggtgcgccaacgtgagaggaaacccgtgcgcggctgcgctttcctgtcc ccaagccgttctagacgcgggtgcagcgggaggattgtcagcacaggagaccgagagtct tcttcctgggggcgctctcttggtccctcagagggatatcgaggaaaaatgctttctgaa agcagctcctttttgaagggtgtgatgcttggaagcattttctgtgctttgatcactatg ctaggacacattaggattggtcatggaaatagaatgcaccaccatgagcatcatcaccta caagctcctaacaaagaagatatcttgaaaatttcagaggatgagcgcatggagctcagt aagagctttcgagtatactgtattatccttgtaaaacccaaagatgtgagtctttgggct gcagtaaaggagacttggaccaaacactgtgacaaagcagagttcttcagttctgaaaat gttaaagtgtttgagtcaattaatatggacacaaatgacatgtggttaatgatgagaaaa gcttacaaatacgcctttgataagtatagagaccaatacaactggttcttccttgcacgc cccactacgtttgctatcattgaaaacctaaagtattttttgttaaaaaaggatccatca cagcctttctatctaggccacactataaaatctggagaccttgaatatgtgggtatggaa ggaggaattgtcttaagtgtagaatcaatgaaaagacttaacagccttctcaatatccca gaaaagtgtcctgaacagggagggatgatttggaagatatctgaagataaacagctagca gtttgcctgaaatatgctggagtatttgcagaaaatgcagaagatgctgatggaaaagat gtatttaataccaaatctgttgggctttctattaaagaggcaatgacttatcaccccaac caggtagtagaaggctgttgttcagatatggctgttacttttaatggactgactccaaat cagatgcatgtgatgatgtatggggtataccgccttagggcatttgggcatattttcaat gatgcattggttttcttacctccaaatggttctgacaatgactga >gi568815575r:120526213_120727166|GENSCAN_predicted_peptide_5|139_aa MASVQHLKTFGRFSRYHRSCLKDCSKIAKLINELIQDGQVRIGAHIRKGCGVEEEGRSRY KMPGPDSEEECLRLNYVAYQNKAALTEGSPRKQTNRRRSFHLGKLLSCAPPLTSTFWKLS TCLAQGSVLAYAVHYLQDL >gi568815575r:120526213_120727166|GENSCAN_predicted_CDS_5|420_bp atggccagtgttcaacatttaaagacatttgggagattcagcagataccacaggagctgt ttgaaggactgttcaaagattgcaaagctcatcaatgagttgattcaggatggtcaggtc agaattggagcacacattagaaagggctgtggagttgaagaggaagggagaagcaggtac aaaatgccagggcctgacagtgaggaagagtgtttgcggctcaactacgtggcctatcaa aataaagctgcccttactgagggatcccccagaaagcaaactaacagacgaaggagtttt cacctgggcaaacttctctcatgtgctcctcctcttacttcaacattttggaaattgtca acctgcttggctcaaggttctgttttagcatatgctgtccactacctccaagacctttga >gi568815575r:120526213_120727166|GENSCAN_predicted_peptide_6|102_aa MVDPPIAYTVHLEKLQTMPAHESNWKWGCTLQSHRVELPKTMRANLLHQCALDVTHGVKG DHFETLRPRVKGAAMIRSPDHWSHVTGNIHMQDIKGKTWKDR >gi568815575r:120526213_120727166|GENSCAN_predicted_CDS_6|309_bp atggtagatccaccaatagcttacaccgtgcacctggaaaagctgcagacaatgccagcc catgaaagcaactggaagtggggctgtaccctgcaaagccacagggtggagctgcccaag accatgagagccaacctcttgcatcagtgtgccctggatgtaacacatggagtcaaagga gatcattttgaaactttaagaccaagagtgaaaggagctgcaatgatcaggagcccagac cattggtcccacgtaactggtaacatccacatgcaggacataaaggggaaaacttggaaa gacagatga >gi568815575r:120526213_120727166|GENSCAN_predicted_peptide_7|408_aa MDNVCHSSIMWYQLGQLNWELEDLYPRWLTSIGWQSGIGCQVGVQPGWKARSLDSSPQEP LHGLSHSMWLDSKNKHSQNRKMPVSSDIGPETDTVSLLPYSIGLLRPFNMIINIVNLQEG FECALVLKLTLAQNPPFNEEHVSGLMFPKAQFEKQDFRKIDLARGGYVSEVFFVLKLWGV IQQLWGNLRAAKEEILVMVVAQGRLLDSWGLYAREMQVSNCSVQSAQDGGFVLWTQARGS LPGDEPRLRVARAVAEKLSNAPGGSAQGAAEFVLAQELWWQGVWRPRPGEPARRLNGQRW SPSNTGLSRETRKRLKILCLPPHQPREGEVCAGGAEQSARKERPPPPPSPKSQESQAEAT STSNAGNTKEETGLQATELSTPSSQRPAPTPRVRKHNEELGWEHTSDC >gi568815575r:120526213_120727166|GENSCAN_predicted_CDS_7|1227_bp atggacaatgtttgtcacagtagtatcatgtggtatcagctggggcagctcaactgggag ctagaggacctgtatccaagatggctcactagcattggctggcaaagtggtattggctgt caggtgggagtccagccagggtggaaagccaggagccttgattcctctccacaggaacct ctgcatgggctttctcacagcatgtggctggattccaagaacaaacactcccagaatagg aagatgccagtttcttcagatataggcccagaaactgacacagtgtcacttctaccatat tctattggacttctcagaccctttaacatgataataaacatagtgaatcttcaggaggga tttgaatgtgccctggttcttaaactcacattagcacaaaatcctccttttaatgaggag catgtatcagggctaatgtttcccaaagcacagtttgagaaacaggattttaggaagatt gatctggcaagagggggatatgttagtgaagtatttttcgtgttgaagctttggggtgtg atccagcagctctggggcaatctcagagctgctaaggaagagatcttagtaatggttgtg gcccagggtcgtttgctcgactcctggggcctctatgccagagagatgcaggtcagcaat tgctcagtgcagtcagcccaagatggagggtttgtgctgtggacccaagctaggggttcc ctgcctggtgatgagccacgtctgagggtggcaagggcagtggcagagaagctttcaaat gcccctggaggatctgcacagggagctgctgagtttgtactggctcaagagctctggtgg cagggtgtctggaggcccaggcctggagaacctgcccgcagactgaatggacaaagatgg agtccttcaaatacaggcctttcaagagaaaccagaaagaggttgaagatattgtgtctc ccacctcaccaaccaagagaaggggaagtttgtgctggtggtgctgaacagtctgccagg aaggaaaggccaccgccaccaccttctcccaagagccaggagtcccaagcagaggctacc tctaccagtaatgctggaaacacaaaagaagagacagggctccaagccacagaattgtcc accccctcaagccagagaccagcaccgacaccaagggtcaggaagcacaatgaagagctg ggttgggagcacacaagtgattgctaa