GENSCAN 1.0	Date run: 8-Nov-116	Time: 00:10:40

Sequence gi568815593r:9529136_9730032 : 200897 bp : 40.51% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5388   5495  108  2  0  101  -32   119 0.045   1.47
 1.02 Intr +   5958   6041   84  2  0   67   80    67 0.122   2.90
 1.03 Intr +   7678   7764   87  0  0   91   40    75 0.063   2.25
 1.04 Intr +  14989  15055   67  0  1   66  111    37 0.312   1.36
 1.05 Intr +  15461  15609  149  0  2   63   51   107 0.410   3.53
 1.06 Intr +  16195  16398  204  0  0   71   37   186 0.842  10.27
 1.07 Intr +  16576  16805  230  0  2   59   44   145 0.470   2.94
 1.08 Term +  20881  21100  220  1  1   49   38   191 0.583   5.73
 1.09 PlyA +  23241  23246    6                               1.05

 2.00 Prom +  23784  23823   40                             -11.84
 2.01 Sngl +  24098  24547  450  1  0   81   51   249 0.991  16.40
 2.02 PlyA +  25957  25962    6                               1.05

 3.00 Prom +  29569  29608   40                              -4.35
 3.01 Init +  36434  36618  185  1  2   78   75    65 0.351   2.84
 3.02 Term +  38683  38869  187  1  1   94   33   128 0.828   3.88
 3.03 PlyA +  39432  39437    6                               1.05

 4.03 PlyA -  39730  39725    6                               1.05
 4.02 Term -  51445  51299  147  1  0   96   49    84 0.424   2.22
 4.01 Init -  53877  53803   75  0  0   91   60    74 0.386   4.30
 4.00 Prom -  55519  55480   40                              -3.65

 5.03 PlyA -  55813  55808    6                              -0.45
 5.02 Term -  56836  55849  988  0  1    6   47   293 0.523   7.55
 5.01 Init -  58110  57038 1073  0  2   39  110   314 0.285  22.57
 5.00 Prom -  62810  62771   40                              -7.55

 6.00 Prom +  73834  73873   40                              -8.15
 6.01 Init +  74958  75014   57  2  0   80   31    75 0.242   2.37
 6.02 Intr +  76070  76260  191  1  2   15   37   174 0.182   2.46
 6.03 Intr +  83513  83602   90  2  0   18   85    92 0.141   0.09
 6.04 Intr +  84209  84341  133  2  1   84   88    77 0.399   7.03
 6.05 Intr +  94172  94278  107  1  2   90   38    81 0.016   1.39
 6.06 Intr + 100663 100882  220  1  1   30   78   200 0.696  10.58
 6.07 Term + 109013 109105   93  1  0  124   40    49 0.089   0.55
 6.08 PlyA + 111232 111237    6                               1.05

 7.00 Prom + 113354 113393   40                              -6.55
 7.01 Init + 115346 115459  114  1  0   75  115    46 0.900   6.26
 7.02 Intr + 116391 116474   84  2  0  111   64    52 0.653   4.20
 7.03 Term + 122696 122827  132  1  0   80   34    92 0.535   0.11
 7.04 PlyA + 123550 123555    6                               1.05

 8.05 PlyA - 125229 125224    6                               1.05
 8.04 Term - 136144 135938  207  1  0   50   48   155 0.346   4.06
 8.03 Intr - 138455 138256  200  0  2   78   68   168 0.404  11.95
 8.02 Intr - 144751 144637  115  1  1   72   67    28 0.355  -1.80
 8.01 Init - 145420 145394   27  1  0   39  116    40 0.372   1.55
 8.00 Prom - 149193 149154   40                              -3.25

 9.00 Prom + 150009 150048   40                              -3.55
 9.01 Sngl + 166383 166583  201  2  0   57   50   166 0.760   4.63
 9.02 PlyA + 166814 166819    6                               1.05

10.02 PlyA - 169008 169003    6                               1.05
10.01 Sngl - 171708 171547  162  0  0   75   38   198 0.631   7.85
10.00 Prom - 176188 176149   40                              -4.25

11.06 PlyA - 177000 176995    6                               1.05
11.05 Term - 178080 178064   17  1  2  101   55     7 0.149  -3.88
11.04 Intr - 181289 181097  193  2  1   18   70   156 0.241   4.94
11.03 Intr - 193057 192929  129  1  0   53   71    74 0.038   2.17
11.02 Intr - 194833 194628  206  2  2   71   57    78 0.028   1.00
11.01 Init - 196451 196376   76  2  1   88   82    88 0.402   7.77

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 64373 64451 79 2 1 90 53 77 0.830 3.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_1|382_aa MEPTLRGKPASVASPWNIMVPESGPDPDPKREFLHLLHPEDPRCESLRAFLCSKTNLREL ETPQAQASMKFLFTDTGKDILILNFRTVTGGQETTSIINNISADSKLSHYDRALLGPSNC LWRLVLRALARSAPCRESRGRRRRRKVKSQLRPRHPGTKQAKGEETSAQAGTQTSSPCTS STVGAPGLTVVLILYAQPPPRCLLMDPTRDRHHPPADRCSPRVPASGAARRGASRPLLAE FQNGGTGLGATSTVPERAANRWVGRFAPSRDDPRGGKVGCPSQLRGAVLGGRRERPGMSD TGRAAPARGRQTLLIFELSGTHFSRENHIPPPHSQVQGSSLAKAMTQGYQKQSTHLRRCA DSQEQLTLLKKSLLILLSVRAF >gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_1|1149_bp atggagcccactctgaggggcaagccagccagtgttgcatccccatggaacatcatggta ccagaaagcggtcccgatccagaccccaagagagagttcttgcacctccttcatcctgaa gatcctcgctgtgaatccttaagggctttcctatgcagcaagacaaacctcagggagctg gaaactccgcaggctcaagcttcaatgaagtttctcttcactgacactggcaaagatatt ctgattttaaacttcagaacagtgacaggaggacaggagactacctctataattaataac atatctgctgattccaaactcagccattatgacagagctcttctcggacccagcaactgc ctgtggcgcctcgtcctccgagcgctcgctcgcagcgccccctgccgggaatcccgagga cgccgcaggcgcaggaaagttaaatcccaactccggccgcggcatcctggtacaaagcaa gcaaagggcgaggagacctcggcacaggcgggtacacaaacttccagcccgtgcactagc tcaactgtgggcgcccccggactgacggtcgtcctaatcctgtacgcgcaacccccgccc aggtgcctgctgatggatcccacccgagatcggcaccacccaccagccgaccgctgcagt ccccgggtcccggccagcggcgcggcgcggcgcggcgcgagccgccccctactcgctgag ttccagaatgggggcaccggcttgggggccacgagcacggtccccgagcgcgcggccaac cggtgggtgggcaggttcgcgcccagcagagacgaccctcgaggtggcaaagttgggtgt ccaagccagctccgcggggcagtcctgggtgggagaagggagcgtcccgggatgagcgac actggccgggccgcccctgcacggggaaggcaaactctgctcatcttcgagctctctgga acccatttctcaagggagaaccacattcccccacctcacagccaagttcaaggaagcagc ctggctaaagcaatgactcaaggataccagaagcagagcacgcatttaagaagatgcgct gactcgcaggagcagctgactcttcttaagaagagtttactgattcttctttcagtacgg gccttttaa >gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_2|149_aa MALLGTAHAVALTSWSPVPVALLGWHCTLVALPVWDLRGNPDPTALLIIVLVGTVCTYLT 
FVAVFCLGSGLSKTSFEIYVETAMPPQLIHSIYMQSWHFKNVIKAYYLYLSEGWPRPHLS PLEPQLGQLKSTVLECGEQRSEEALGSET >gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_2|450_bp atggctttgctgggcacagcccatgctgtagctctcacaagttggagtccagtgcctgtg gctctcctaggctggcattgcacgctggtggctctaccagtttgggatctcagaggcaac cctgaccccacagctctgctgatcattgtcctagtagggaccgtctgcacctatctcacc tttgtggcagttttctgtctgggctctgggctctccaagacatcttttgaaatctacgtg gagacagccatgcctccacagctcatacactctatatacatgcagagttggcacttcaag aatgtcatcaaggcttattacttgtacctttcagaagggtggcccaggccacatctgagt ccacttgagccccagctggggcagctaaaaagcactgtgttagaatgcggggagcaaaga tctgaggaggccctaggcagtgaaacctga >gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_3|123_aa MAEFMGLAPLFFHPRDDSFSVDFLRVFLTVGGSSVASQYCVLTVDRGKWEPALDAQRKAD SRVLLPLFLMHRSGEKLKRRKAPSSHSELLQPGLAPHSCCQRGSPETQLAFAWRPPFMVG TFP >gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_3|372_bp atggcagagtttatgggattggcaccactctttttccacccaagagacgatagtttttct gttgacttcctcagggtattcttgactgttggtggcagcagtgtggcttcccaatactgt gtcttgacagtagacaggggcaagtgggagccagcgctagacgcgcagaggaaggcagac agcagagttttactcccattgtttctgatgcacagatctggagagaagctcaagagaagg aaggctccctcttcccactctgagctcctacaaccagggctggcccctcacagctgctgc caaagaggatctccagagacccagttggcttttgcttggaggcccccatttatggttggg acgttcccttag >gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_4|73_aa MAAMAGALTSCLDTEVTRHILGTEATSKYSGEKASGFWKRQGYKIENFSPRGEVLKSVHF KEEEDTGPMTSAG >gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_4|222_bp atggctgcgatggctggagctctgacatcctgcctggacactgaggtcacgagacacatt ctgggaacagaagcgacttccaaatattcaggagaaaaggcatctggtttctggaaaagg cagggctataaaatagaaaacttctcaccaagaggggaagttttaaaatcagtacatttc aaagaagaggaggacacagggccaatgaccagtgcaggatga >gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_5|686_aa MQKNYLCRHSKASRRQEITKIRAELKEKETQKTLQKINEPRSWFFERINKIDRPLARLIK KKREKNQIDAIKNDKGDITTDPTEIQTIIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPTTGSEIVAIINSLPTKKSPGPDGLTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKPGKDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHD 
QVGFIPGIQGWFNIRKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGI DGTYLKIIRAIYDKPTANIIRNGQKLEAFPLKTGTRQGCPLSPLPFNIVLEVLARAIRQT ESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGR INIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARITKSILSQKNKAGG ITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKD SLFNKWYWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIG MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGL ISRIYNELKQIYKKKQTTPSKSGRRT >gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_5|2061_bp atgcagaagaattacctgtgtaggcattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaaaagagacacaaaaaacccttcaaaaaattaatgaaccc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccacg gatcccacagaaatacaaactatcatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattccttgacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaacaacaggatctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattgacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagccgggcaaa gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccatcatgat caagtgggcttcatccctgggatacaaggctggttcaatatacgcaaatcaataaatgta atccagcatataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaa aaggcctttgacaaaattcaacagcccttcatgctaaaaactctcaataaattaggtatt gatgggacgtatctcaaaataataagagctatctatgacaaacccacagccaatatcata cggaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccactcccattcaacatagtgttggaagttctggccagggcaatcagacaaaca gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc aatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaaga atcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatgg aaccaaaaaagagcccgcatcaccaagtcaatcctaagccaaaagaacaaagctggaggc 
atcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtac tggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacaccg catatctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggat tccctatttaataaatggtactgggaaaactggctagccatatgtagaaagctgaaactg gatcccttccttacaccttatacaaaaatcaattcaagatggattaaagacttaaacgtt agacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacataggc atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactcaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacaaaatgggagaaaattttcgcaacctactcatctgacaaagggcta atatccagaatctacaatgaacttaaacaaatttacaagaaaaaacaaacaaccccatca aaaagtgggcgaaggacatga >gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_6|296_aa MSRDTYAGPALMVILAGLWDSNTKTKCKDPDVTIRVASSSNKNIGESGTGIEGDDIKETD SGWTCAALYGTPTLHGMEFHPSLNTKGSPSGRNEETLGSNLDSREEIKERQLRLLSCSYL LQGWLPLSSGFTITRMKLTVGDFLQRFRDVADSVTGQPVPNRSCSCRSVNSNRAAEQPQP VHDTSENEPVSWRTHDEFYEEDNNHINVEDEQLQKNSRNCQTRKKEIQRSHFSVLDQVNA IHHHNDAICENPKKKLYHCKKKIDNEGWGFLTAGLTAVIVVALLGLATQQGYQALG >gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_6|891_bp atgtccagagacacttacgcgggtcctgcattgatggtcatccttgcaggtctatgggac tctaatacaaagactaagtgcaaagatcctgatgtcacaatcagagttgcaagttccagc aacaagaacattggtgaatctgggacgggtattgaaggagatgatatcaaagagacagac agtggctggacatgtgcagctctgtatgggactcccacccttcatgggatggaattccac ccttcattaaatactaaaggaagtccttcaggcagaaatgaagaaacactaggcagtaac ttggattcacgtgaagaaataaaagagcgacagttaaggctcttgagctgttcctacctg ctccagggttggctcccactcagtagtgggtttaccataacaagaatgaagcttacagta ggagacttccttcagagattcagggatgtggcagattcagtcactggccagcctgttcca aacaggagctgctcctgcaggagtgtgaacagtaacagggctgctgaacagccacagcca gtgcacgacacaagtgagaatgaacctgtgtcttggagaacacatgatgaattctatgaa gaagataacaatcacattaacgtagaagatgaacaactgcagaaaaattctagaaactgc cagacaagaaagaaggagatccagcggagccatttttctgtgcttgatcaagtcaatgcc attcaccaccacaatgatgccatttgtgaaaatcccaagaagaaattgtatcactgcaag aagaaaatagataatgagggatggggcttcctgacagctggactgactgcagtgattgtt gttgctcttctgggtttagccacccagcaaggctaccaagctctgggctga 
>gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_7|109_aa MEPFFKKEKVEGRLKSRALCGHAPETATAGRDDIMSWMEPLSSTLVYGPYLGLTPSLNPG VVTETQCQASEMGKSSSAHQQALSGWASISPSVPVAHITVPFIMSMNLI >gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_7|330_bp atggagccatttttcaagaaggagaaggttgaaggaaggttaaaatcaagagctctgtgt ggacatgctcctgagacggctactgcaggtagagatgatattatgtcatggatggaacca ctctcctccacccttgtttatgggccttacttggggctgactccatcactgaatccagga gtagtcacagaaacccagtgccaagcatctgagatgggaaagtcatcttcagcccaccaa caagccctgagcggctgggcgtccatctccccatctgtaccagtggcacacatcactgtt ccattcatcatgtctatgaacttgatctga >gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_8|182_aa MDDEQKHGKGMSFAGPGEQPQLFQSHSIRQGMEKCHKGQQEEFRLCTGAKAGRQAISFPG QPKSTVQVGIRGTYGYACLQNSDKSEATGQEALAGVAHLATRSKVDRVARPAIQEQEPLK AERGSQREPRHEQGLICHGLLTDGGHVSKNGTSVLPLLNSANNKEEPGSGSSRKECGPAD TL >gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_8|549_bp atggatgatgaacagaagcatggaaaggggatgagctttgctgggcctggggagcagcct cagttgtttcagagccattccattagacagggtatggagaagtgccacaagggacagcaa gaagaatttcgtttgtgcacaggggccaaagctggcagacaggctatatcctttccaggt caacccaaatctactgtccaagtggggataaggggaacatacggttatgcctgcctgcag aattcagacaaaagtgaggccactgggcaggaagctctagcaggtgtggcccatctggct acaagaagtaaggtggatagagttgcccgtcctgcaatccaggagcaggagcccttaaaa gcagagagaggaagccagagagaaccaaggcatgagcagggcttgatatgtcatggcttg ctcacagatggaggtcatgtgtcaaagaatgggacctcagttctaccactgttgaattct gccaacaacaaggaggagcctggaagtggatcttccagaaaagaatgtggcccagctgac accttgtaa >gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_9|66_aa MLHFEDLKMDKPFSDHLKSGLGALGVGVWLHRVQRPKLGELPGLHDSTPEDGLWKEKDHN QTLRKY >gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_9|201_bp atgctgcattttgaagatctgaagatggacaaacctttctctgatcatttaaaaagcggc ttgggtgcactgggggtgggagtatggctgcatagagtccagagacccaaacttggagaa ttgccaggtctccacgactctacccctgaggatggcctctggaaggaaaaagaccacaac caaacactgagaaagtactag >gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_10|53_aa MTGILIRRGDEDRDMHRKKPHEDREKTVIYRPRRESTEKTEKTDPADTLILGF 
>gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_10|162_bp atgaccggcatcctgataagaagaggagatgaggacagagacatgcacagaaagaagcca catgaagacagagagaagacggtcatctacaggccaaggagagagtccacagagaaaaca gagaaaactgatcctgctgacaccttgatcttgggtttctag >gi568815593r:9529136_9730032|GENSCAN_predicted_peptide_11|206_aa MAGCRSRALPRGEAAKAWREIERSAGHRSQGHPKGTMRKNKERALCLISVTLGPMSKRYS FLPGSSAKLLPGLREQRAAGSLVPGSTTGHHASQRFLLGCWIEGLRSLPAVGQRPSSSSV PLPLGLQNGSLLHQSEQGSKSGNYDMSSLLLGELEWRALNELESILFVKETANPLAQALG EWRALDKLESILFVKETANPLTPPPA >gi568815593r:9529136_9730032|GENSCAN_predicted_CDS_11|621_bp atggcgggctgcaggtcccgagccctgccccgtggggaggcagctaaggcctggcgagaa atcgagcgcagcgccggtcataggtcacagggtcatcccaaaggaacaatgaggaagaac aaagaaagagccctctgccttatttcggtgacattgggaccaatgtccaagcgctactcc tttctccctggatccagtgcaaagctcctcccaggactaagagagcaaagagcagctggg agccttgtgcccggcagcacaactggtcatcatgccagccagcggttcctcctgggctgt tggattgagggcctcaggtccttaccagctgttggccagagaccatcatcatcatcagtt cctttacccttaggtttacaaaatggcagcttgctccatcagagtgagcaaggatcaaaa agtggcaactatgacatgtcctcactactattgggggaattagagtggagggcactcaat gaattggagagcatactctttgtaaaggagacagccaatcctctggcccaggccttgggt gagtggagggcactcgataagttggagagcatactctttgtaaaggagacagccaatcct ctcactccaccaccagcctga