GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:15:39 Sequence gi568815583r:51110811_51342912 : 232102 bp : 43.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1567 1562 6 1.05 1.05 Term - 3510 3497 14 1 2 126 50 9 0.356 -0.74 1.04 Intr - 10009 9880 130 1 1 100 115 44 0.289 8.57 1.03 Intr - 19435 19359 77 2 2 60 44 13 0.003 -6.67 1.02 Intr - 25437 25297 141 1 0 75 75 27 0.382 0.42 1.01 Init - 25978 25960 19 1 1 84 106 58 0.399 5.43 1.00 Prom - 29140 29101 40 -4.36 2.02 PlyA - 30303 30298 6 1.05 2.01 Sngl - 47265 47128 138 0 0 72 43 173 0.188 4.80 2.00 Prom - 53777 53738 40 -4.66 3.00 Prom + 61601 61640 40 -2.06 3.01 Sngl + 63433 64092 660 0 0 65 43 299 0.406 19.28 3.02 PlyA + 64320 64325 6 1.05 4.00 Prom + 64489 64528 40 -4.96 4.01 Sngl + 64582 67983 3402 0 0 44 47 874 0.498 69.37 4.02 PlyA + 68210 68215 6 1.05 5.12 PlyA - 68364 68359 6 1.05 5.11 Term - 71377 71265 113 2 2 78 42 137 0.924 6.82 5.10 Intr - 99812 99682 131 1 2 54 48 54 0.147 -1.66 5.09 Intr - 100246 100014 233 1 2 82 39 202 0.340 11.27 5.08 Intr - 101751 101510 242 1 2 110 64 253 0.998 22.37 5.07 Intr - 104422 104260 163 1 1 52 91 51 0.959 1.35 5.06 Intr - 105007 104893 115 0 1 49 69 116 0.949 6.25 5.05 Intr - 107845 107731 115 2 1 99 75 58 0.949 5.11 5.04 Intr - 111715 111539 177 2 0 76 36 180 0.770 11.49 5.03 Intr - 117123 116969 155 2 2 116 121 82 0.979 14.02 5.02 Intr - 126199 126049 151 2 1 73 89 196 0.804 17.42 5.01 Init - 132102 131958 145 0 1 71 111 83 0.500 9.18 5.00 Prom - 133853 133814 40 -5.76 6.00 Prom + 141103 141142 40 -5.06 6.01 Init + 144817 144913 97 0 1 76 75 117 0.867 9.67 6.02 Intr + 146305 146330 26 2 2 102 99 15 0.669 1.74 6.03 Intr + 147883 147938 56 0 2 27 116 58 0.534 0.28 6.04 Intr + 155072 155232 161 2 2 83 93 41 0.670 3.73 6.05 Intr + 163250 163321 72 0 0 70 89 48 0.545 2.48 6.06 Term + 167173 167357 185 2 2 123 44 64 0.335 3.21 6.07 PlyA + 168211 168216 6 1.05 7.05 PlyA - 168472 168467 6 1.05 7.04 Term - 185031 184902 130 2 1 135 53 79 0.759 6.75 7.03 Intr - 190337 190189 149 2 2 53 63 56 0.024 -1.37 7.02 Intr - 200139 200063 77 1 2 54 78 44 0.045 -0.77 7.01 Init - 204467 204362 106 2 1 74 102 97 0.554 8.94 7.00 Prom - 208858 208819 40 -4.76 8.00 Prom + 212752 212791 40 -5.66 8.01 Init + 230875 231237 363 0 0 72 82 585 0.965 51.35 8.02 Term + 231242 231370 129 1 0 83 37 45 0.292 -2.92 8.03 PlyA + 231671 231676 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:51110811_51342912|GENSCAN_predicted_peptide_1|126_aa MQPRRLGPKSPSLAPSKFRKSVLINARAINPLKENLRTSGQWEKVLTGKLDAAVSVLVSE AWVEVEYYPSRPGPQNYPLAMKEFSNLLKVGHKSLIPETSSLYLEERNVTQRHREASEQT GHDETS >gi568815583r:51110811_51342912|GENSCAN_predicted_CDS_1|381_bp atgcagccccgccgcctgggtcccaaatcccccagcttagccccttccaagttcaggaaa tctgtgctcattaatgcaagagccattaaccccttgaaagagaacctcagaacctcagga caatgggagaaagtgctgacaggcaagcttgacgcagcagtctcagttctagtcagtgaa gcatgggtggaagtggagtattacccttccaggcctggcccacaaaactacccacttgcc atgaaagaattctctaatcttctaaaagtaggtcataaaagcctcattccagagacatcc tccttatacctggaggaaagaaatgtcacacagagacacagagaagcatctgaacaaaca ggccatgatgaaacatcgtga >gi568815583r:51110811_51342912|GENSCAN_predicted_peptide_2|45_aa MRKPKEVPKSGGSKMFWSNDRQSAITVMKPFLCWLLALEALLSVD >gi568815583r:51110811_51342912|GENSCAN_predicted_CDS_2|138_bp atgaggaagcccaaagaggtccctaagagtgggggctccaagatgttctggtcaaatgac aggcagtcggccatcaccgtcatgaagccctttctctgctggcttctggccttggaggct ctcctgagtgtggactga >gi568815583r:51110811_51342912|GENSCAN_predicted_peptide_3|219_aa MEDEMNEMKQEGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENATKLGNTLQD IIQENFPNLARQANNVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGR VTLKGKPIRLTVDLSAETLQARREWGPIFNILKGKNFQPRISYPAKLSFISEGEIKYFTD KQMLRDFVTTRPALKELLKEALNMERNNQYQPLQNHAKM >gi568815583r:51110811_51342912|GENSCAN_predicted_CDS_3|660_bp atggaagatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatgcaaccaagttgggaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacaacgttcagattcaggaa atacagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtc agattcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcga gttaccctcaaagggaagcccatcagactaacagtggatctctcggcagaaaccctacaa gccagaagagagtgggggccaatattcaacattcttaaaggaaagaattttcagcccaga atttcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagac aagcaaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaa gcgctaaacatggaaaggaacaaccagtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815583r:51110811_51342912|GENSCAN_predicted_peptide_4|1133_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQGDLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKWKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYSNKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGAEIVAIINSLPTKKIPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILAKRIQQHIKKLIHHD EVGFIPGMQGWFNIHKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGI DGMYFKIIRAIYDKPTANIMLNGQKLEVFPLKSSTRQGCPLSPLLFNMVLNVLARAIRQE KAIKGIQIGREEVKLSLFADDVIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNI PCSWVGRINIVKMAILPKVIYRFNAIPINLPMTFFTELEKTTLKFIWNQKRARIAKSILS QKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMLHTYNYLIFDKPEK NKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLG ITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAGETTIRVNRQPTTWEKIFAT YSSDKGLISRIYNELKQIYKKKTNNPIKKWARDMNRHFSKEDIYAAKKHMKKCSSSLAIR EMQIKTTMRYHLTPVRMAIIEKSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKSVWRFLR DLELEIPFDPAIPLLGIYPKDYKSCCYKDTCTRMFIAALFTIAKTWNQPKCPTMIDWIKK MWHIYTMEYYAAIKNDEFMSFVGTWMKLETIILSKLSQEQKTKYRIFSLIGGN >gi568815583r:51110811_51342912|GENSCAN_predicted_CDS_4|3402_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaaggggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagctctcctcagcaaatggaaaagaacagaaatt ataacaaactatctctcagaccacagcgcaatcaaactagaactcaggattaagaatctc actcaaagccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggatgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacagttaaaa gaactagaaaagcaagagcaaacacattccaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccactagcaagactaataaag aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctattcaaataaa ctagaaaatctagaagaaatggataaattcctcgacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc aatagtttaccaaccaaaaagattccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccgggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaacgaatccagcagcacatcaaaaagcttatccaccatgat gaagtgggcttcatccctgggatgcaaggctggttcaatatacacaaatcaataaatgta atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggatgtatttcaaaataataagagctatctatgacaaacccacagccaatatcatg ctgaatgggcaaaagctggaagtattccctttgaaaagcagcacaagacaaggatgccct ctctcaccactcctattcaacatggtactgaatgttctggccagggcaatcaggcaagag aaagcaataaagggtattcaaataggaagagaggaagtcaaattgtccctgtttgcagat gacgtgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctaata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattccta tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatt tatagattcaatgccatccccatcaacctaccaatgactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacactacctgacttcaagctatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataatgctgcatacctacaactatctgatctttgacaaacctgagaaa aacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaaga tggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggc attaccattcaggacataggcgtgggcaaggacttcatgtccaaaactccaaaagcaatg gcaaccaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcg ggggaaactaccatcagagtgaacaggcaacctacaacatgggagaaaattttcgcaacc tactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaag aaaaaaacaaacaaccccatcaaaaagtgggcgagggacatgaacagacacttctcaaaa gaagacatttatgcagccaaaaaacacatgaaaaaatgctcatcatcactggccatcaga gaaatgcaaatcaaaaccactatgagataccatctcacaccagttagaatggcaatcatt gaaaagtcaggaaacaacaggtgctggagaggatgtggagaaataggaacacttttacac tgttggtgggactgtaaactagttcaaccattgtggaagtcagtgtggcgattcctcagg gatctagaactagaaataccatttgacccagccatcccattactgggtatatacccaaag gactataaatcatgctgctataaagacacatgcacacgtatgtttattgcggcattattc acaatagcaaagacttggaaccaacccaaatgtccaacaatgatagactggatcaagaaa atgtggcacatatacaccatggaatactatgcagccataaaaaatgatgagttcatgtcc tttgtagggacatggatgaaattggaaaccatcattctcagtaaactatcgcaagaacaa aaaaccaaataccgcatattctcactcataggtgggaattga >gi568815583r:51110811_51342912|GENSCAN_predicted_peptide_5|579_aa MVLEMLNPIHYNITSIVPEAMPAATMPVLLLTGLFLLVWNYEGTSSIPGPGYCMGIGPLI SHGRFLWMGIGSACNYYNRVYGEFMRVWISGEETLIISKSSSMFHIMKHNHYSSRFGSKL GLQCIGMHEKGIIFNNNPELWKTTRPFFMKALSGPGLVRMVTVCAESLKTHLDRLEEVTN ESGYVDVLTLLRRVMLDTSNTLFLRIPLDESAIVVKIQGYFDAWQALLIKPDIFFKISWL YKKYEKSVKDLKDAIEVLIAEKRRRISTEEKLEECMDFATELILAEKRGDLTRENVNQCI LEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIIKEIQTVIGERDIKIDDIQKLKVMENFI YESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNIGRMHRLEFFPKPNEFTLENFAK NVPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVESIQKIHDLSLH PDETKNMLEMIFTPRNSDREPGYKRKSRGQEFEGEIVGEETVSIKTRFHQMCFEKDRPSL TKWEGNCQVMRTRKQPYRKVHMAKNQGLLPIVSINLPAL >gi568815583r:51110811_51342912|GENSCAN_predicted_CDS_5|1740_bp atggttttggaaatgctgaacccgatacattataacatcaccagcatcgtgcctgaagcc atgcctgctgccaccatgccagtcctgctcctcactggcctttttctcttggtgtggaat tatgagggcacatcctcaataccaggtcctggctactgcatgggaattggacccctcatc tcccacggcagattcctgtggatggggatcggcagtgcctgcaactactacaaccgggta tatggagaattcatgcgagtctggatctctggagaggaaacactcattatcagcaagtcc tcaagtatgttccacataatgaagcacaatcattacagctctcgattcggcagcaaactt gggctgcagtgcatcggtatgcatgagaaaggcatcatatttaacaacaatccagagctc tggaaaacaactcgacccttctttatgaaagctctgtcaggccccggccttgttcgtatg gtcacagtctgtgctgaatccctcaaaacacatctggacaggttggaggaggtgaccaat gaatcgggctatgtggacgtgttgacccttctgcgtcgtgtcatgctggacacctctaac acgctcttcttgaggatccctttggacgaaagtgctatcgtggttaaaatccaaggttat tttgatgcatggcaagctctcctcatcaaaccagacatcttctttaagatttcttggcta tacaaaaagtatgagaagtctgtcaaggatttgaaagatgccatagaagttctgatagca gaaaaaagacgcaggatttccacagaagagaaactggaagaatgtatggactttgccact gagttgattttagcagagaaacgtggtgacctgacaagagagaatgtgaaccagtgcata ttggaaatgctgatcgcagctcctgacaccatgtctgtctctttgttcttcatgctattt ctcattgcaaagcaccctaatgttgaagaggcaataataaaggaaatccagactgttatt ggtgagagagacataaagattgatgatatacaaaaattaaaagtgatggaaaacttcatt tatgagagcatgcggtaccagcctgtcgtggacttggtcatgcgcaaagccttagaagat gatgtaatcgatggctacccagtgaaaaaggggacaaacattatcctgaatattggaagg atgcacagactcgagtttttccccaaacccaatgaatttactcttgaaaattttgcaaag aatgttccttataggtactttcagccatttggctttgggccccgtggctgtgcaggaaag tacatcgccatggtgatgatgaaagccatcctcgttacacttctgagacgattccacgtg aagacattgcaaggacagtgtgttgagagcatacagaagatacacgacttgtccttgcac ccagatgagactaaaaacatgctggaaatgatctttaccccaagaaactcagacagagaa ccaggctacaagagaaaaagcagaggccaagagtttgagggagaaatagtcggtgaagaa accgtatccataaagacccgattccaccaaatgtgctttgagaaggataggccttcatta acaaaatgggaaggcaactgccaggtcatgaggacacgcaagcagccctacagaaaagtc cacatggccaagaatcaaggcctcctgccaatagtcagcatcaacttgccagccctgtga >gi568815583r:51110811_51342912|GENSCAN_predicted_peptide_6|198_aa MPKFSGDKIHGLLDINPKFHMKAVISGPEKSAASILPSLLLSLEELTEVYDMYLLPEILS CLSFIRIPGCPLSQSGYRLEDMPGLDKSSDSSRNIANLASANLEGNGWLPLHKELPDKRR LPLHQVESGTGHVLSTAGKCANIKVTWVCWSLLVFGLLKMVSFANDARLFKNQVAFLAIT LRVPSFIKVLLVAKRSVA >gi568815583r:51110811_51342912|GENSCAN_predicted_CDS_6|597_bp atgcccaagttcagtggagacaaaatccatggcttgcttgatattaatcccaagttccac atgaaagctgttatcagcggtcctgagaaatcagcagcatcaatactgccctcgttgctg ctgtctctggaagaactaacagaggtgtacgatatgtacctattgcctgaaattctaagc tgtttgagcttcattcggatccctggatgcccactgagccagtcaggctacaggctagag gatatgcctggtctggataaatctagtgatagctccagaaacatagcaaatctagcttcc gccaaccttgagggaaatggctggcttcctttgcacaaagaattgccagataaaaggaga cttcctttgcaccaggtagagagtgggactgggcacgtgctgtccacggcaggaaaatgt gcaaatatcaaggttacctgggtctgctggtcacttctagtttttggactcttgaagatg gtgagttttgctaatgatgcgcggctgtttaagaaccaggtggcttttctggcaatcact cttcgtgtgccatcttttataaaggtgcttcttgttgctaagagatcagttgcttag >gi568815583r:51110811_51342912|GENSCAN_predicted_peptide_7|153_aa MERMCRSTHNLLTGLQQAWVTVVVKVGSVEKHNQQGSGLNVSSVFKAFVNAVCVPHMHIL EVASKAQFSFDHSKPQRNTSLMLNVAVTFGSDFPSDSSPPCELVPLSSLERPPCCQDFLT EGLGWLPEAIQNWTRGTQLNLNVLLSAQEMGRC >gi568815583r:51110811_51342912|GENSCAN_predicted_CDS_7|462_bp atggagagaatgtgcagaagcacgcataaccttctgaccggactgcagcaggcctgggtg acggtggtggtgaaggtggggagtgttgagaaacacaaccagcaaggcagtggtctcaac gtcagctcagttttcaaagcctttgttaatgctgtttgtgttccacacatgcacatcttg gaggttgcctcaaaagcccagttttcctttgatcattccaagccccagcgaaatacctct ttaatgcttaatgtggcagtcacctttggatctgacttcccttctgacagttcaccaccc tgtgaactggttccattgtcttccctggaaagacctccgtgctgccaggacttcctaacg gaagggcttggctggctccccgaagccatccagaactggacccgagggacccagctcaac ttgaatgtccttttatctgcccaggagatgggaagatgctga >gi568815583r:51110811_51342912|GENSCAN_predicted_peptide_8|163_aa MARGAEGGRGDAGWGLRGALAAVALLSALNAAGTVFALCQWRGLSSALRALEAQRGREQR EDSALRSFLAELSRAPRGASAPPQDPASSARNKRSHSGEPAPHIRAESHDMLMMMTYSMV PAGSLFPVAPRPGGRLGVWGEDADALFSPGQAARCCGEGCGHE >gi568815583r:51110811_51342912|GENSCAN_predicted_CDS_8|492_bp atggcccgaggcgctgagggaggccgtggggacgcgggttggggcctgcgtggcgccctg gcggccgtggcgctgctctcggcgctcaacgctgcgggcacggtgttcgcgctgtgccag tggcgcgggctgagctcggcgctgcgggctttggaggcgcagcggggccgggagcagcgc gaggacagtgccctgcgctccttcctggccgagttgagccgcgcgccgcgcggggcgtcc gcaccaccccaagacccggccagctcagctcgcaacaagcgcagccacagcggcgagccc gcgccgcatatccgcgccgagagccatgacatgctgatgatgatgacctactccatggtg ccggcggggtctctgttccccgtggcgccccggccaggtgggcggctgggggtgtggggc gaggacgccgacgccctcttctcccctggccaggctgcgaggtgctgcggagagggctgt gggcatgagtag