GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:58:12 Sequence gi568815584r:36199086_36420588 : 221503 bp : 38.15% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 15875 15973 99 1 0 86 89 123 0.945 12.51 1.02 Term + 16659 16856 198 2 0 64 48 99 0.809 -0.08 1.03 PlyA + 16982 16987 6 1.05 2.00 Prom + 24600 24639 40 -5.35 2.01 Init + 45462 45597 136 2 1 46 91 125 0.806 8.95 2.02 Intr + 63111 63302 192 1 0 79 99 45 0.262 3.24 2.03 Intr + 63457 63562 106 2 1 83 107 17 0.110 1.45 2.04 Intr + 73234 73414 181 1 1 81 13 204 0.718 11.15 2.05 Intr + 78917 78985 69 1 0 81 99 57 0.861 4.56 2.06 Term + 80894 81067 174 1 0 50 45 142 0.841 2.88 2.07 PlyA + 81465 81470 6 1.05 3.10 PlyA - 82151 82146 6 1.05 3.09 Term - 83876 83829 48 2 0 98 41 46 0.168 -2.77 3.08 Intr - 87856 87635 222 1 0 50 55 148 0.053 5.10 3.07 Intr - 101738 101700 39 2 0 85 108 30 0.212 2.20 3.06 Intr - 109104 109007 98 1 2 50 67 44 0.158 -2.69 3.05 Intr - 112640 112488 153 0 0 -13 115 175 0.298 9.22 3.04 Intr - 115523 115427 97 2 1 57 88 52 0.750 0.76 3.03 Intr - 115820 115606 215 0 2 28 63 258 0.611 14.81 3.02 Intr - 121301 121004 298 2 1 75 57 130 0.291 4.12 3.01 Init - 121503 121375 129 0 0 74 94 92 0.930 8.60 3.00 Prom - 128999 128960 40 -0.75 4.03 PlyA - 129253 129248 6 1.05 4.02 Term - 130372 130090 283 0 1 52 48 196 0.929 5.91 4.01 Init - 134808 134780 29 0 2 61 65 48 0.590 -1.08 4.00 Prom - 136039 136000 40 -3.85 5.00 Prom + 136256 136295 40 -5.25 5.01 Init + 145937 145984 48 1 0 88 111 52 0.578 8.50 5.02 Intr + 153785 153887 103 1 1 17 53 75 0.028 -4.27 5.03 Term + 155265 155881 617 1 2 36 28 231 0.384 5.54 5.04 PlyA + 156009 156014 6 1.05 6.00 Prom + 158491 158530 40 -3.65 6.01 Init + 159944 160091 148 1 1 72 85 141 0.862 12.50 6.02 Term + 172366 172808 443 2 2 17 35 270 0.004 9.03 6.03 PlyA + 173259 173264 6 1.05 7.00 Prom + 177628 177667 40 -7.15 7.01 Init + 180067 180123 57 0 0 85 73 50 0.161 4.56 7.02 Intr + 188363 188505 143 1 2 55 2 149 0.007 1.23 7.03 Term + 202934 202997 64 2 1 106 48 112 0.025 5.38 7.04 PlyA + 203818 203823 6 1.05 8.00 Prom + 208120 208159 40 -7.75 8.01 Init + 211334 212059 726 1 0 88 -3 429 0.080 28.65 8.02 Term + 213721 213984 264 0 0 62 37 333 0.100 20.12 8.03 PlyA + 214068 214073 6 1.05 9.00 Prom + 215067 215106 40 -5.35 9.01 Init + 215544 216178 635 2 2 47 72 282 0.527 17.36 9.02 Term + 218459 218639 181 2 1 74 49 64 0.139 -2.80 9.03 PlyA + 219510 219515 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 172329 172808 480 2 0 60 35 284 0.930 16.13 S.002 Sngl + 211334 212200 867 1 0 88 43 461 0.861 37.24 S.003 Sngl + 213742 213984 243 0 0 71 37 310 0.886 19.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:36199086_36420588|GENSCAN_predicted_peptide_1|98_aa MESPLQLCLPVHLRQLEDESHQADECVTLFIPQSRGYRVVDSIQIRIPKTLFHPESLRGT QPLRISFSSSGFEGIYRETSKTPQLYPNESLSIMVWQS >gi568815584r:36199086_36420588|GENSCAN_predicted_CDS_1|297_bp atggaatcccctctgcagttatgtttgcctgtgcatctccggcagctggaggacgaatca catcaggctgatgagtgtgtcacgctcttcatcccccaaagcaggggctatcgagtggtt gacagtatccaaatcagaattccaaagacgctgttccatccggaatcattaagaggaaca cagcctcttagaatctcattcagctcctctggatttgaaggaatttacagagaaacttcc aagacacctcagctatatccaaacgaaagcctaagcattatggtttggcagagctga >gi568815584r:36199086_36420588|GENSCAN_predicted_peptide_2|285_aa MRTGFISVEITGDLANAVLVAWWHAENKSLIGVGSGINENDISGNECWIFIHMRVPTWSQ NGCTLTFTIKATGRRRGLYWHVWKASLSGAKERGLGYHYCISQSEILATGELGVDTSEGT MQKTLETEFLDFQTFTSFISYKNVSSPDKAPGLPTPAEAARSRRPALARQLLFEPSSDRK IKSGSEGSGKHCAMSLLKAADASFGVLKNKGKNGLENKESYLPVDMEETLTEADSPTSCF VTGEWKLVMNNECAFLLLKWLRNEPVLTCCKSGKVWVNKSVFDKR >gi568815584r:36199086_36420588|GENSCAN_predicted_CDS_2|858_bp atgagaactggatttatcagtgtggaaatcactggtgatttggcaaatgcagttttggtg gcttggtggcatgctgagaacaaaagcctgattggagtgggttcaggaataaatgagaac gacatcagtgggaatgagtgttggattttcatccacatgcgtgtccccacatggtctcaa aacggctgcactctcactttcacaatcaaggcaacaggaaggaggagggggctatactgg catgtctggaaagcaagcctttctggggcaaaggagagggggcttgggtatcactattgt ataagccagtcggaaatattggccacaggggagttgggagtggatacatcagagggtacc atgcagaagacacttgaaacggagttccttgattttcaaacattcacatcctttatttca tataagaatgtgagctccccagacaaagccccaggactcccgactcccgcggaagctgcg aggagccgcagacctgcccttgccagacagctgctgtttgaacccagttctgaccgtaaa atcaaaagtgggtccgagggaagcggtaaacactgtgcaatgagccttttgaaggcagct gatgcctcatttggggtgctgaaaaacaagggaaaaaatggtcttgaaaacaaggaatca tatttgcctgtggatatggaagagacactgaccgaggctgactctccaactagctgtttt gtcacaggcgagtggaaactagtcatgaacaatgagtgtgcattcttgcttcttaaatgg ctaaggaacgaaccagttttgacatgctgcaaatccggcaaagtctgggttaacaaaagt gtgtttgacaaaagatga >gi568815584r:36199086_36420588|GENSCAN_predicted_peptide_3|432_aa MAAATELNRPSSGDRNLERRCRPNLSREVLYEIFRSLHTLVGQRPGAARPAEVWDAVPKP RCGRSPGPASSVGVRRTAVPVAESFTLQLSSPRVGEVGLSENPAPREAGLERMNVEKLHC EFRRRLAFRVLFPPTYSLWELVAKLQSPIKEENTTAVEEIGRTEMGNKNEVNDKFSIGDL QEEEKHKESDLRDVKKTQIHFDPEVVQIKAGKAEIDRRISAFIERKQAEINENNVREFCN VIDCNQVSRVVNTYGPQTRPEGIPGSGHKPNSMLRDCGNQAVEERLQNIEAHLRLQTGGP VPRDIYQRIKKLEDKILELEGISPEYFQSVSFSGKRRKVQPPQPSCMKHCVIRWGGKSVR GTLESEERSRSEKPCKSLNHWLKEPVGRAGGWPDQSGTLEKSLWLLATVCRMDHYARGKP LSTGSVRSKKSS >gi568815584r:36199086_36420588|GENSCAN_predicted_CDS_3|1299_bp atggctgctgccacggagcttaatcgcccgagcagcggtgacaggaacctggagcgaaga tgcagacccaacctctcccgagaggtgctctacgaaatctttcgctccctacacaccctg gttggacagaggccgggggcagctaggcctgcagaggtgtgggatgcggtcccgaagccg aggtgtggaaggagcccgggcccagcctcctcggtcggagtgaggaggacggcggtaccg gttgcagagtcgtttaccttgcagctgtcctccccaagagtgggtgaggttggcctaagt gaaaacccggctccgagagaggcaggcttggagagaatgaatgtggaaaagcttcactgc gagtttaggaggcgcttagcgttcagggtcctttttcctccaacttactctttatgggaa ttggttgcaaaacttcagtctccgattaaagaggagaatacaactgctgttgaagagata ggaagaacagaaatggggaacaaaaatgaagtaaatgacaaattttccattggcgaccta caagaggaagaaaagcacaaagaaagtgatttaagagatgtgaaaaagacacagatccat tttgatccagaagtagttcagataaaggctggaaaagcagaaattgacagacgaatatct gcatttattgaaagaaagcaagctgaaatcaatgaaaacaacgtcagggaattttgcaat gttattgattgtaatcaagtttctagagttgtgaatacatacggaccacagactagacct gaaggaattccagggtcaggtcataaacctaacagcatgcttcgagactgtggtaatcag gctgtagaagaacgactacaaaatattgaggcccacttgcggttacagacaggtggtcca gtgccaagagacatttatcagagaattaaaaaacttgaggataaaatccttgaattggaa ggcatctctcctgaatattttcagtctgtaagcttttctggaaaaagaagaaaagttcaa ccacctcaaccgtcatgcatgaagcactgtgtcatacgatggggtgggaagagtgtgaga gggacccttgagagtgaagaacggtccagatctgaaaagccttgcaaatcactgaatcac tggcttaaggagccggtgggaagagcaggtggatggcccgatcagagtggcacgttagaa aaatcactctggcttctggctacagtgtgcagaatggatcactatgctcggggtaagccc ctgtccacaggttcagttcgttctaagaaatcatcttaa >gi568815584r:36199086_36420588|GENSCAN_predicted_peptide_4|103_aa MGKPPLKEARGSVDPRTETPKLVAVAFLTPQNPVLDSLHLVSRQGRDRTQRTMQQVLGAK SGKGAHHFCSHFIGQNSVIYQHLTVREAETKSPILEEVHVSKV >gi568815584r:36199086_36420588|GENSCAN_predicted_CDS_4|312_bp atggggaaaccgccccttaaagaggctagggggtctgtggatcctagaacggaaactcct aaacttgtggctgtggcattccttacacctcaaaatcctgtcctggattctctacatctt gtcagcagacagggaagagatagaacacagagaaccatgcagcaggttttaggagccaag tctggaaaaggtgcacatcacttctgctcacatttcattggtcaaaactcagtcatatac cagcacctaactgtaagagaagctgaaactaaatctccaatactggaagaagtccacgtg tccaaagtctga >gi568815584r:36199086_36420588|GENSCAN_predicted_peptide_5|255_aa MASSDKVNAFVTQLDQNALKVEGGIINNCTVTMGINWDCSWTNQDEYHPSYAEKAFDKIQ QPFMLKTLNKLSIDWTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGYPLSPLLFN IVLEVMARAIRQEKEIKGIRLGKEEVKLSLFADDMIVYLENPLVSAQNLLKLISNFSKVS EYKISVQKSQAFLYNNNRQTESQIMSELPFTIASKRIKYLGLQLTRDVKDLFKENYKPLL NEIKEDRNKWKNIPC >gi568815584r:36199086_36420588|GENSCAN_predicted_CDS_5|768_bp atggcttcaagtgacaaagtaaacgcttttgtgactcagttggaccagaacgctttgaaa gtagaaggcggcattatcaataattgtactgtgacaatgggcataaactgggactgttcc tggacaaaccaggatgaatatcatcctagttatgcagaaaaggcctttgacaaaattcaa cagcccttcatgctaaaaactctcaataaattaagtattgattggacgtatctcaaaata ataagagctatttatgacaaacccacagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaaaactggcacaagacagggatatcctctctcaccactcctattcaac atagtgttggaagttatggccagggcaatcaggcaggagaaagaaataaagggtattcga ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatttagaa aaccccctcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca gaatacaaaataagtgtgcaaaaatcgcaagcattcttatacaacaataacagacaaaca gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggactccaacttacaagggatgtgaaggacctcttcaaggagaactataaaccactgctc aacgaaataaaagaggacagaaacaaatggaaaaacattccatgttaa >gi568815584r:36199086_36420588|GENSCAN_predicted_peptide_6|196_aa MTKKCGQVLGVEGGVQKETKTSVLNQQRRSLEADLFKAKALVRPQGLQPGSPQMLTEENS RDDSGASQISSETLIKNLSNLTINASSESVSPLLEALLRRESVGAAVLREIEDEWLYSRR GVRTLLSVQREKMARLRYMLLGGVRTHERRPTNKEPKGVKKESRPFKCPCSFCVSNGWDP SENARIENQDTKPLQP >gi568815584r:36199086_36420588|GENSCAN_predicted_CDS_6|591_bp atgacaaagaaatgtggacaggttctaggagttgagggtggtgtccagaaggaaactaaa acttcagtcctgaaccagcaaagaaggagcttggaagcagatctttttaaagccaaggct ctggtgagaccacagggacttcagcctgggtctccacaaatgctcaccgaagaaaattcc cgggacgattcaggggcctctcaaatctcctccgagacgttgataaagaaccttagtaac ttgactatcaacgctagtagcgaatctgtttcccctctattggaagctttactccgtcga gagtctgtgggggcagcagtcctcagggaaatcgaagatgagtggctttacagcaggaga ggagtaagaacactgctgtctgtgcagagagaaaagatggcaagattgagatacatgtta ctgggcggagttcgtacgcatgaaagaagaccaacaaacaaggagcctaagggagttaag aaggaatcaagaccattcaaatgtccctgcagtttctgcgtgtctaatggatgggatcct tctgagaatgctagaatagagaatcaagacaccaagccacttcagccataa >gi568815584r:36199086_36420588|GENSCAN_predicted_peptide_7|87_aa MAQEVPRSHSSINLSVGDLELPQPPQPLATTTLISQQPSTSRQDPPSAKRLQLAEDSDDH QHFLTVKSENRQLNEEPEKHSPDNLSS >gi568815584r:36199086_36420588|GENSCAN_predicted_CDS_7|264_bp atggcccaggaagttccaagatcccactcctcaatcaatcttagtgtaggagacttggaa ttgccacaaccaccccaacctttagcaaccacaaccctgatcagtcagcagccatcaaca tcaaggcaagatcctccatcagcaaaaagattacaacttgctgaagactcagatgatcat cagcattttttaacagtaaagtcagaaaaccgtcagctaaatgaggaaccagagaaacac agtccagataatctgagctcttaa >gi568815584r:36199086_36420588|GENSCAN_predicted_peptide_8|329_aa MGRNQIRKAENSKNQSTSSPPKDCSSSLAMKQSWMENDFDELTGAGFRRLVITDFSELKE DVQTHHKEAKNLEKRLEKWVTRINSEEKTLNYLMELKTMAQELCDACTSFSSQFDQVEER VSVIEDQINKMKLVEKFREKRVKRNEQSLQEIQNYVKRPNLHLIGVPESDGENQTKLENT LQDIIQENFPNLARQANIQIQEIQRTPQRCSSRRATPRHIIIRFTKVEMKEKMLRAAREK GRERSSSPAMEQSSTENDFDKLREEGFRRSNYSELKEEVRTHGKEDKNLEKRLDEWRTTT TNAEKSLKDLMELKTMAQELCDECTSLSS >gi568815584r:36199086_36420588|GENSCAN_predicted_CDS_8|990_bp atggggagaaaccagatcagaaaagctgaaaattctaaaaaccagagcacctcttctcct ccaaaggattgcagctcctcactagcaatgaaacaaagctggatggagaatgactttgac gagttgacaggagcaggttttagaaggttggtaattacagacttctcagagctaaaggag gatgttcaaacccatcacaaggaagctaaaaaccttgaaaaaagattagaaaaatgggta actagaataaacagtgaagagaagaccttaaattacctgatggagctgaaaaccatggca caagaactatgtgatgcatgcacaagcttcagtagccaatttgatcaagtggaagaaagg gtatcagtgattgaagatcaaattaacaaaatgaagctagtagagaagttcagagaaaaa agagtaaaaagaaacgaacaaagcctccaagaaatacagaactatgtgaaaagaccaaat ctacatttgattggtgtacctgaaagtgatggggagaatcaaaccaagctggaaaacact ctgcaggatattatccaggagaacttccccaacctagcaaggcaggccaacattcaaatt caggaaatacagagaacaccacaaagatgctcctcaagaagagcaactccaagacacata attatcaggttcaccaaggttgaaatgaaggaaaaaatgttaagggcagccagagagaaa ggtcgggaacgcagctcctcaccagcaatggaacaaagctcgacggagaatgactttgac aagttgagagaagaaggcttcagacgatcaaactactctgagctaaaggaggaagtgcga acccatggcaaagaagataaaaaccttgaaaaaagattagatgaatggcgaactacaaca accaatgcagagaagtccttaaaggacctgatggagctgaaaaccatggcacaagaacta tgtgacgaatgcacaagccttagtagctga >gi568815584r:36199086_36420588|GENSCAN_predicted_peptide_9|271_aa MFFETNENKDITYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTH SKASRRQEITKIREELKEIKTQKTLQIINESRSWFFEKINKIDRPLARLIKKKREKNQID AIKNDKGDITTDPTKIQTTIREYYKHFYANKLENLEEMDKFLDTYTLPRLNQEEVESLNR PITGSEIEAIINSLPTKQSPGPDGFTAEFYWRDMDEAGNHHSQQTITRTENQTPHVFTRG WELYDQNTWTQGGEHYTPGPVVGWGLEEGQH >gi568815584r:36199086_36420588|GENSCAN_predicted_CDS_9|816_bp atgttctttgaaaccaatgagaacaaagacataacataccagaatctctgggacacattc aaagcagtgtgtagagggaaatttatagcactaaatgcccacaagagaaagcaggaaaga tctaaaattgacaccctaacatcacaattaaaagaactagagaagcaagagcaaacacat tcaaaagctagcagaaggcaagaaataactaagatcagagaagaactgaaggaaataaag actcaaaaaacccttcaaataatcaatgaatccaggagctggttttttgaaaagatcaac aaaattgatagaccactagcaagactaataaagaagaaaagagagaagaatcaaatagat gcaataaaaaatgataaaggggatatcaccaccgatcccacaaaaatacaaactaccatc agagaatactataaacacttctatgcaaataaactagaaaatctagaagaaatggataaa ttcctggacacatacaccctcccaagactaaaccaggaagaagttgaatctctgaataga ccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaacaaagtcca ggaccagacggattcacagccgaattctactggagggacatggatgaagctggaaaccat cattctcagcaaactatcacaaggacagaaaaccaaacacctcatgttttcactcgtggg tgggaattgtacgatcagaacacttggacacagggtggggaacattacacaccagggcct gttgtggggtggggactggaggagggacaacattag