GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:26:50

Sequence gi568815591r:16362551_16565668 : 203118 bp : 38.38% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    272    267    6                               1.05
 1.03 Term -   2644   2540  105  1  0   42   55    95 0.143  -0.97
 1.02 Intr -  13691  13542  150  2  0  106  110    83 0.866  11.74
 1.01 Init -  17700  17323  378  0  0   42   54   152 0.173   4.15
 1.00 Prom -  17859  17820   40                             -10.25

 2.02 PlyA -  18222  18217    6                               1.05
 2.01 Sngl -  19056  18226  831  0  0   49   49   381 0.841  25.78
 2.00 Prom -  19149  19110   40                              -6.15

 3.07 PlyA -  19318  19313    6                               1.05
 3.06 Term -  20394  19675  720  0  0  -35   29   503 0.094  24.59
 3.05 Intr -  36046  35910  137  2  2   14   94    83 0.246   0.87
 3.04 Intr -  37171  37043  129  2  0  -10   86   132 0.447   2.95
 3.03 Intr -  37688  37415  274  2  1   36   77   187 0.494   8.59
 3.02 Intr -  43787  43511  277  1  1   51   98   245 0.304  18.40
 3.01 Init -  58772  58516  257  2  2  112   80   195 0.626  15.85
 3.00 Prom -  61302  61263   40                              -2.15

 4.03 PlyA -  61986  61981    6                               1.05
 4.02 Term -  71279  71224   56  0  2   91   41    76 0.532  -0.06
 4.01 Init -  77558  77459  100  2  1   62    2   132 0.299   2.47
 4.00 Prom -  94695  94656   40                              -1.35

 5.03 PlyA -  96155  96150    6                               1.05
 5.02 Term - 100413  99998  416  1  2  115   42   374 0.995  29.94
 5.01 Init - 103118 102914  205  2  1   78   98   114 0.899   8.98
 5.00 Prom - 103225 103186   40                              -4.15

 6.03 PlyA - 104380 104375    6                               1.05
 6.02 Term - 117358 117209  150  1  0   74   49    72 0.477  -1.17
 6.01 Init - 122316 122149  168  0  0   45   64   134 0.438   6.28
 6.00 Prom - 130991 130952   40                              -5.15

 7.00 Prom + 134175 134214   40                              -5.85
 7.01 Init + 134774 134825   52  1  1   94  106    53 0.779   9.17
 7.02 Term + 139145 139404  260  0  2   19   38   193 0.589   2.13
 7.03 PlyA + 139539 139544    6                               1.05

 8.00 Prom + 146088 146127   40                              -7.45
 8.01 Init + 150269 150320   52  1  1   58   86    65 0.446   4.67
 8.02 Intr + 164117 164290  174  0  0   72   35   211 0.679  13.19
 8.03 Intr + 164308 164492  185  2  2   19  102   111 0.709   4.19
 8.04 Intr + 184213 184741  529  0  1   76   -8   304 0.207  11.18
 8.05 Intr + 184930 185129  200  2  2    1   38   171 0.125   1.55
 8.06 Term + 185189 185383  195  1  0   89   55   134 0.936   6.63
 8.07 PlyA + 186513 186518    6                               1.05

 9.03 PlyA - 186678 186673    6                               1.05
 9.02 Term - 188970 188775  196  2  1   93   41    77 0.107  -0.60
 9.01 Init - 200039 199978   62  2  2  106   74    59 0.817   7.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:16362551_16565668|GENSCAN_predicted_peptide_1|210_aa
MIISIDAEKAFDKIQQLFMLKTLNKLGIDGTYLKIIRAVFDKPTANIILNGQKLEAFPLK
TGTRQGCPLSPLLFNVVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS
AQNLLKAAGAIRPLVSTVVSPSADGCLDYSLERARHRASEMPQAFLFDVIYEAYQQGEAR
SYDLYHGSLSPEEAIGRNQRLWLLSSVKGP

>gi568815591r:16362551_16565668|GENSCAN_predicted_CDS_1|633_bp
atgattatctcaatagatgcagaaaaggcctttgacaaaattcagcaactcttcatgcta
aaaactctcaataaattaggtattgatgggacgtatctcaaaataataagagctgtcttt
gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa
actggcacaagacagggatgccctctctcaccactcctattcaacgtagtgttggaagtt
ctggccagggcaattaggcaggagaaggaaataaagggtattcagttgggaaaagaggaa
gtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctca
gcccaaaatctccttaaggcagcaggagccattcgacctcttgtatctactgtcgtcagt
ccatctgctgatggttgcttagactactcgctagaacgtgccagacacagagcaagtgaa
atgccccaagcttttctatttgatgtgatttatgaagcatatcagcagggagaagccaga
tcctatgacctgtatcatggttctctgtctccagaggaagctataggtagaaatcagaga
ctctggctgctgtcctcagttaaaggaccatga

>gi568815591r:16362551_16565668|GENSCAN_predicted_peptide_2|276_aa
MGDFNTPLSTLDRSMRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHQTYS
KIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNHSTTWKLNNLLRNDYWI
HNEMKAEIKMFFETNQNNETTYRNLWDSFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK
KKREKNQIDTMKNDKGDITTDPTEIRTTIREYLKHL

>gi568815591r:16362551_16565668|GENSCAN_predicted_CDS_2|831_bp
atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaagttaac
aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccaaacctattcc
aaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt
ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc
actcaaaaccactcaactacatggaaactgaacaacctgctccggaatgactactggata
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaaccagaacaatgaaaca
acataccggaatctgtgggactcattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa
atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc
aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag
aagaaaagagagaagaatcaaatagacacaatgaaaaatgataaaggggatatcaccact
gatcccacagaaatacgaactaccatcagagaatacctcaaacacctctag

>gi568815591r:16362551_16565668|GENSCAN_predicted_peptide_3|597_aa
MEAGPPGSARPAEPGPCLSGQRGADHTASASLQSVAGTEPGRHPQAVAAVLPAGGCGERM
GVPTPKQFCPILERPLISYTLQALERVCWIKDIVVAVTGENMEVMKSIIQKYQHKRISLV
EAGVTRHRSIFNGLKALAEDQINSKLSKPEVVIIHDAVRPFVEEGVLLKVVTAAKEHGIL
CQPRVGHMPTMFQSCQSRHKRVITSIMSITCQPCWSRVGHVSHSSHVTVMSYTLPITCQS
RVVYSSRTNHVIRESVTCQMRQSRVNRVTFMSVMLVTCCSRVTGHASATCHTLITCRSCV
DRVSHVLDTCQSSTSVTCHTCRSLDDHVSLMSIICHILDLCQSRANDIGHVPIMSIAHEE
IQAKGKEVKNFEKNLDECITRITNTEKCLKELMELKAKARELREVCRSLRSRCDQLEERV
SAMEDEMNEMKPEGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTL
QDIIQENFPNLARQANVQIQEIQRMPQRYSSRRATPRYIIVRFTKVEMKEKMLRAAREKG
WVTRKGKPIRLTVDLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIK

>gi568815591r:16362551_16565668|GENSCAN_predicted_CDS_3|1794_bp
atggaggccgggccgccgggcagcgccaggccggcggagccgggtccttgcctgagtggt
cagcgcggcgcggaccacacggcttccgcctccctgcagagcgtggccgggaccgagccc
gggcgccacccgcaagccgtggcagctgtgttgcctgccggggggtgcggggagaggatg
ggggtccccaccccgaagcaattctgccccatcctggagaggccgctcatcagctacacc
ctacaggccctggagagagtatgttggataaaggacattgttgtggcagtaactggagag
aacatggaagtaatgaaaagtattattcagaagtatcagcataaacgcatctcactggtc
gaagctggagtgacccgccacaggtcaattttcaatggactaaaagcactggcagaagat
cagatcaactctaaactctctaagccagaagtagtgattatccatgatgctgtgagacca
tttgttgaggaaggtgtccttcttaaagttgtcacagctgctaaggaacacgggatcttg
tgtcagccacgtgttggacacatgccaaccatgtttcaatcgtgtcagtcacgtcacaaa
cgtgtcatcacgtcaatcatgtcaatcacgtgtcaaccatgttggtcacgtgttggtcat
gtatcacactccagtcacgtgacagtcatgtcatacacgttgccgatcacgtgccaatca
cgtgttgtgtattcgtcacgtactaatcatgtcattcgcgagtcagtcacatgtcaaatg
cgtcagtcacgtgtcaatcgtgtcacattcatgtcagtcatgttggtcacgtgttgctca
cgtgtcactggtcatgcgtcggccacgtgtcacacgttgattacatgtcgatcatgtgtc
gatcgggtcagtcacgtgttggacacatgccaatcgtcaacgtcagtcacgtgccacacg
tgtcgatcacttgacgatcacgtgtcacttatgtcgatcatttgtcacattttggaccta
tgccaatcacgtgccaatgatataggtcatgtgccaatcatgtcaatcgcacacgaggaa
attcaagccaaaggcaaagaagttaaaaactttgaaaaaaatttagacgaatgtataact
agaataaccaatacagagaagtgcttaaaggagctgatggagctgaaagccaaggctcga
gaactacgtgaagtatgcagaagcctcaggagccgatgcgatcaactggaagaaagggta
tcagcgatggaagatgaaatgaatgaaatgaagccagaagggaagtttagagaaaaaaga
ataaaaagaaacgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatcta
cgtctgattggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctg
caggatattatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcag
gaaatacagagaatgccacaaagatactcctcaagaagagcaactccaagatacataatt
gtcagattcaccaaagttgaaatgaaagaaaaaatgttaagggcagccagagagaaaggt
tgggttacccgcaaagggaagcccatcagactaacagtggatctctcagcagaaactcta
caagccagaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaaccc
agaatttcatatccagctaaactaagcttcataagtgaaggagaaataaaatag

>gi568815591r:16362551_16565668|GENSCAN_predicted_peptide_4|51_aa
MTSTEDTIVKKIDIILHTYTDGKVEIGFTTLENVESEAGIQQAMAKVKVTP

>gi568815591r:16362551_16565668|GENSCAN_predicted_CDS_4|156_bp
atgacaagtacagaagatacaattgtgaagaaaatagatataatccttcacacatacact
gacggcaaagtggaaattggttttaccactttggaaaacgtggaatcagaagctggaata
cagcaggcaatggcaaaagtgaaagtgacaccatga

>gi568815591r:16362551_16565668|GENSCAN_predicted_peptide_5|206_aa
MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF
SNTGLDRNTRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGT
KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITVVTACKCKRYTRQHNESSHNFE
SMSPAKPVQHHRERKRASKSSKHSMS

>gi568815591r:16362551_16565668|GENSCAN_predicted_CDS_5|621_bp
atgcttcctcctgccattcatttctatctccttccccttgcatgcatcctaatgaaaagc
tgtttggcttttaaaaatgatgccacagaaatcctttattcacatgtggttaaacctgtt
ccagcacaccccagcagcaacagcacgttgaatcaagccagaaatggaggcaggcatttc
agtaacactggactggatcggaacactcgggttcaagtgggttgccgggaactgcgttcc
accaaatacatctctgatggccagtgcaccagcatcagccctctgaaggagctggtgtgt
gctggcgagtgcttgcccctgccagtgctccctaactggattggaggaggctatggaaca
aagtactggagcaggaggagctcccaggagtggcggtgtgtcaatgacaaaacccgtacc
cagagaatccagctgcagtgccaagatggcagcacacgcacctacaaaatcacagtagtc
actgcctgcaagtgcaagaggtacacccggcagcacaacgagtccagtcacaactttgag
agcatgtcacctgccaagccagtccagcatcacagagagcggaaaagagccagcaaatcc
agcaagcacagcatgagttag

>gi568815591r:16362551_16565668|GENSCAN_predicted_peptide_6|105_aa
MFCEDSEDPLEEIKKKLHCSQALIVNSGSDQRMLIPPYFQVAPMSTSEFWEISYCGIEHG
GVSDEDEDLRYKFSKGCRPCFESYDDFTGSKGVWQAGILKLCQHF

>gi568815591r:16362551_16565668|GENSCAN_predicted_CDS_6|318_bp
atgttctgcgaagactcagaagatccacttgaagaaataaagaagaagctccattgttcc
caggccttaatagtcaatagtggcagtgaccaaaggatgttgattcctccatattttcag
gttgcaccaatgagtacaagtgaattttgggagatatcctactgtgggatcgaacatgga
ggggttagcgacgaagatgaggatttaagatataaattctcgaaaggctgtaggccttgt
tttgaatcatacgatgatttcactggatcaaagggtgtttggcaagcgggaatcctgaaa
ctgtgccaacatttctga

>gi568815591r:16362551_16565668|GENSCAN_predicted_peptide_7|103_aa
MGESTFAGVQQPAGTVLGTGGREILGRRGPPPGPYLQAKKTETVAQSEKLHPCFPARMLP
FPKPPMARPTPHSVLIKTPELSGRERSSSWMSETIAGRRREAA

>gi568815591r:16362551_16565668|GENSCAN_predicted_CDS_7|312_bp
atgggggaatccaccttcgcaggagtccagcagcctgcaggaactgtgctggggacagga
ggcagggaaattctgggcagaagagggcccccgccagggccctacctgcaagccaaaaaa
actgaaaccgtggcccaaagtgagaagttacatccctgttttcctgctcgaatgttgcct
tttccaaaaccacccatggcccgccccaccccccattctgtgcttataaaaaccccagaa
cttagcggcagagagagaagcagcagctggatgtcagagactatagctggacgtcggaga
gaagcagcttga

>gi568815591r:16362551_16565668|GENSCAN_predicted_peptide_8|444_aa
MSVCESWNCDGYVVMRKADPRSPASTGRITMSRGNTRVSEAQGWVLQGHTLKRHKSQRLA
SNISQCGLAFSQRECEQRAVHQAKSLFGATGGRGRINHRCFGRPCVLMSWDPNPVPRTLR
CWRLRRASETALQSSRRCCSYKYCSGPLPSCDQQSIQTLHPPQGLLCPTPLKCNTTSLYH
FQLMVVADRLEQLLPLYQLQQGGMDSGSRSSSGSSSGGSGTPVPCIPKAANCTTSTLAQP
GRTRSQAQSLHCGLNLTPCCVLGTCDYPAEDTARKTWRGGGKALEPTRETPPEPATLGAA
AMGPGRVTHWWRSSTAAAAQTAAVGPGISLHSRGPRKAPLMPAGSEVPPPTVWLLPAVST
HSNNRVKLIPSLGAVTTWLGTLGAKKHGREAKGELKAARHWPAGAPWHEQPQLHEQRQEA
HRFLGRGGQVPNEAPPSSWKGPKG

>gi568815591r:16362551_16565668|GENSCAN_predicted_CDS_8|1335_bp
atgtcagtatgtgaatcctggaactgtgatggctatgtcgtgatgcggaaagctgacccc
cgctctccggcatccaccggacgcatcaccatgagcaggggtaacacgcgtgtgtccgag
gcacaagggtgggtgttgcagggccacacgctcaaacgtcacaaaagtcagcgtttggct
tcaaatatttcccagtgtggactggcttttagtcaacgggaatgcgaacaacgagctgtg
caccaagccaagtctctcttcggtgccaccggcgggcgaggccggattaatcaccgctgc
ttcggccgcccatgtgtcctgatgtcctgggacccgaaccccgtgccccgtaccttgcga
tgctggcgcctacggagggcatccgaaactgccctacagagcagtcgccggtgctgttcc
tacaaatattgcagtggccctctcccttcctgtgatcaacagtctattcaaacattgcat
cctccacaaggcctactctgtccaaccccactaaaatgtaacactacttcactctaccac
ttccaactgatggtggtggcagaccgtctggagcagttgctgccattataccagctgcag
cagggaggtatggacagtggcagcaggagcagctctgggagcagcagtggtggcagtggg
acccctgtgccctgcatccccaaggcagccaactgcaccacctccaccctcgcacagcca
ggcaggacccgctcccaggcccagagcctccactgtggccttaacctcactccttgctgt
gtcttgggaacctgtgattacccagctgaagacacagccaggaaaacgtggaggggaggt
ggaaaggccctggagcccacccgagagaccccaccagagcctgccaccctgggagctgct
gcaatggggccaggccgagtcacccactggtggaggagcagcacagctgcagctgcccaa
actgctgctgtgggcccaggcatctccctgcactctcggggacccaggaaggcccctctt
atgcctgcaggctcagaagtgcctcctcctactgtctggcttctccctgctgtcagcacc
cactccaataacagagtaaagttgatcccaagcctgggcgctgtcacaacctggctgggg
actttgggcgccaagaagcatgggagagaggctaaaggggagctgaaggcagctcggcac
tggcctgcaggtgccccttggcatgaacaacctcagctccatgaacagcggcaggaggca
cacaggttcctgggcagaggggggcaggtccccaatgaagccccaccttcaagctggaaa
gggcctaaaggctga

>gi568815591r:16362551_16565668|GENSCAN_predicted_peptide_9|85_aa
METLSRKEELPVPKADSWKLRSRASRKQKSQPSNFALVFLMASPHPEAAWELPTITQFFS
ILKDAYHFEDSKDLGSCMPGNEDWD

>gi568815591r:16362551_16565668|GENSCAN_predicted_CDS_9|258_bp
atggagaccctctctcgaaaggaggaactccctgttcccaaggctgattcctggaagctc
cgaagtagggcttccaggaagcagaagtcccaaccttctaattttgccttagtgtttttg
atggccagcccccatcctgaagctgcctgggagctgccaaccatcactcaattctttagc
atactaaaagatgcttatcactttgaagattccaaggatttggggagttgtatgccagga
aatgaggactgggactaa