GENSCAN 1.0	Date run: 4-Nov-116	Time: 03:08:15

Sequence gi568815596f:65127910_65368731 : 240822 bp : 42.91% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   1888   2105  218  2  2  102   44   268 0.624  20.12
 1.02 PlyA +   3276   3281    6                               1.05

 2.00 Prom +   7190   7229   40                              -8.75
 2.01 Init +  23933  23989   57  1  0   96   94   108 0.908  13.56
 2.02 Term +  28535  28558   24  1  0  110   37    61 0.809   0.45
 2.03 PlyA +  29242  29247    6                               1.05

 3.04 PlyA -  29274  29269    6                               1.05
 3.03 Term -  33242  33188   55  1  1  108   46    47 0.389  -1.45
 3.02 Intr -  35346  35240  107  0  2   80   63    61 0.422   0.89
 3.01 Init -  37972  37802  171  1  0   75   83   126 0.952  10.29
 3.00 Prom -  44560  44521   40                              -7.05

 4.04 PlyA -  45057  45052    6                               1.05
 4.03 Term -  48278  48088  191  0  2   86   43    81 0.505   0.03
 4.02 Intr -  49816  49649  168  2  0   75   81    67 0.383   3.70
 4.01 Init -  50115  49908  208  0  1   41  100    98 0.376   3.60
 4.00 Prom -  52698  52659   40                              -6.95

 5.00 Prom +  53843  53882   40                              -5.15
 5.01 Sngl +  61152  61448  297  2  0   68   50   227 0.951  12.40
 5.02 PlyA +  62473  62478    6                               1.05

 6.00 Prom +  68703  68742   40                              -3.95
 6.01 Init +  77200  78051  852  0  0  106   53   826 0.003  75.59
 6.02 Intr +  99526  99710  185  0  2   53   51   164 0.001   6.86
 6.03 Intr + 100826 100832    7  2  1   97   95     0 0.000  -5.58
 6.04 Intr + 111943 112053  111  0  0   86   85    94 0.933   8.56
 6.05 Intr + 118615 118830  216  0  0   29  105   236 0.982  17.18
 6.06 Intr + 123118 123190   73  0  1  112  100    40 0.992   5.66
 6.07 Intr + 125819 125955  137  0  2   87  115    11 0.986   3.07
 6.08 Intr + 127636 127785  150  0  0   67   98   164 0.999  14.74
 6.09 Intr + 133338 133483  146  2  2   76   88   177 0.999  14.66
 6.10 Intr + 137134 137266  133  1  1   79   84   122 0.999  10.63
 6.11 Intr + 140655 140808  154  2  1  118   42   198 0.004  16.92
 6.12 Intr + 143836 143858   23  2  2   68   83    19 0.000  -4.26
 6.13 Intr + 158214 158500  287  2  2   78   48   208 0.510  11.02
 6.14 Intr + 162823 162973  151  1  1   60   29   128 0.600   3.34
 6.15 Intr + 163643 163847  205  1  1   86   70    84 0.558   4.25
 6.16 Term + 164624 164772  149  0  2   50   52   125 0.624   2.18
 6.17 PlyA + 164960 164965    6                               1.05

 7.00 Prom + 169871 169910   40                              -5.55
 7.01 Sngl + 173702 173968  267  1  0   75   42   203 0.568   9.39
 7.02 PlyA + 175192 175197    6                               1.05

 8.14 PlyA - 177062 177057    6                               1.05
 8.13 Term - 182912 182732  181  1  1   36   48   168 0.339   3.70
 8.12 Intr - 185134 185046   89  1  2   53   83    52 0.068  -0.95
 8.11 Intr - 186260 185832  429  2  0   68   88   457 0.127  36.69
 8.10 Intr - 188974 188825  150  1  0  107   99   121 0.581  14.54
 8.09 Intr - 192692 192426  267  2  0   43   37   139 0.028   1.11
 8.08 Intr - 204142 204078   65  2  2  108   98    98 0.970  10.32
 8.07 Intr - 206864 206696  169  2  1  106   67   209 0.880  19.20
 8.06 Intr - 207123 206978  146  1  2   34   12   217 0.955   7.78
 8.05 Intr - 213118 213022   97  1  1  139   89    25 0.866   6.36
 8.04 Intr - 216987 216810  178  2  1  109   55   183 0.267  16.10
 8.03 Intr - 226592 226514   79  0  1   74   73    32 0.003  -2.01
 8.02 Intr - 231002 230910   93  0  0  104   63    25 0.005   0.62
 8.01 Init - 238915 238786  130  1  1   45   76   120 0.505   6.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  70576  70674   99  0  0   95   99    65 0.918   8.61
S.002 Sngl +  77200  78081  882  0  0  106   36   865 0.995  78.87
S.003 Term + 140655 140825  171  2  0  118   37   249 0.996  19.64

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:65127910_65368731|GENSCAN_predicted_peptide_1|72_aa
XDAPTSAPTLLDPFKIPQAGPRTQPTGALLNSLFGIHAGHVTAAAAAATAALAAAAAALT
LRATGNRKKGMR

>gi568815596f:65127910_65368731|GENSCAN_predicted_CDS_1|219_bp
nccgatgccccgacttctgccccaaccctcctcgacccctttaagatcccccaagctggc
ccacggacccagccgaccggtgctctcctgaactcactattcgggattcatgctggacat
gtcactgcagctgccgccgccgccaccgccgcccttgctgccgcagccgccgccctgact
ctccgcgccacgggtaatcgaaagaaaggaatgagatag

>gi568815596f:65127910_65368731|GENSCAN_predicted_peptide_2|26_aa
MRPQTATVDEEDPNLEKTQIVDEIYA

>gi568815596f:65127910_65368731|GENSCAN_predicted_CDS_2|81_bp
atgaggccacagacagcaactgtggatgaagaggatccaaatctggagaagacgcagatt
gtggacgagatctatgcatga

>gi568815596f:65127910_65368731|GENSCAN_predicted_peptide_3|110_aa
MDVWAFTRHRYFHINSRCFVTGEELEKKGSVSMQRQQGKLTMQPKQGGKQLVKDHVLLCP
RTRYRKGVCPEAADTLFYVSCGQAMNNIYDERRTHPECDPFLPLPLYTLV

>gi568815596f:65127910_65368731|GENSCAN_predicted_CDS_3|333_bp
atggacgtgtgggcttttacaagacacagatattttcacatcaactctagatgctttgtt
actggggaagaacttgagaaaaagggaagtgtctccatgcaacggcaacaagggaaactc
actatgcagccaaaacagggaggaaaacaacttgtaaaagatcatgtgttattatgccca
aggaccaggtacaggaaaggagtttgccctgaagctgcagataccctattttatgtcagc
tgtggacaggctatgaataatatttacgacgaaagaagaactcatccagaatgtgatcct
tttttacccctgccactctacacgctggtttga

>gi568815596f:65127910_65368731|GENSCAN_predicted_peptide_4|188_aa
MGALPRFCWKGGSPALRGTAAAAQSQMWNPVSLNSQRPGRPPAPHSLGCACSHCLASPHS
WCLLQLQSKDCGRLMSMGGRLSGGRRQLDVGLQVLFGMNSLGAMGTMDGRVMAAGGRQAP
GQKKAGTGACSRSCVQYIWFSHSLARGLCWHLELLALLQQPARLAVHSGWTPRLLTYTIL
AALRLVCV

>gi568815596f:65127910_65368731|GENSCAN_predicted_CDS_4|567_bp
atgggagccctgccccgtttttgttggaagggtgggagccctgccctccggggcacagct
gcagctgcccagtcgcagatgtggaatccagtatccctgaactctcagcgaccaggaagg
ccccctgccccccacagtcttgggtgtgcctgctcccactgcctggcctctccccactcc tggtgcctgctccaacttcagagcaaagactgtgggcgcctaatgagcatgggagggagg ctgagtgggggccgaaggcagcttgatgtgggcctgcaggtgctctttggcatgaacagc ctgggtgccatgggcaccatggatggcagagtgatggcagcaggaggcagacaggctcct gggcagaaaaaggcaggtacgggtgcctgcagcagaagctgcgtgcagtatatctggttc agtcacagccttgcacggggcctgtgctggcacctggagctgcttgccctgctgcagcag ccagcacgcctggctgtgcacagtgggtggaccccgcgcttgctcacctacacaatcctc gcagctctgcgcctggtttgcgtttag >gi568815596f:65127910_65368731|GENSCAN_predicted_peptide_5|98_aa MSSENILQAKVARGGRVERAVLDVDYTPSEDRAKYGWFLWTKCGSWVPLGSFPRKTMGSQ KRGGLKDDRERSKWQSDERHYQESLRDSRRGSPGISIP >gi568815596f:65127910_65368731|GENSCAN_predicted_CDS_5|297_bp atgagttctgaaaacattctccaggccaaagtggccagaggtggcagagtagagagggct gtgttagatgtggactacacgccctctgaggacagggcaaagtacgggtggttcttatgg accaaatgtggatcctgggtccctcttggatccttccctagaaagaccatgggatcccaa aagagaggtggcctgaaagatgacagggagcggtcaaaatggcagtcagatgagaggcat taccaggaatctctgagggactcaaggaggggaagccctgggatctccatcccctag >gi568815596f:65127910_65368731|GENSCAN_predicted_peptide_6|992_aa MATHGQTCARPMCIPPSYADLGKAARDIFNKGFGFGLVKLDVKTKSCSGVEFSTSGSSNT DTGKVTGTLETKYKWCEYGLTFTEKWNTDNTLGTEIAIEDQICQGLKLTFDTTFSPNTGK KSGKIKSYKRECINLGCDVDFDFAGPAIHGSAVFGYEGWLAGYQMTFDSAKSKLTRNNFA VGYRTGDFQLHTNVNDGTEFGGSIYQKVCEDLDTSVNLAWTSGTNCTRFGIAAKYQLDPT ASISAKVNNSSLIGVGYTQTLRPGVKLTLSALVDGKSINAGGHKLLRHIPLPQQNRCPSP ACVFCSSLSSALSSCDTLGRKPQHRRCWPPLTPGKAPGTRSSEEPGIQFVKCGYAGSNFP EHIFPALVGRPIIRSTTKVGNIEIKDLMVGDEASELRSMLEVNYPMENGIVRNWDDMKHL WDYTFGPEKLNIDTRNCKILLTEPPMNPTKNREKIVEVMFETYQFSGVYVAIQAVLTLYA QGLLTGVVVDSGDGVTHICPVYEGFSLPHLTRRLDIAGRDITRYLIKLLLLRGYAFNHSA DFETVRMIKEKLCYVGYNIEQEQKLALETTVLVESYTLPDGRIIKVGGERFEAPEALFQP HLINVEGVGVAELLFNTIQAADIDTRSEFYKHIVLSGGSTMYPGLPSRLERELKQLYLER VLKGDVEKLSKFKIRIEDPPRRKHMVFLGGAVLADIMKDKDNFWMTRQEYQEKGVRVLEK LGPESQTANCHEGTSKISESYNVTYGRRHEKQDMQQRASLEMAGAGAGGSNKSVILKGVF MCSLRVSCEENSQGDTDYETARALELFAKSRSQGFTPSPDQKQGITGGEVLLESSGERPG MLLNIPQCTGQPPHKSKVAQNVNSVEVEKPCVRVRSDDDCLGNTQESLNCSQDLVETTPV 
SVTSRPWVFITIAILEEHSCTKAPRASSLCHKYPKKNVSELFQGLHPFLLEKSLPKPAHT APGRRRRGEASQPFSETSNSPGLELVPFSAQR >gi568815596f:65127910_65368731|GENSCAN_predicted_CDS_6|2979_bp atggcgacccacggacagacttgcgcgcgtccaatgtgtattcctccatcatatgctgac cttggcaaagctgccagagatattttcaacaaaggatttggttttgggttggtgaaactg gatgtgaaaacaaagtcttgcagtggcgtggaattttcaacgtccggttcatctaataca gacactggtaaagttactgggaccctggagaccaaatacaagtggtgtgagtatggtctg actttcacagaaaagtggaacactgataacactctgggaacagaaatcgcaattgaagac cagatttgtcaaggtttgaaactgacatttgatactaccttctcaccaaacacaggaaag aaaagtggtaaaatcaagtcttacaagagggagtgtataaaccttggttgtgatgttgac tttgattttgctggacctgcaatccatggttcagctgtctttggttatgagggctggctt gctggctaccagatgacctttgacagtgccaaatcaaagctgacaaggaataactttgca gtgggctacaggactggggacttccagctacacactaatgtcaatgatgggacagaattt ggaggatcaatttatcagaaagtttgtgaagatcttgacacttcagtaaaccttgcttgg acatcaggtaccaactgcactcgttttggcattgcagctaaatatcagttggatcccact gcttccatttctgcaaaagtcaacaactctagcttaattggagtaggctatactcagact ctgaggcctggtgtgaagcttacactctctgctctggtagacgggaagagcattaatgct ggaggccacaagcttctccgccacattccactgccgcagcagaaccggtgtccttcacct gcctgcgtcttctgcagttcgttaagcagcgcccttagctcctgtgacactttagggaga aagccgcagcaccggcgatgttggccgccattaacgcccggcaaagcgcccggaactaga agcagcgaagaacccggtatacagtttgtgaagtgtggatatgcaggctctaactttcca gaacacatcttcccagctttggttggaagacctattatcagatcaaccaccaaagtggga aacattgaaatcaaggatcttatggttggtgatgaggcaagtgaattacgatcaatgtta gaagttaactaccctatggaaaatggcatagtacgaaattgggatgacatgaaacacctg tgggactacacatttggaccagagaaacttaatatagataccagaaattgtaaaatctta ctcacagaacctcctatgaacccaaccaaaaacagagagaagattgtagaggtaatgttt gaaacttaccagttttccggtgtatatgtagccatccaggcagttctgactttgtacgct caaggtttattgactggtgtagtggtagactctggagatggtgtgactcacatttgccca gtatatgaaggcttttctctccctcatcttaccaggagactggatattgctgggagggat ataactagatatcttatcaagctacttctgttgcgaggatacgccttcaaccactctgct gattttgaaacggttcgcatgattaaagaaaaactgtgttacgtgggatataatattgag caagagcagaaactggccttagaaaccacagtattagttgaatcttatacactcccagat ggacgtatcatcaaagttgggggagagagatttgaagcaccagaagctttatttcagcct 
cacttgatcaatgttgaaggagttggtgttgctgaattgctttttaacacaattcaggca gctgacattgataccagatctgaattctacaaacacattgtgctttctggagggtctact atgtatcctggcctgccatcacggttggaacgagaacttaaacagctttacttagaacga gttttgaagggtgatgtggaaaaactttctaaatttaagatccgcattgaagacccaccc cgcagaaagcacatggtattcctgggtggtgcagttctagcggatatcatgaaagacaaa gacaacttttggatgacccgacaagagtaccaagaaaagggtgtccgtgtgctagagaaa cttgggccggagtcccagacagcgaactgccatgaagggacatcaaagataagcgaatca tacaatgtaacctatggcaggaggcatgagaaacaagatatgcaacagagggcctctttg gaaatggcaggagcaggggctggaggcagcaacaaaagtgtcatcttgaaaggcgtcttc atgtgcagcttgagagtcagctgtgaggaaaatagccaaggagacacagactatgagact gcccgtgccctagagctctttgcaaaatcaaggtctcagggcttcaccccatcccctgac caaaagcaagggataactgggggtgaggtgcttcttgaatctagtggggagaggcctggg atgctgctgaacatcccacaatgcacaggacagccccctcacaagagtaaagtggcccaa aatgtcaatagtgttgaggttgagaaaccctgtgttagagttaggtctgatgatgactgt ttagggaatacccaagagagtctaaattgctcacaggatttggtagaaaccactcccgtt tcagtcaccagccggccctgggtctttatcacaattgctatcctagaggaacacagctgc acgaaggcaccaagggcttcctccttatgccacaaatatcctaagaagaatgtctcagaa ttatttcaagggcttcatcccttcctgctggagaaatcgctgccaaagcctgctcacaca gcccctgggcggcggaggcgaggtgaagccagtcagcctttctccgagacttccaactca ccgggcctggagctcgtccccttctcagcccagcgctga >gi568815596f:65127910_65368731|GENSCAN_predicted_peptide_7|88_aa MRQRQLLPRLDKRGKNVGWILQQAPNVDSKDPRMSVEAEQAHLTPLAASPSDSREVSQRH ETWHQLPDQDETMNQSECSRITLSWGCI >gi568815596f:65127910_65368731|GENSCAN_predicted_CDS_7|267_bp atgagacagaggcagctattacccaggctcgacaaaagaggaaaaaacgtgggatggatt ttgcaacaagccccaaatgttgattccaaggacccgaggatgagtgtagaagcggagcag gcacacctgacacccctcgcggcatccccttcagattcccgggaagtctcacagagacat gaaacctggcatcagctccctgaccaagatgagaccatgaaccagtcagaatgttcacga atcacattgtcctggggatgcatttag >gi568815596f:65127910_65368731|GENSCAN_predicted_peptide_8|690_aa MTQQAYKMRTNPRMTHPVQCSIFVKETESYMGETTSQAASWFSDRKALGQNPARAERNWP FWQWGKRLLVKETVRFIRVQATGDKHLEVKFKIAFKHSTMHDSYIVRVKAVVMTRDDSSG GWFPQEGGGISRVGVCKVMHPEGNGRSGFLIHGERQKDKLRYDNSSFGLSSFSNAMLVPT 
GDHAPTWELCKCARRKRNNRDEESVVKGYNQDSGEQRRSDGQQPSKASFPIGETHRYVKA GVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAIEDLIEGST TSSSTIHNEAELGDDDVFTEAVDCGHASRGLLAFLLHCRNSSSDKAFLSLTSSDKCLGSF ALDCAGREGWGGGMGISEGSISDRLMSPRKSRMCLISQKDFLHIVNPKTATDSSSNSSQK REQPTRTISSPTSCEHRRIYTLGHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRIN PREKIWMTGYEDYRHAPVRGKYPDPSEDADSSYVRFAKGEVPKHDYNYPYVDSSDFGLGE DPKGRGGSVIKTQPSRGKSRRRKEDGERSRCVYCRDMFNHEENRRGHCQDAPDSVRTCIR RQASGSAEMGCENDETRQKYLSLQSVIMYRRRRGDARALADLGVPAYRMGQGFLCRRTCG HRSVFSRPFPPMVMSYSQGETAAPWARMPQ >gi568815596f:65127910_65368731|GENSCAN_predicted_CDS_8|2073_bp atgacacaacaggcctataaaatgcggactaaccctcggatgactcatccggtacaatgc agtatatttgtcaaggagactgaatcatatatgggggaaaccacttctcaggcagccagc tggttctcagataggaaagcccttggccaaaacccagcaagagctgagaggaactggcct ttctggcagtggggaaagaggcttcttgtcaaggagactgtcagatttatacgtgttcaa gctacaggagataaacacttagaggtgaaatttaaaattgcgtttaaacacagtacaatg catgacagctatattgtgcgtgtcaaggctgtggttatgaccagagatgactccagcggg ggatggttcccacaggaaggaggcgggatcagtcgcgtcggggtctgtaaggtcatgcac cccgaaggcaatggacgaagcggctttctcatccatggtgaacgacagaaagacaaactg agatatgataactcttcctttggcctctcaagcttctctaatgcaatgctagtgcccact ggagaccatgcgcccacatgggagctctgcaaatgtgccaggaggaagaggaataaccgt gatgaagagagtgtagtgaaggggtacaaccaggacagtggggagcagcgaagatcggat ggtcagcagccaagcaaagcctcctttcccattggagagactcacagatatgtgaaggca ggggtggtattggaatgctatgtaagaaaggacttggtctacaccaaagccaatccaacg tttcatcactggaaggtcgataataggaagtttggacttactttccaaagccctgctgat gcccgagcctttgacaggggagtaaggaaagcaatcgaagaccttatagaaggttcaaca acgtcatcttccaccatccataatgaagctgagcttggcgatgatgacgtttttacagag gcagtagactgtggacatgcctctcgaggtcttctggccttcttgcttcactgtagaaat tcgtcatcagataaggcctttctcagtctcacatcctctgacaaatgtctggggagcttt gcactggactgtgcagggagagagggctggggaggtgggatgggaatatcggaaggtagc atttctgatcgactaatgagtcctaggaaaagccggatgtgcctcatttctcagaaggac tttctacacattgttaacccaaaaacagctacagacagttcttctaattcctctcagaag agagagcaacctactcggacaatctcctctcccacatcctgtgagcaccggaggatttat accctgggccacctccacgactcataccccacagaccactatcacctcgatcagccgatg 
ccaaggccctaccgccaggtgagcttcccggacgacgacgaggagatcgtgcgcatcaac ccccgggagaagatctggatgacggggtacgaggattaccggcacgcacccgtcaggggc aagtacccggacccctcggaggacgcggactcctcctacgtgcgcttcgccaagggcgag gtccccaagcatgactacaactacccctacgtggactcctcagactttggcctaggcgag gaccccaaaggccgcgggggcagcgtgatcaagacgcagccctcccggggcaagtcgcgg cggcggaaggaggacggagagcgctcgcggtgcgtgtactgcagggacatgttcaaccac gaggagaaccgccggggccactgccaggacgcgcccgactccgtgagaacttgcatccgc cggcaagcatctggttcagcggaaatgggatgtgagaatgatgaaacccgacagaagtat ctcagcctgcagtcagttattatgtataggaggagacgaggtgatgctagagcgttggca gatcttggagttcccgcatacaggatgggacagggcttcctgtgccgaaggacatgtgga caccggtctgttttttcaaggccctttcctcctatggttatgtcatattcccaaggggag actgcagctccttgggctcggatgcctcagtga