GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:04:22 Sequence gi568815596f:65105109_65305987 : 200879 bp : 42.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 24689 24906 218 0 2 102 44 268 0.614 20.12 1.02 PlyA + 26077 26082 6 1.05 2.00 Prom + 29991 30030 40 -8.75 2.01 Init + 46734 46790 57 2 0 96 94 108 0.908 13.56 2.02 Term + 51336 51359 24 2 0 110 37 61 0.809 0.45 2.03 PlyA + 52043 52048 6 1.05 3.04 PlyA - 52075 52070 6 1.05 3.03 Term - 56043 55989 55 2 1 108 46 47 0.389 -1.45 3.02 Intr - 58147 58041 107 1 2 80 63 61 0.422 0.89 3.01 Init - 60773 60603 171 2 0 75 83 126 0.952 10.29 3.00 Prom - 67361 67322 40 -7.05 4.04 PlyA - 67858 67853 6 1.05 4.03 Term - 71079 70889 191 1 2 86 43 81 0.505 0.03 4.02 Intr - 72617 72450 168 0 0 75 81 67 0.383 3.70 4.01 Init - 72916 72709 208 1 1 41 100 98 0.376 3.60 4.00 Prom - 75499 75460 40 -6.95 5.00 Prom + 76644 76683 40 -5.15 5.01 Sngl + 83953 84249 297 0 0 68 50 227 0.951 12.40 5.02 PlyA + 85274 85279 6 1.05 6.00 Prom + 91504 91543 40 -3.95 6.01 Init + 100001 100852 852 1 0 106 53 826 0.003 75.59 6.02 Intr + 122327 122511 185 1 2 53 51 164 0.001 6.86 6.03 Intr + 123627 123633 7 0 1 97 95 0 0.000 -5.58 6.04 Intr + 134744 134854 111 1 0 86 85 94 0.933 8.56 6.05 Intr + 141416 141631 216 1 0 29 105 236 0.982 17.18 6.06 Intr + 145919 145991 73 1 1 112 100 40 0.992 5.66 6.07 Intr + 148620 148756 137 1 2 87 115 11 0.986 3.07 6.08 Intr + 150437 150586 150 1 0 67 98 164 0.999 14.74 6.09 Intr + 156139 156284 146 0 2 76 88 177 0.999 14.66 6.10 Intr + 159935 160067 133 2 1 79 84 122 0.999 10.63 6.11 Intr + 163456 163609 154 0 1 118 42 198 0.004 16.92 6.12 Intr + 166637 166659 23 0 2 68 83 19 0.000 -4.26 6.13 Intr + 181015 181301 287 0 2 78 48 208 0.510 11.02 6.14 Intr + 185624 185774 151 2 1 60 29 128 0.600 3.34 6.15 Intr + 186444 186648 205 2 1 86 70 84 0.561 4.25 6.16 Term + 187425 187573 149 1 2 50 52 125 0.627 2.18 6.17 PlyA + 187761 187766 6 1.05 7.00 Prom + 192672 192711 40 -5.55 7.01 Sngl + 196503 196769 267 2 0 75 42 203 0.544 9.39 7.02 PlyA + 197993 197998 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 93377 93475 99 1 0 95 99 65 0.918 8.61 S.002 Sngl + 100001 100882 882 1 0 106 36 865 0.995 78.87 S.003 Term + 163456 163626 171 0 0 118 37 249 0.996 19.64 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:65105109_65305987|GENSCAN_predicted_peptide_1|72_aa XDAPTSAPTLLDPFKIPQAGPRTQPTGALLNSLFGIHAGHVTAAAAAATAALAAAAAALT LRATGNRKKGMR >gi568815596f:65105109_65305987|GENSCAN_predicted_CDS_1|219_bp nccgatgccccgacttctgccccaaccctcctcgacccctttaagatcccccaagctggc ccacggacccagccgaccggtgctctcctgaactcactattcgggattcatgctggacat gtcactgcagctgccgccgccgccaccgccgcccttgctgccgcagccgccgccctgact ctccgcgccacgggtaatcgaaagaaaggaatgagatag >gi568815596f:65105109_65305987|GENSCAN_predicted_peptide_2|26_aa MRPQTATVDEEDPNLEKTQIVDEIYA >gi568815596f:65105109_65305987|GENSCAN_predicted_CDS_2|81_bp atgaggccacagacagcaactgtggatgaagaggatccaaatctggagaagacgcagatt gtggacgagatctatgcatga >gi568815596f:65105109_65305987|GENSCAN_predicted_peptide_3|110_aa MDVWAFTRHRYFHINSRCFVTGEELEKKGSVSMQRQQGKLTMQPKQGGKQLVKDHVLLCP RTRYRKGVCPEAADTLFYVSCGQAMNNIYDERRTHPECDPFLPLPLYTLV >gi568815596f:65105109_65305987|GENSCAN_predicted_CDS_3|333_bp atggacgtgtgggcttttacaagacacagatattttcacatcaactctagatgctttgtt actggggaagaacttgagaaaaagggaagtgtctccatgcaacggcaacaagggaaactc actatgcagccaaaacagggaggaaaacaacttgtaaaagatcatgtgttattatgccca aggaccaggtacaggaaaggagtttgccctgaagctgcagataccctattttatgtcagc tgtggacaggctatgaataatatttacgacgaaagaagaactcatccagaatgtgatcct tttttacccctgccactctacacgctggtttga >gi568815596f:65105109_65305987|GENSCAN_predicted_peptide_4|188_aa MGALPRFCWKGGSPALRGTAAAAQSQMWNPVSLNSQRPGRPPAPHSLGCACSHCLASPHS WCLLQLQSKDCGRLMSMGGRLSGGRRQLDVGLQVLFGMNSLGAMGTMDGRVMAAGGRQAP GQKKAGTGACSRSCVQYIWFSHSLARGLCWHLELLALLQQPARLAVHSGWTPRLLTYTIL AALRLVCV >gi568815596f:65105109_65305987|GENSCAN_predicted_CDS_4|567_bp atgggagccctgccccgtttttgttggaagggtgggagccctgccctccggggcacagct gcagctgcccagtcgcagatgtggaatccagtatccctgaactctcagcgaccaggaagg ccccctgccccccacagtcttgggtgtgcctgctcccactgcctggcctctccccactcc tggtgcctgctccaacttcagagcaaagactgtgggcgcctaatgagcatgggagggagg ctgagtgggggccgaaggcagcttgatgtgggcctgcaggtgctctttggcatgaacagc ctgggtgccatgggcaccatggatggcagagtgatggcagcaggaggcagacaggctcct gggcagaaaaaggcaggtacgggtgcctgcagcagaagctgcgtgcagtatatctggttc agtcacagccttgcacggggcctgtgctggcacctggagctgcttgccctgctgcagcag ccagcacgcctggctgtgcacagtgggtggaccccgcgcttgctcacctacacaatcctc gcagctctgcgcctggtttgcgtttag >gi568815596f:65105109_65305987|GENSCAN_predicted_peptide_5|98_aa MSSENILQAKVARGGRVERAVLDVDYTPSEDRAKYGWFLWTKCGSWVPLGSFPRKTMGSQ KRGGLKDDRERSKWQSDERHYQESLRDSRRGSPGISIP >gi568815596f:65105109_65305987|GENSCAN_predicted_CDS_5|297_bp atgagttctgaaaacattctccaggccaaagtggccagaggtggcagagtagagagggct gtgttagatgtggactacacgccctctgaggacagggcaaagtacgggtggttcttatgg accaaatgtggatcctgggtccctcttggatccttccctagaaagaccatgggatcccaa aagagaggtggcctgaaagatgacagggagcggtcaaaatggcagtcagatgagaggcat taccaggaatctctgagggactcaaggaggggaagccctgggatctccatcccctag >gi568815596f:65105109_65305987|GENSCAN_predicted_peptide_6|992_aa MATHGQTCARPMCIPPSYADLGKAARDIFNKGFGFGLVKLDVKTKSCSGVEFSTSGSSNT DTGKVTGTLETKYKWCEYGLTFTEKWNTDNTLGTEIAIEDQICQGLKLTFDTTFSPNTGK KSGKIKSYKRECINLGCDVDFDFAGPAIHGSAVFGYEGWLAGYQMTFDSAKSKLTRNNFA VGYRTGDFQLHTNVNDGTEFGGSIYQKVCEDLDTSVNLAWTSGTNCTRFGIAAKYQLDPT ASISAKVNNSSLIGVGYTQTLRPGVKLTLSALVDGKSINAGGHKLLRHIPLPQQNRCPSP ACVFCSSLSSALSSCDTLGRKPQHRRCWPPLTPGKAPGTRSSEEPGIQFVKCGYAGSNFP EHIFPALVGRPIIRSTTKVGNIEIKDLMVGDEASELRSMLEVNYPMENGIVRNWDDMKHL WDYTFGPEKLNIDTRNCKILLTEPPMNPTKNREKIVEVMFETYQFSGVYVAIQAVLTLYA QGLLTGVVVDSGDGVTHICPVYEGFSLPHLTRRLDIAGRDITRYLIKLLLLRGYAFNHSA DFETVRMIKEKLCYVGYNIEQEQKLALETTVLVESYTLPDGRIIKVGGERFEAPEALFQP HLINVEGVGVAELLFNTIQAADIDTRSEFYKHIVLSGGSTMYPGLPSRLERELKQLYLER VLKGDVEKLSKFKIRIEDPPRRKHMVFLGGAVLADIMKDKDNFWMTRQEYQEKGVRVLEK LGPESQTANCHEGTSKISESYNVTYGRRHEKQDMQQRASLEMAGAGAGGSNKSVILKGVF MCSLRVSCEENSQGDTDYETARALELFAKSRSQGFTPSPDQKQGITGGEVLLESSGERPG MLLNIPQCTGQPPHKSKVAQNVNSVEVEKPCVRVRSDDDCLGNTQESLNCSQDLVETTPV SVTSRPWVFITIAILEEHSCTKAPRASSLCHKYPKKNVSELFQGLHPFLLEKSLPKPAHT APGRRRRGEASQPFSETSNSPGLELVPFSAQR >gi568815596f:65105109_65305987|GENSCAN_predicted_CDS_6|2979_bp atggcgacccacggacagacttgcgcgcgtccaatgtgtattcctccatcatatgctgac cttggcaaagctgccagagatattttcaacaaaggatttggttttgggttggtgaaactg gatgtgaaaacaaagtcttgcagtggcgtggaattttcaacgtccggttcatctaataca gacactggtaaagttactgggaccctggagaccaaatacaagtggtgtgagtatggtctg actttcacagaaaagtggaacactgataacactctgggaacagaaatcgcaattgaagac cagatttgtcaaggtttgaaactgacatttgatactaccttctcaccaaacacaggaaag aaaagtggtaaaatcaagtcttacaagagggagtgtataaaccttggttgtgatgttgac tttgattttgctggacctgcaatccatggttcagctgtctttggttatgagggctggctt gctggctaccagatgacctttgacagtgccaaatcaaagctgacaaggaataactttgca gtgggctacaggactggggacttccagctacacactaatgtcaatgatgggacagaattt ggaggatcaatttatcagaaagtttgtgaagatcttgacacttcagtaaaccttgcttgg acatcaggtaccaactgcactcgttttggcattgcagctaaatatcagttggatcccact gcttccatttctgcaaaagtcaacaactctagcttaattggagtaggctatactcagact ctgaggcctggtgtgaagcttacactctctgctctggtagacgggaagagcattaatgct ggaggccacaagcttctccgccacattccactgccgcagcagaaccggtgtccttcacct gcctgcgtcttctgcagttcgttaagcagcgcccttagctcctgtgacactttagggaga aagccgcagcaccggcgatgttggccgccattaacgcccggcaaagcgcccggaactaga agcagcgaagaacccggtatacagtttgtgaagtgtggatatgcaggctctaactttcca gaacacatcttcccagctttggttggaagacctattatcagatcaaccaccaaagtggga aacattgaaatcaaggatcttatggttggtgatgaggcaagtgaattacgatcaatgtta gaagttaactaccctatggaaaatggcatagtacgaaattgggatgacatgaaacacctg tgggactacacatttggaccagagaaacttaatatagataccagaaattgtaaaatctta ctcacagaacctcctatgaacccaaccaaaaacagagagaagattgtagaggtaatgttt gaaacttaccagttttccggtgtatatgtagccatccaggcagttctgactttgtacgct caaggtttattgactggtgtagtggtagactctggagatggtgtgactcacatttgccca gtatatgaaggcttttctctccctcatcttaccaggagactggatattgctgggagggat ataactagatatcttatcaagctacttctgttgcgaggatacgccttcaaccactctgct gattttgaaacggttcgcatgattaaagaaaaactgtgttacgtgggatataatattgag caagagcagaaactggccttagaaaccacagtattagttgaatcttatacactcccagat ggacgtatcatcaaagttgggggagagagatttgaagcaccagaagctttatttcagcct cacttgatcaatgttgaaggagttggtgttgctgaattgctttttaacacaattcaggca gctgacattgataccagatctgaattctacaaacacattgtgctttctggagggtctact atgtatcctggcctgccatcacggttggaacgagaacttaaacagctttacttagaacga gttttgaagggtgatgtggaaaaactttctaaatttaagatccgcattgaagacccaccc cgcagaaagcacatggtattcctgggtggtgcagttctagcggatatcatgaaagacaaa gacaacttttggatgacccgacaagagtaccaagaaaagggtgtccgtgtgctagagaaa cttgggccggagtcccagacagcgaactgccatgaagggacatcaaagataagcgaatca tacaatgtaacctatggcaggaggcatgagaaacaagatatgcaacagagggcctctttg gaaatggcaggagcaggggctggaggcagcaacaaaagtgtcatcttgaaaggcgtcttc atgtgcagcttgagagtcagctgtgaggaaaatagccaaggagacacagactatgagact gcccgtgccctagagctctttgcaaaatcaaggtctcagggcttcaccccatcccctgac caaaagcaagggataactgggggtgaggtgcttcttgaatctagtggggagaggcctggg atgctgctgaacatcccacaatgcacaggacagccccctcacaagagtaaagtggcccaa aatgtcaatagtgttgaggttgagaaaccctgtgttagagttaggtctgatgatgactgt ttagggaatacccaagagagtctaaattgctcacaggatttggtagaaaccactcccgtt tcagtcaccagccggccctgggtctttatcacaattgctatcctagaggaacacagctgc acgaaggcaccaagggcttcctccttatgccacaaatatcctaagaagaatgtctcagaa ttatttcaagggcttcatcccttcctgctggagaaatcgctgccaaagcctgctcacaca gcccctgggcggcggaggcgaggtgaagccagtcagcctttctccgagacttccaactca ccgggcctggagctcgtccccttctcagcccagcgctga >gi568815596f:65105109_65305987|GENSCAN_predicted_peptide_7|88_aa MRQRQLLPRLDKRGKNVGWILQQAPNVDSKDPRMSVEAEQAHLTPLAASPSDSREVSQRH ETWHQLPDQDETMNQSECSRITLSWGCI >gi568815596f:65105109_65305987|GENSCAN_predicted_CDS_7|267_bp atgagacagaggcagctattacccaggctcgacaaaagaggaaaaaacgtgggatggatt ttgcaacaagccccaaatgttgattccaaggacccgaggatgagtgtagaagcggagcag gcacacctgacacccctcgcggcatccccttcagattcccgggaagtctcacagagacat gaaacctggcatcagctccctgaccaagatgagaccatgaaccagtcagaatgttcacga atcacattgtcctggggatgcatttag