GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:07:44

Sequence gi568815592f:1212705_1413913 : 201209 bp : 45.21% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3208   3373  166  1  1   68   27   117 0.316   3.56
 1.02 Term +   4854   4934   81  2  0  103   38    60 0.531   0.19
 1.03 PlyA +   5635   5640    6                               1.05

 2.04 PlyA -   6838   6833    6                               1.05
 2.03 Term -  13770  13685   86  1  2   68   48    67 0.305  -1.48
 2.02 Intr -  20236  20109  128  0  2   70  103    84 0.896   8.42
 2.01 Init -  26339  26098  242  2  2   74   70   104 0.583   4.55
 2.00 Prom -  34437  34398   40                              -3.86

 3.00 Prom +  35783  35822   40                              -4.26
 3.01 Init +  38646  38684   39  2  0   33   94    69 0.249   2.19
 3.02 Intr +  53281  53344   64  0  1   43   91    82 0.072   2.29
 3.03 Intr +  58796  58924  129  0  0  106   58    62 0.556   5.67
 3.04 Intr +  66324  66393   70  1  1   76   86    -3 0.120  -3.46
 3.05 Intr +  69240  69387  148  0  1   75   65    91 0.731   5.74
 3.06 Intr +  71424  71480   57  2  0   56   94    38 0.502   0.28
 3.07 Intr +  71604  71718  115  2  1   63   93    55 0.851   3.52
 3.08 Intr +  73654  73744   91  2  1   33   59    45 0.088  -4.85
 3.09 Intr +  83495  83672  178  2  1   16   89   119 0.408   4.72
 3.10 Term +  84954  85055  102  2  0   38   41    94 0.372  -2.02
 3.11 PlyA +  87267  87272    6                               1.05

 4.00 Prom +  91086  91125   40                              -3.46
 4.01 Sngl + 100001 101212 1212  1  0   88   48  1708 0.999 162.63
 4.02 PlyA + 102026 102031    6                               1.05

 5.00 Prom + 102519 102558   40                              -8.56
 5.01 Init + 105171 105224   54  2  0   94   46   104 0.398   6.10
 5.02 Intr + 111203 111465  263  1  2   83   33   144 0.105   4.69
 5.03 Intr + 123652 123749   98  1  2   49  111    42 0.012   2.25
 5.04 Term + 127380 127573  194  1  2   85   49    95 0.561   2.88
 5.05 PlyA + 128354 128359    6                               1.05

 6.00 Prom + 140272 140311   40                              -3.16
 6.01 Init + 153491 153577   87  1  0   75   68    60 0.776   3.34
 6.02 Term + 156340 156561  222  0  0   51   48   154 0.924   4.62
 6.03 PlyA + 157774 157779    6                               1.05

 7.03 PlyA - 159167 159162    6                               1.05
 7.02 Term - 161658 161645   14  1  2  117   44    -2 0.081  -3.34
 7.01 Init - 165988 165871  118  1  1   74   44   175 0.827  10.07
 7.00 Prom - 167475 167436   40                              -5.26

 8.00 Prom + 167794 167833   40                              -6.46
 8.01 Init + 169133 169232  100  1  1   60   66   161 0.931   9.52
 8.02 Term + 169849 169877   29  2  2   91   48    38 0.619  -1.66
 8.03 PlyA + 169967 169972    6                               1.05

 9.00 Prom + 175960 175999   40                              -5.06
 9.01 Init + 177244 178414 1171  0  1   57   97  2045 0.226 194.00
 9.02 Intr + 180551 180660  110  0  2   57   64   132 0.095   7.70
 9.03 Intr + 186916 186990   75  0  0   99  100    11 0.011   3.11
 9.04 Intr + 198533 198640  108  1  0   63   12   109 0.447   1.28
 9.05 Term + 199873 200052  180  0  0   41   35   197 0.573   7.31
 9.06 PlyA + 200273 200278    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:1212705_1413913|GENSCAN_predicted_peptide_1|82_aa
XSYLAKLTLAQSAVGKAGAKTGEAKEGFQCFRLLGSTPAAAVAAYPGPACIRLQRGPSGM
LSLRPHSLRMQNIPIGFSHFRD
>gi568815592f:1212705_1413913|GENSCAN_predicted_CDS_1|249_bp
nnctcctacttggccaaactcacactggcacagagcgcagtggggaaggctggagcgaaa
actggagaggccaaggaaggcttccagtgcttcaggcttctaggatctacaccagcagca
gcggtagcagcctacccaggtcctgcttgcatccgcctgcaacgtggaccatctggaatg
ctcagcctgagaccacactctctccgcatgcaaaacatccccatcggcttctcacacttc
agggactga
>gi568815592f:1212705_1413913|GENSCAN_predicted_peptide_2|151_aa
MEPKVRAPGIQGREGEVVPRGWHWAGPQDYAERGHSTLGKRCGGPSVHVGRLDNREKIDL
AGLKDVYGDGDTDLKATSVGEWEANPISMNNMDSLRVGGRCECSLQLPPVFEQLHSVVHN
SHTSDDLVSYLAERTEAVDTEPAHPCICFEY
>gi568815592f:1212705_1413913|GENSCAN_predicted_CDS_2|456_bp
atggagccaaaagtaagagctccaggtattcaggggagggagggagaagtcgtaccaaga
gggtggcactgggctgggcctcaggactatgcagagagagggcacagcaccctgggcaag
aggtgtgggggccccagtgtccacgtgggcagattggacaatcgtgagaagatcgacttg
gctggactcaaagatgtatatggcgatggggacacagacctaaaggccacatcagttgga
gagtgggaagcaaatcctatatccatgaacaacatggactctctcagggtaggtggacgc
tgtgaatgcagcctacagctgcccccagtttttgagcagttacacagtgtggttcataac
agccatacatctgatgatctggtttcttatttggctgagagaacagaagccgtggacaca
gagccagctcatccctgcatctgcttcgagtactga
>gi568815592f:1212705_1413913|GENSCAN_predicted_peptide_3|330_aa
MFARPLAVYEEWEATDLTLERAPPANTCDANRLADDEWNKESTEPKRRLSSQGLHLPRDW
GERCGLKVSTCRGTGENGLKKLLKMLQCIIYKPPHPTPNGGNYRFPVIALMMLISISLLS
LTLLVIRDKLKTLVLEGSDMRAIFTGEGLVKTWKKTTKDPSRKSPDNSQVIIVMVCTTII
VPYSPIARENTAVGEQFDSCVETAQIYFRKQLGRMNLRPSTELAHDVQGLLKGCEVAGQK
ILGAFQKDKGYRRLVRGTRQTARTAQNHFREYAYTHDLLSRPVKTCQHTGWFFCTGVTQD
IKQCKTMILRKESKGAEFYNCPGFLPDSDF
>gi568815592f:1212705_1413913|GENSCAN_predicted_CDS_3|993_bp
atgtttgccaggcccctggctgtctacgaggaatgggaggccactgaccttacactggaa
cgagcaccaccagcaaacacctgtgatgccaatcgtcttgcagatgatgaatggaacaaa
gagagcactgagccaaagcggaggctgtctagtcaaggtctccacctgccgcgggactgg
ggagaacggtgtggcctcaaggtctccacctgccgtgggactggggagaacggtctgaaa
aagctccttaaaatgcttcaatgtataatttacaagccccctcaccctacccccaatgga
ggaaattaccggtttcctgtgattgccctcatgatgctgatatcgatctccctgttgtct
ctcacacttttagttatcagggacaaacttaaaaccctagtcctcgagggcagtgacatg
agggctattttcactggggaaggactggtgaagacatggaagaaaactacaaaggatcct
tctcgcaagtccccagacaatagccaggtaataatagtaatggtctgtacaaccatcatt
gtaccttattcacccattgcaagagaaaatacagctgtaggtgaacagtttgacagttgt
gttgaaacagcccagatttacttcagaaaacaacttggaaggatgaatctcaggccatcc
acggagcttgcacatgatgtccagggccttctgaagggctgtgaagttgcaggacaaaag
atcttgggcgcattccagaaggacaagggataccgtagattagtacgaggaaccagacag
actgcccgcacagctcagaaccactttcgggagtatgcctacacacacgacttactgagc
agaccagtgaagacatgtcaacatacaggctggttcttctgcacaggtgtgacacaggat
atcaaacaatgtaagactatgatcctgagaaaagaaagcaaaggagctgagttttacaac
tgtcctggatttctgcctgacagcgacttttaa
>gi568815592f:1212705_1413913|GENSCAN_predicted_peptide_4|403_aa
MKLEVFVPRAAHGDKQGSDLEGAGGSDAPSPLSAAGDDSLGSDGDCAANSPAAGGGARDT
QGDGEQSAGGGPGAEEAIPAAAAAAVVAEGAEAGAAGPGAGGAGSGEGARSKPYTRRPKP
PYSYIALIAMAIRDSAGGRLTLAEINEYLMGKFPFFRGSYTGWRNSVRHNLSLNDCFVKV
LRDPSRPWGKDNYWMLNPNSEYTFADGVFRRRRKRLSHRAPVPAPGLRPEEAPGLPAAPP
PAPAAPASPRMRSPARQEERASPAGKFSSSFAIDSILRKPFRSRRLRDTAPGTTLQWGAA
PCPPLPAFPALLPAAPCRALLPLCAYGAGEPARLGAREAEVPPTAPPLLLAPLPAAAPAK
PLRGPAAGGAHLYCPLRLPAALQAASVRRPGPHLPYPVETLLA
>gi568815592f:1212705_1413913|GENSCAN_predicted_CDS_4|1212_bp
atgaagttggaggtgttcgtccctcgcgcggcccacggggacaagcagggcagtgacctg
gagggcgcgggcggcagcgacgcgccgtccccgctgtcggcggcgggagacgactccctg
ggctcagatggggactgcgcggccaacagcccggccgcgggcggcggcgccagagatacg
cagggcgacggcgaacagagtgcgggaggcgggccgggcgcggaggaggcgatcccggca
gcagctgctgcagcggtggtggcggagggcgcggaggccggggcggcggggccaggcgcg
ggcggcgcggggagcggcgagggtgcacgcagcaagccatatacgcggcggcccaagccc
ccctactcgtacatcgcgctcatcgccatggccatccgcgactcggcgggcgggcgcttg
acgctggcggagatcaacgagtacctcatgggcaagttcccctttttccgcggcagctac
acgggctggcgcaactccgtgcgccacaacctttcgctcaacgactgcttcgtcaaggtg
ctgcgcgacccctcgcggccctggggcaaggacaactactggatgctcaaccccaacagc
gagtacaccttcgccgacggggtcttccgccgccgccgcaagcgcctcagccaccgcgcg
ccggtccccgcgcccgggctgcggcccgaggaggccccgggcctccccgccgccccgccg
cccgcgcccgccgccccggcctcgccccgcatgcgctcgcccgcccgccaggaggagcgc
gccagccccgcgggcaagttctccagctccttcgccatcgacagcatcctgcgcaagccc
ttccgcagccgccgcctcagggacacggcccccgggacgacgcttcagtggggcgccgcg
ccctgcccgccgctgcccgcgttccccgcgctcctccccgcggcgccctgcagggccctg
ctgccgctctgcgcgtacggcgcgggcgagccggcgcggctgggcgcgcgcgaggccgag
gtgccaccgaccgcgccgcccctcctgcttgcacctctcccggcggcggcccccgccaag
ccactccgaggcccggcggccggcggcgcgcacctgtactgccccctgcggctgcccgca
gccctgcaggcggcctcagtccgccgccctggcccgcacctgccgtacccggtggagacg
ctcctagcctga
>gi568815592f:1212705_1413913|GENSCAN_predicted_peptide_5|202_aa
MGLRGQRALVTLRLPCLRACAVSQTQGEEASMVGPLLPPSSMKPSPPTGPFGNLCAQLED
GGMQKPGAVQSRSSHSAGSSSLPLSGGQTSRAAGPQYEYPLKSQAGVYLKPPPNLFVHLE
NPTWCMGWVIIMATLQPAALPHVLEARELVTFPTVLSPCPKGSPGSQGILDFAGSSSVIH
SEESVAELTPTSADFAPTDSAD
>gi568815592f:1212705_1413913|GENSCAN_predicted_CDS_5|609_bp
atggggctgcgtggccaaagggccctggtcaccctgcgcctcccctgcctgcgagcctgt
gcagtcagccagacacaaggcgaggaagccagcatggtcggcccgctcttgcccccgtcc
tccatgaagccttctcctcctactggcccttttggcaacctctgtgcccagctggaggat
ggagggatgcagaagccaggggccgtgcaaagcaggagcagccactctgcaggcagcagc
tctctccccctctctgggggacaaacatcacgggctgctggtccccaatatgaataccct
ctgaagtctcaggctggtgtctacctgaaacctccacccaacctgtttgtacacttggaa
aacccaacctggtgcatgggctgggtgataatcatggcaacactgcagcctgcagccctc
cctcatgttctggaggccagggaacttgtaacattcccaacagttctgagtccatgccca
aaaggttctccaggatctcagggaatactggattttgcaggatcctcctccgtgattcat
tcagaagagtccgtggctgagctgacaccaacgtcagctgattttgccccaacagactct
gccgactga
>gi568815592f:1212705_1413913|GENSCAN_predicted_peptide_6|102_aa
MKLGKSKREKKLAKEKEGFPISGTKSKAQAVLPIKAQASPLFLSAVDYDVRIHYAHRGYL
LPHKLVDKEIVGEKMVSKASMQSFIGQFPPRTTKSISKDFSG
>gi568815592f:1212705_1413913|GENSCAN_predicted_CDS_6|309_bp
atgaaacttggaaaaagtaaaagggaaaagaaactcgcaaaggaaaaagaaggctttcct
atctcaggaactaagtcaaaagcccaggctgtgctgccaataaaagcgcaggccagcccg
ctgtttttaagtgccgtggactatgatgtcagaattcattacgcacacaggggctattta
ctgcctcataaacttgtagacaaggaaattgttggagaaaaaatggtttccaaagcttca
atgcaatcattcattgggcagttccccccaaggaccaccaagtccatctccaaggacttt
tctggctga
>gi568815592f:1212705_1413913|GENSCAN_predicted_peptide_7|43_aa
MPRIAGGAECFLQLGAVATPVWENAFREGWHRGRRLEAAGSHP
>gi568815592f:1212705_1413913|GENSCAN_predicted_CDS_7|132_bp
atgccccgaattgctgggggtgcggagtgctttctgcagctgggcgcagtggcgaccccg
gtgtgggagaacgccttccgcgaggggtggcaccgagggcggcgtctagaagctgcaggc
agtcacccctga
>gi568815592f:1212705_1413913|GENSCAN_predicted_peptide_8|42_aa
MLQKPARLRVTHRPASAAGVAGPRVAGRREPRAARLVCEDCV
>gi568815592f:1212705_1413913|GENSCAN_predicted_CDS_8|129_bp
atgctgcagaagcctgcccgcctgcgggtcacacaccggccggcctcggctgcaggggtt
gcggggccacgggtcgcggggcgccgggagccgcgagcagcccgtctggtgtgcgaggat
tgtgtttga
>gi568815592f:1212705_1413913|GENSCAN_predicted_peptide_9|547_aa
MTTEGGPPPAPLRRACSPVPGALQAALMSPPPAAAAAAAAAPETTSSSSSSSSASCASSS
SSSNSASAPSAACKSAGGGGAGAGSGGAKKASSGLRRPEKPPYSYIALIVMAIQSSPSKR
LTLSEIYQFLQARFPFFRGAYQGWKNSVRHNLSLNECFIKLPKGLGRPGKGHYWTIDPAS
EFMFEEGSFRRRPRGFRRKCQALKPMYHRVVSGLGFGASLLPQGFDFQAPPSAPLGCHSQ
GGYGGLDMMPAGYDAGAGAPSHAHPHHHHHHHVPHMSPNPGSTYMASCPVPAGPGGVGAA
GGGGGGDYGPDSSSSPVPSSPAMASAIECHSPYTSPAAHWSSPGASPYLKQPPALTPSSN
PAASAGLHSSMSSYSLEQSYLHQNAREDLSGAALEFSIKPCEHFRKELGLLSSRSAHQSA
DGRLRGQVPSANPRAERPMDFCFTWLGLNSGEGTFGNAETVLTVALQKRVVATGILDLKA
RDVAKHPPTQMLLQVNKPTGTQEESQQVTAHPAPDEKGDVRNAAASHLALRGILTSGNAR
SRKSNTE
>gi568815592f:1212705_1413913|GENSCAN_predicted_CDS_9|1644_bp
atgaccaccgagggcgggccgccgccggccccgctccgccgcgcgtgcagcccggtcccc
ggcgcgctccaggccgccctgatgagcccgccgcccgccgccgccgccgccgccgccgcc
gccccggagaccacctcctcctcctcgtcgtcgtcctccgcctcctgcgcctcgtcctcg
tcctcctccaattcggccagcgccccctcggctgcctgcaagagcgcgggcggcggcggc
gcgggcgccgggagcgggggcgccaagaaggcgagctcggggctgcggcggcccgagaag
ccgccctactcgtacatcgcgctcatcgtcatggccatccagagctcgcccagcaagcgc
ctgacgctcagcgagatctaccagttcctgcaggcgcgcttccccttcttccgcggcgcc
taccagggctggaagaactcggtgcgccacaatctctcgctcaacgagtgcttcatcaag
ctgcctaagggcctcgggcggcccggcaagggccactactggaccatcgacccggccagc
gagttcatgttcgaggagggctcgttccgccgccggccgcgcggcttcaggcggaagtgc
caggcgctcaagcccatgtaccaccgcgtggtgagcggcttgggcttcggggcgtcgctg
ctgccccagggcttcgacttccaggcgcccccgtcggcgccgctcggctgccacagccag
ggcggctacggcggcctcgacatgatgcccgcgggctacgacgccggcgcgggcgccccc
agccacgcgcaccctcaccaccaccaccaccaccacgtcccgcacatgtcgcccaacccg
ggttccacctacatggccagctgcccggtgcccgcgggacccgggggcgtcggtgcggcc
gggggcggcggcggcggcgactacgggccggacagcagcagcagcccggtaccctcgtcc
ccggccatggcgagcgccatcgaatgccactcgccctacacgagccctgcggcgcactgg
agctcgcctggcgcctcgccttacctcaagcagccgcctgccctgacgcccagcagcaac
cccgccgcctcggcaggcctgcactccagcatgtcctcctactcgctggagcagagctac
ttgcaccagaacgctcgcgaggacctctcaggggctgccctggagttctccattaagccc
tgcgagcacttcaggaaggagctcgggcttctgtcatcccgctcagcacatcaaagcgcc
gacggccgccttcgtgggcaggtccccagtgccaacccacgagctgaaagacccatggac
ttctgttttacttggcttggcctgaattctggggaagggacatttggcaatgctgagaca
gttttgactgtggcactgcagaagagggtggttgctactggcatcttggatctaaaggcc
agggatgttgctaaacatcccccgacgcagatgctgctccaggtgaacaaacccacaggg
actcaggaggaatcccagcaagtgacggctcatcctgccccagatgaaaaaggagatgtt
cgtaacgcagctgcttcccacctcgcacttcggggcattcttacatctggcaacgctaga
agccgcaagtcaaacacagaataa