GENSCAN 1.0    Date run: 4-Nov-116    Time: 02:42:35

Sequence gi568815597f:203383185_203586878 : 203694 bp : 47.30% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  10299  10338   40                              -2.46
 1.01 Init +  18454  18572  119  0  2   75   52   107 0.322   5.47
 1.02 Intr +  22950  23016   67  0  1   77   40    76 0.229   0.61
 1.03 Term +  29381  29614  234  1  0   85   54   168 0.258   9.42
 1.04 PlyA +  30185  30190    6                               1.05

 2.00 Prom +  30720  30759   40                              -6.36
 2.01 Sngl +  32189  33565 1377  1  0   68   43   488 0.994  38.42
 2.02 PlyA +  33698  33703    6                               1.05

 3.09 PlyA -  34496  34491    6                               1.05
 3.08 Term -  39525  39355  171  0  0  101   44    49 0.197  -0.37
 3.07 Intr -  40059  39939  121  2  1   73   56    36 0.048  -0.60
 3.06 Intr -  46117  45894  224  1  2   77   92   122 0.251   8.43
 3.05 Intr -  47370  47322   49  2  1   61  100    15 0.329  -1.32
 3.04 Intr -  55047  54876  172  1  1  140   96    45 0.440   9.50
 3.03 Intr -  56759  56723   37  2  1   70   77    10 0.121  -4.06
 3.02 Intr -  61101  60985  117  0  0   92   84    46 0.376   5.26
 3.01 Init -  64220  64176   45  2  0   97   83    12 0.172   2.28
 3.00 Prom -  69622  69583   40                              -1.66

 4.03 PlyA -  72901  72896    6                               1.05
 4.02 Term -  73697  73584  114  2  0  105   42    29 0.048  -1.43
 4.01 Init -  82220  82071  150  2  0   97   22   111 0.080   5.44
 4.00 Prom -  89762  89723   40                              -3.06

 5.00 Prom +  96578  96617   40                              -6.66
 5.01 Init +  97791  97848   58  2  1   69   42    82 0.661   1.17
 5.02 Intr + 100194 100973  780  1  0  107   70  1154 0.867 106.95
 5.03 Term + 103522 103697  176  2  2  120   47   385 0.830  35.52
 5.04 PlyA + 105181 105186    6                              -0.45

 6.00 Prom + 106227 106266   40                              -2.96
 6.01 Init + 112822 113052  231  0  0   63   78   269 0.997  19.67
 6.02 Intr + 113793 113931  139  2  1   70  115    35 0.858   4.44
 6.03 Intr + 115497 115655  159  1  0   51   27   208 0.989  10.96
 6.04 Intr + 116465 116667  203  0  2   77   96   281 0.999  26.80
 6.05 Intr + 119730 119825   96  2  0  108  110   107 0.963  15.11
 6.06 Intr + 120366 120532  167  2  2   74   22   220 0.064  12.76
 6.07 Intr + 128788 128891  104  1  2  122   -6    62 0.067   0.02
 6.08 Intr + 130459 130542   84  2  0  119   95    -2 0.213   3.39
 6.09 Intr + 133426 133477   52  2  1  107   62     0 0.092  -2.73
 6.10 Term + 134272 134500  229  1  1  109   47    89 0.380   3.00
 6.11 PlyA + 134569 134574    6                               1.05

 7.02 PlyA - 137831 137826    6                               1.05
 7.01 Sngl - 146527 145934  594  1  0   40   54   356 0.976  23.60
 7.00 Prom - 161539 161500   40                              -3.66

 8.00 Prom + 163107 163146   40                              -6.16
 8.01 Init + 165484 165571   88  0  1   77  110    37 0.376   5.60
 8.02 Intr + 172477 172665  189  2  0  122   28    69 0.396   3.86
 8.03 Term + 178728 178903  176  1  2   19   55   455 0.993  33.22
 8.04 PlyA + 181627 181632    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -   6638   6410  229  1  1   46   47   136 0.837   1.40
S.002 Intr -   7236   7178   59  0  2  147   82    27 0.848   5.68
S.003 Term + 120366 120536  171  2  0   74   36   245 0.935  15.73

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:203383185_203586878|GENSCAN_predicted_peptide_1|139_aa
MEAKREDSFTRTAETPAVPAESSREAPAMEGGNVPREFDQLPASQCKLKRCFYNICRHQS
GWERSSFPAMEQSWTENVFDELREGFRRSVITNFSELREDVRTHRKEAKNLEKRLDKWLT
RINSVEKTLNDLMELKTMA

>gi568815597f:203383185_203586878|GENSCAN_predicted_CDS_1|420_bp
atggaagccaaaagggaagacagcttcaccaggacagcggagacgccagcggtgcctgca
gagtcctccagagaggcccctgccatggagggtggaaacgtgcctcgtgaatttgatcag
ttaccagcaagccagtgtaaacttaagcgctgcttctacaacatctgcagacaccagtca
gggtgggaacgcagctcctttccagcaatggaacaaagctggacggagaatgtctttgac
gagttgagagaaggcttcagacgatcagtaataacaaacttttccgagctaagggaggat
gttcgaacccatcgcaaagaagctaaaaaccttgaaaaaagattagacaaatggctaact
agaataaacagtgtagagaagaccttaaatgacttgatggagctgaaaaccatggcatga

>gi568815597f:203383185_203586878|GENSCAN_predicted_peptide_2|458_aa
MGEFLDTHTLPRLNQEEAESLNRPITGSEIEAIINSLSTKKSPGPDRFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYEASFILIPKPGGDTTEKENFIPISLMNIDAKILSKILV
NQIQEHIKKLIHHDQVGFIPGMQGWFNICKSINAIHHINRTKDKNHMIISIDAEKAFDKI
QQPFMLQTLNKIGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPFLF
NIVLEVLARAIRQEKEITCIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSEV
SGYKINVQKSQAFLYTNNRPTESRMSEFPFTITTKRIKYLGIQLTRDVKDLFKENYKPLL
NEIKEDRNKWKNIPCSWIGRINIVKMTILPKVIYRFNAIPIKLPMTFFTELEKTTVKFIW
NQKRGGIAKTILSQKNKAGGIMLPDFKLYYKATVTKTA

>gi568815597f:203383185_203586878|GENSCAN_predicted_CDS_2|1377_bp
atgggtgaattcctggacacacacaccctcccaagactaaaccaggaagaagctgaatcc
ctgaatagaccaataacaggctctgaaattgaggcaataattaatagcctatcgaccaaa
aaaagtccaggaccagaccgattcacagccgaattctaccagaggtacaaagaggagctg
gtaccattccttctgaagctattccaatcaatagaaaaagaaggaatcctccctaactca
ttttatgaggccagcttcatcctgataccaaagcctggcggagacacaacagaaaaagag
aattttataccaatatccctgatgaacatcgatgcaaaaatcctcagtaaaatactggta
aaccaaatccaggagcacatcaaaaagcttatccaccacgatcaagttggcttcatccct
gggatgcaaggctggttcaacatatgcaaatcaataaatgcaatccatcatataaacaga
accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt
caacagcccttcatgctacaaactctcaataaaataggtattgatgggacatacctcaaa
ataataagagctatttatgacaaacccacagccaatatcatactgaatgggcaaaaactg
gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccattcctattc
aacatagtgttggaagttctggccagggcaatcaggcaggagaaagaaataacgtgtatt
caattaggaaaagaagaagtcaaattgtccctgtttgcagatgacatgattgtatattta
gagaatcccattgtctcagcccaaaatctcctcaagctgataagcaacttcagcgaagtc
tcaggatacaaaatcaatgtgcaaaaatcacaagcattcctatacaccaataacagacca
acagagagccgaatgagtgaattcccattcacaattactacaaagagaataaaataccta
ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc
aatgaaataaaagaggacagaaacaaatggaagaacattccatgctcatggataggaaga
atcaatattgtgaaaatgactatactgcccaaggtaatttatagattcaatgccatcccc
atcaagctaccaatgactttcttcacagaattagaaaaaactactgtaaagttcatatgg
aaccaaaaaagaggcggcattgccaagacaatcctcagccaaaagaacaaagctggaggc
atcatgctacctgacttcaaactatactacaaggctacggtaaccaaaacagcatga

>gi568815597f:203383185_203586878|GENSCAN_predicted_peptide_3|311_aa
MEVMAAKSWWKREVGMLTFGTQPLCCKEAHATWRDTLEEEPKPSPNSQCQLASHWSLLRG
PHLLKSDGQRTDDIPHIRVAPPTSHSRPDISALLNARLWRRPYLCPPHRSSSCRPGQTET
KQLGGHLAMSGDNFGCHNWKVFGIVPALNKSKQLQFSGLNSGHLSCSYTVYLCLSTQFIH
QSGSVGQTRQVVPRVRSSAVDHDGTLENEREVTAQLLEQTVSSGSVSSLQPGKFYSFQSR
QRRLSISSTIKMGQTALTRTIPPLLLLPALLHVKAEFIIPFSNGILDIPLQPLINWIDSL
CLFLFTNSQGC

>gi568815597f:203383185_203586878|GENSCAN_predicted_CDS_3|936_bp
atggaggtcatggctgcaaaaagctggtggaaaagggaagttggaatgctcacctttgga
acccagccactatgttgcaaggaggcccatgccacatggagagatacacttgaagaggaa
ccaaagccttctcccaacagccagtgccaacttgccagccattggtctctcctgcgagga
cctcaccttctgaagtcagatgggcagcggactgatgatatcccgcatattagggtggcc
cctcccacatcccattcccgcccagacatctcagccctgctcaatgcccgcctctggcgg
agaccttatctctgcccaccacaccgcagctccagctgccgtcctgggcagactgaaacc
aaacagctaggagggcatttggcaatgtctggagacaattttggttgtcacaactggaag
gtttttggcattgtgcctgcactgaataaaagcaagcagctccagttctcggggctgaac
tccggccacttgagctgctcttacactgtatacctgtgtctgagtactcagttcatccac
cagtcagggtctgtgggacagactaggcaggtggtgccccgtgtgaggagcagcgcagtg
gatcatgatggaaccctcgaaaatgaacgtgaagtgactgcgcaactcttggagcagaca
gtgtcatctggctcagtctccagccttcagccaggaaaattctacagctttcagagcaga
cagagaagactcagtatctcttcaaccatcaagatgggacagactgccctaactcggaca
atacctcctcttttacttctccctgcccttttgcatgtcaaagcagaattcatcattccg
ttctccaatggaatattggatattccattacagcctttaatcaactggattgatagtctc
tgcctcttcctcttcaccaatagtcagggatgctag

>gi568815597f:203383185_203586878|GENSCAN_predicted_peptide_4|87_aa
MEDPEHCHEVSMPRDTVFPSSVYAGFGVINNGFSQESSRPRLRAREVIYWSQHSSKNNPV
TPVASNLRGKPKALGPLASPSPLLLLL

>gi568815597f:203383185_203586878|GENSCAN_predicted_CDS_4|264_bp
atggaggatccagagcactgtcatgaggtgtccatgccaagagacactgtgtttccttct
agtgtttatgctggctttggcgtcatcaacaatggatttagccaagagagctccaggccg
aggctacgagctcgggaagtaatttactggtctcaacacagcagcaagaacaatcctgtt
actccagtggcttccaatctcagaggaaaacccaaggctctgggccctttggcttccccg
tccccactcctcttacttctctga

>gi568815597f:203383185_203586878|GENSCAN_predicted_peptide_5|337_aa
MKTRAALALSQGNPAPPSTGPPSIFPDCPRECYCPPDFPSALYCDSRNLRKVPVIPPRIH
YLYLQNNFITELPVESFQNATGLRWINLDNNRIRKIDQRVLEKLPGLVFLYMEKNQLEEV
PSALPRNLEQLRLSQNHISRIPPGVFSKLENLLLLDLQHNRLSDGVFKPDTFHGLKNLMQ
LNLAHNILRKMPPRVPTAIHQLYLDSNKIETIPNGYFKSFPNLAFIRLNYNKLTDRGLPK
NSFNISNLLVLHLSHNRISSVPAINNRLEHLYLNNNSIEKINGTQICPNDLVAFHDFSSD
LENVPHLRYLRLDGNYLKPPIPLDLMMCFRLLQSVVI

>gi568815597f:203383185_203586878|GENSCAN_predicted_CDS_5|1014_bp
atgaagaccagggctgccctggcactgagccaaggaaaccccgccccgcccagcacaggc
cctccatctatcttccctgactgtccccgcgaatgctactgcccccctgatttcccatct
gccctctactgtgatagccgcaacctgcgaaaggtccctgtcatcccgccccgcatccat
tacctctatctccagaacaacttcatcactgagctcccggtggagtccttccagaatgcc
acaggcctgcgatggattaacctggacaacaaccgaatccgcaagatagaccagagggtg
ctggagaaactgcccggcctggtgttcctctacatggagaagaaccagttggaagaggtc
ccctcggccctgccccggaacctggagcagctgaggctgagccagaaccacatctccaga
atcccgcctggtgtcttcagcaagctggagaacctgctgctcctggatctccagcacaac
aggctgagcgacggcgtcttcaagcccgacaccttccatggcctcaagaacctcatgcag
ctcaacctggcccacaacatcctgagaaagatgccgcccagggtccccaccgccattcac
cagctctacctggacagtaacaagattgagaccatccctaacggatacttcaagagcttt
cccaatcttgccttcattcggcttaactacaacaagctgacagacaggggactccccaag
aactcctttaatatctccaacctgcttgtgctccacctgtcccacaacaggatcagcagt
gtgcccgccatcaacaacaggctggaacacctgtacctcaacaacaatagcatcgagaaa
atcaacggaacccagatttgccccaacgacctagtggcgttccatgacttctcctcggac
ctggagaacgtgccacacctgcgctacctgcggctggatggaaactacttgaagccgccc
atcccgctggacctcatgatgtgcttccgcctcctgcagtccgtggtcatctag

>gi568815597f:203383185_203586878|GENSCAN_predicted_peptide_6|487_aa
MRLLAFLSLLALVLQETGTASLPRKERKRREEQMPREGDSFEVLPLRNDVLNPDNYGEVI
DLSNYEELTDYGDQLPEVKVTSLAPATSISPAKSTTAPGTPSSNPTMTRPTTAGLLLSSQ
PNHGLPTCLVCVCLGSSVYCDDIDLEDIPPLPRRTAYLYARFNRISRIRAEDFKGLTKLK
RIDLSNNLISSIDNDAFRLLHALQDLILPENQLEALPVLPSGIEFLDVRLNRLQSSGIQP
AAFRAMEKLQFLYLSDNLLDSIPGPLPLSLRSVHLQNNLIETMQRDVFCDPEEHKHTRRQ
LEDIRLDGNPINLSLFPSAYFCLPRLPIGRFTALQFERVQHQTYCLITSMKSRNCPLTSC
LEYRKRRRLLREGSPQGMRDKLPNPVADGSASLHGPQFCGTDDRLGTAERERYGSWGGYT
WGAVTVGNNANSDRHGARDRKGALGWGGLEDSSEEAAGELTLPLLIRLMSLPSGQAAMEG
VPTLTAA

>gi568815597f:203383185_203586878|GENSCAN_predicted_CDS_6|1464_bp
atgaggctcctggctttcctgagtctgctggccttggtgctgcaggagacagggacagct
tctctcccaaggaaggagaggaagaggagagaggagcagatgcccagggaaggcgattcc
tttgaagttctgcctctgcggaatgatgtcctgaacccagacaactatggtgaagtcatt
gacctgagcaactatgaggagctcacagattatggggaccaactccccgaggttaaggtg
actagcctcgctcctgcaaccagcatcagtcccgccaagagcactacggctccagggaca
ccctcgtcaaaccccacgatgaccagacctactacagcagggctgctactgagttcccag
cccaaccatggtctgcccacctgcctggtctgcgtgtgcctcggttcctctgtgtattgc
gatgacattgacctagaggacattcctcctcttcctcggaggactgcctacctgtatgca
cgcttcaaccgcatcagccgtatcagggccgaagacttcaaagggctgacaaagttgaag
aggattgacctctccaacaacctcatttcctccatcgataatgatgccttccgcctgcta
catgccctccaggacctcatcctcccagagaaccagttggaagctctgcccgtgctgccc
agtggcattgagttcctggatgtccgcctaaatcggctccagagctcggggatacagcct
gcagccttcagggcaatggagaagctgcagttcctttacctgtcagacaacctgctggat
tctatcccggggcctttgcccctgagcctgcgctctgtacacctgcagaataacctgata
gagaccatgcagagagacgtcttctgtgaccccgaggagcacaaacacacccgcaggcag
ctggaagacatccgcctggatggcaaccccatcaacctcagcctcttccccagcgcctac
ttctgcctgcctcggctccccatcggccgcttcacagcattacagtttgagagggtacag
caccaaacttactgcctcattacttccatgaagtccaggaactgcccacttacctcctgt
ctggaataccggaagagaagaaggctactcagagaggggtccccacaagggatgagggac
aagttgcctaaccctgtggcagacggctctgcttctcttcatggtcctcagttctgtggc
acagatgacaggttggggacagcagagagggaaaggtatgggtcctggggaggttatact
tggggggcagtaactgtaggcaataatgctaacagtgacagacatggtgctcgggacagg
aagggagccttgggctggggtgggctggaagattcctcggaggaggcagccggtgaactg
acactgcccctcctcattcgcttaatgagcctgccctcaggccaggcagccatggagggc
gtgcccacactgacagctgcctaa

>gi568815597f:203383185_203586878|GENSCAN_predicted_peptide_7|197_aa
MVKEHPTAWGQQGNRNWGDSTLLHGSSKETGNGDSILLHGGAGKQELGAQHPTAWGQQGN
RNWGHSILLHGGSKETGTGGTASYCMGAAGKHELGDSILLHGGSKETGNGYSILLHGGSR
EMGTGQTASYCMGAAGKQELGGTASYCMGAAGKRELGGQHPTAQKEENPRGCVGGGSFSE
NKQFSEFRHIPEARKET

>gi568815597f:203383185_203586878|GENSCAN_predicted_CDS_7|594_bp
atggtaaaagagcatcctactgcgtgggggcagcagggaaacaggaactggggggacagc
accctactgcatgggagcagcaaggaaacaggaaatggggacagcatcctactgcatggg
ggcgcagggaaacaggaactgggggcacagcatcctactgcatgggggcagcaaggaaac
aggaactgggggcacagcatcctactgcatgggggcagcaaggaaacaggaactgggggc
acagcatcctactgcatgggggcagcagggaaacatgaactgggggacagcatcctactg
catgggggcagcaaggaaacaggaaatgggtacagcatcctactgcatgggggcagcagg
gaaatgggaactgggcagacagcatcctactgcatgggggcagcagggaaacaggaactg
ggggggacagcatcctactgcatgggggcagcagggaaacgagaactggggggacagcat
cctactgcacagaaagaggagaaccccaggggatgtgtgggtggtggcagcttttcagag
aacaagcagttctctgagtttcggcacattcctgaggctagaaaggagacctga

>gi568815597f:203383185_203586878|GENSCAN_predicted_peptide_8|150_aa
MSVNDQVIIQGTEDKKTHSLGSPSKNIHAGAPSWATPPAPSHCGKIQVSKMASQIFLRWR
GSCPLEKRQPPSATERKLYSPAAGPSRPLTDAEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEEERRKKKNKNKNKKKKRKRKKKKCCL

>gi568815597f:203383185_203586878|GENSCAN_predicted_CDS_8|453_bp
atgtcagtaaatgatcaggttattattcaaggaacagaagacaagaagacacactcccta
ggaagtcccagcaagaatattcatgcaggtgcaccttcctgggccacacccccagccccc
tcacactgtgggaaaatccaggtatccaaaatggcaagccagattttcctgcggtggcgt
ggcagctgccccttggaaaagaggcagcccccttcagctactgagagaaagctgtactcc
ccagcagctggtcccagcaggcccctcacagatgcagaagaagaagaagaagaagaagaa
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaggaggaagaggaagag
gaagaggaagaagaagaaagaaggaagaagaagaacaagaacaagaacaagaaaaagaag
aggaagaggaagaagaagaaatgctgcctttga