GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:20:38 Sequence gi568815597f:99546318_99794059 : 247742 bp : 38.03% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4114 4207 94 2 1 90 47 97 0.551 3.90 1.02 Term + 7469 7847 379 2 1 61 47 165 0.775 2.98 1.03 PlyA + 7894 7899 6 1.05 2.00 Prom + 15299 15338 40 -4.15 2.01 Init + 15861 15936 76 2 1 84 89 72 0.860 8.20 2.02 Intr + 24906 25006 101 1 2 -31 92 129 0.252 0.31 2.03 Term + 26916 27044 129 2 0 40 32 105 0.250 -2.70 2.04 PlyA + 27911 27916 6 1.05 3.00 Prom + 35663 35702 40 -0.35 3.01 Sngl + 42890 43357 468 1 0 65 49 249 0.884 14.58 3.02 PlyA + 43385 43390 6 1.05 4.00 Prom + 47076 47115 40 -8.55 4.01 Init + 50289 50492 204 2 0 38 43 135 0.094 2.90 4.02 Intr + 65786 65843 58 1 1 105 68 51 0.024 2.34 4.03 Term + 75530 75690 161 0 2 96 47 128 0.904 6.62 4.04 PlyA + 76047 76052 6 1.05 5.00 Prom + 82692 82731 40 -5.55 5.01 Init + 99100 99232 133 0 1 87 48 63 0.388 2.55 5.02 Term + 103207 103475 269 2 2 10 49 214 0.514 4.37 5.03 PlyA + 103841 103846 6 1.05 6.00 Prom + 119082 119121 40 -6.05 6.01 Init + 121345 121449 105 0 0 85 80 117 0.809 10.87 6.02 Intr + 126206 126229 24 1 0 90 100 13 0.255 0.00 6.03 Intr + 139552 139675 124 0 1 71 87 32 0.060 0.64 6.04 Intr + 142458 143555 1098 1 0 54 90 972 0.629 82.97 6.05 Intr + 151406 151577 172 0 1 85 95 148 0.270 13.28 6.06 Intr + 152223 152383 161 0 2 74 43 166 0.997 9.41 6.07 Intr + 152665 152843 179 2 2 10 27 283 0.396 13.12 6.08 Intr + 158013 158087 75 2 0 44 78 67 0.137 0.09 6.09 Intr + 158338 158638 301 0 1 2 64 285 0.351 12.88 6.10 Term + 158872 158981 110 2 2 90 41 52 0.355 -1.61 6.11 PlyA + 159341 159346 6 1.05 7.13 PlyA - 159443 159438 6 1.05 7.12 Term - 162803 162711 93 2 0 102 53 7 0.388 -4.55 7.11 Intr - 162942 162881 62 1 2 53 107 56 0.409 1.53 7.10 Intr - 164632 164489 144 2 0 69 86 148 0.428 12.03 7.09 Intr - 167718 167688 31 0 1 132 64 -18 0.413 -3.01 7.08 Intr - 169355 169269 87 2 0 101 110 -2 0.540 2.55 7.07 Intr - 171208 171093 116 2 2 83 96 33 0.609 2.85 7.06 Intr - 173330 173217 114 0 0 107 78 42 0.764 4.60 7.05 Intr - 191951 191765 187 2 1 89 10 148 0.032 5.44 7.04 Intr - 194623 194476 148 0 1 64 100 53 0.837 3.42 7.03 Intr - 195956 195862 95 2 2 96 65 22 0.056 -1.46 7.02 Intr - 201113 200977 137 0 2 95 72 124 0.145 10.87 7.01 Init - 202451 202256 196 2 1 50 81 83 0.537 2.95 7.00 Prom - 203643 203604 40 -7.25 8.00 Prom + 203735 203774 40 -7.05 8.01 Init + 204148 204286 139 0 1 36 9 138 0.467 1.05 8.02 Term + 204842 204969 128 0 2 107 44 127 0.980 7.66 8.03 PlyA + 205069 205074 6 1.05 9.00 Prom + 211189 211228 40 -4.65 9.01 Sngl + 235463 235756 294 1 0 74 42 211 0.433 10.55 9.02 PlyA + 237281 237286 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 140759 140872 114 0 0 56 71 84 0.801 3.00 S.002 Sngl + 200218 200442 225 0 0 67 42 251 0.804 13.29 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:99546318_99794059|GENSCAN_predicted_peptide_1|157_aa XPRTDSSFLVAAVSEIFMANSLLEASHSEQERNNRSPRWSNQARQRNKSTQIAKEEIKLS LYASDVIIYLEKPKHSSKRLLDLINEFSKVSGYKINVHKPAALLHTNDNRAENQIKNSIP FATAAKTNKKIPSNILNQGGERFLQGELQNTAETNHR >gi568815597f:99546318_99794059|GENSCAN_predicted_CDS_1|474_bp ntaccccgcacagactcgagctttctcgtggctgctgtttcagaaatattcatggccaat tcgctgctcgaagcctcacacagtgagcaggagagaaacaatagaagtcctagatggagc aatcaggcaagacaaagaaataaaagcacccaaattgcaaaagaggaaatcaaactatca ctgtatgccagtgatgtgatcatatacctagaaaagcctaaacactcctccaaaagactc ctagatttgataaatgaattcagtaaagtctcaggttacaaaatcaacgtacacaaacca gcagcactgctacacaccaacgacaaccgagctgagaatcaaatcaagaactcaatccct tttgcaacagctgcaaaaacaaacaaaaaaatacctagcaatatacttaaccaaggaggt gaaagatttctacaaggagaactacaaaacactgctgaaacaaatcatagatga >gi568815597f:99546318_99794059|GENSCAN_predicted_peptide_2|101_aa MDGAGSYYPQRTNAGTENQAPHVLTCTYTCNGLGKAADTKEAHITNTWLLISSSEFGEIN LQQLQANSTPPNSHQLKASDPQVTSTCKAIWGAMGQCLSPT >gi568815597f:99546318_99794059|GENSCAN_predicted_CDS_2|306_bp atggatggagctggaagctattatcctcagcgaactaatgcaggaacagaaaaccaagca ccgcacgttctcacttgcacgtatacctgcaacggacttggaaaagctgctgacacaaag gaagctcacatcacaaacacctggcttttaatatctagctcagagtttggcgaaataaac ttgcagcagctccaggctaattccacacccccaaactcccaccagctcaaggcctctgat ccacaagtcaccagcacctgcaaagccatctggggtgcaatggggcaatgcctctctcct acttaa >gi568815597f:99546318_99794059|GENSCAN_predicted_peptide_3|155_aa MDKSTTTLLPISSEFQRSQKPHLSSHNPPNPKWSNTLLDLERMICLRKQGQQWESVRKSF LYTTKGKRTEQLLFPVQRPFVREHAHLCFPTFPIWLFFIKLNLFDPCQQAQLELPPSRHP LALCIIVLSYGKAAGVYHTLGPQPKTRGAYVSVIP >gi568815597f:99546318_99794059|GENSCAN_predicted_CDS_3|468_bp atggacaagtctacaacaacactgctacccatcagcagtgagttccaaaggtcacaaaag ccacaccttagttcacacaacccaccaaacccgaagtggagcaacacactcctagacctt gagagaatgatttgcctaagaaaacaaggacaacagtgggagtctgttagaaagagcttc ctctatacgaccaagggaaaaagaacagaacagcttctgtttccagtgcagcgacccttt gtgagagaacatgctcacctttgttttcctacctttccaatctggctcttcttcatcaag ctgaacctctttgatccctgccagcaggcacagcttgagctgccaccttccagacatccc ctagcactctgcatcattgtcctgtcctatggtaaagcagcaggtgtctaccacacactc ggacctcagcccaagaccagaggagcttatgtctcagttattccatag >gi568815597f:99546318_99794059|GENSCAN_predicted_peptide_4|140_aa MSYMAAGKKRMRAKQKGFPLVKPSDLVRLIHYHENSMGETIPVMQLSPTESLPQHMGIMG ATTEDEIWDLKSEGTRYDGRNLALDSYAYSHMLLLAGTESIKTYVVILHLLTSDPIEDSV VRAESESGKAHSRAGREAMP >gi568815597f:99546318_99794059|GENSCAN_predicted_CDS_4|423_bp atgtcttacatggcagctggcaaaaagagaatgagagccaagcaaaaaggatttcccctt gtaaaaccatcagatctcgtgagacttattcactaccatgagaacagtatgggagaaacc atcccagtgatgcaattatctcccactgagtccctcccacagcacatgggaattatggga gctacaactgaagatgagatttgggacctgaaatcagagggaacaaggtatgatggacga aatttggcattagatagctatgcatactctcacatgttgctgcttgcagggactgagtct atcaaaacatatgtagtaatattacaccttttaacaagtgatcctattgaggattcagtg gtccgtgctgagtccgaaagtggaaaagcacatagcagagctggccgtgaagcaatgcct tga >gi568815597f:99546318_99794059|GENSCAN_predicted_peptide_5|133_aa MGTFLKAKSVISEQRPRKKTAFHRNPSQAITKGGTWCLSHTKYLELSLQSYCLKVNVAHT EEQRKGPGVSPQKHKDSLRMSNIPYTLNISSTANTFSYPHAYSDIGKYHYKYGYAQTGKL DLKLSCDHRLSSI >gi568815597f:99546318_99794059|GENSCAN_predicted_CDS_5|402_bp atgggcaccttcctgaaagccaagagtgtcatttctgaacagcgtccaagaaagaaaact gcattccatagaaatccaagtcaagcaataacaaaaggtggcacttggtgcttaagccac actaaataccttgagttatccctacagagttactgtctgaaagtgaacgtggcccacact gaagaacaacgaaaaggcccaggggtctctcctcagaaacacaaagattcccttaggatg agtaacattccgtatactcttaacatctcctccacagctaatacattttcttacccccat gcctacagcgacattggaaaataccattacaaatatggatatgctcaaactggaaaatta gatcttaaactgtcttgcgatcatagactcagttctatttga >gi568815597f:99546318_99794059|GENSCAN_predicted_peptide_6|782_aa MASRWNQQRKRTGRDEEAKSTRPAPDPGSRTKYPQPTPKMTPPRRGHDAAPWSSFESDTQ LYTGQDTSSECLQQSSFKCLSLCCSLYAMEIKVEKDLKTGESTVLSSIPLPSDDFKGTGI KVYDDGQKSVYAVSSNHSAAYNGTDGLAPVEVEELLRQASERNSKSPTEYHEPVYANPFY RPTTPQRETVTPGPNFQERIKIKTNGLGIGVNESIHNMGNGLSEERGNNFNHISPIPPVP HPRSVIQQAEEKLHTPQKRLMTPWEESNVMQDKDAPSPKPRLSPRETIFGKSEHQNSSPT CQEDEEDVRYNIVHSLPPDINDTEPVTMIFMGYQQAEDSEEDKKFLTGYDGIIHAELVVI DDEEEEDEGEAEKPSYHPIAPHSQVYQPAKPTPLPRKRSEASPHENTNHKSPHKNSISLK EQEESLGSPVHHSPFDAQTTGDGTEDPSLTDFPRIWMTLTVLKIAEYVFCTRVQTMGYNP TQLAAQVGPALAVESSFSRLLCPFDISLILPKIKFTKPDISIEDVAKKLDEMWNNLNDSE KEPYLTALVQSHAAMKKNLRWGEIELGSQSQTISITKAAKLKEKYEKDVADYKSRKLDGA KGPAKVAQKQVEEEDEEDEEEDGQNKKIPMLDSKSGKELELDKLAFADTQAAGRGEEHIA EDTSSWTARGCGWEHARGRAHDRSQHADRPRLAEGGTVGGVWPGQSEESPGSLAAQLQGK TISLLAPPSAESYFYSVKPCTRSSSPDLSSGALKKPATSPFHTLQGGQGNISRFTEKGIP GW >gi568815597f:99546318_99794059|GENSCAN_predicted_CDS_6|2349_bp atggcttctagatggaatcagcagcggaaaagaacaggaagagatgaagaagcaaaatca acaagaccagcaccagatccaggttctagaacaaagtatcctcagcccacacccaaaatg acacccccaaggagaggtcacgatgctgccccctggagttcatttgaatctgacacgcag ctgtacactggccaagataccagctcagaatgcttgcaacagtctagtttcaaatgtctg tctctttgttgctctttatatgccatggaaattaaagttgaaaaagacttgaagactgga gaaagtacagttctgtcttcaatacctctgccatcagatgactttaaaggtacaggaata aaagtttatgatgatgggcaaaagtcagtgtatgcagtaagttctaatcacagtgcagca tacaatggcaccgatggcctggcaccagttgaagtagaggaacttctaagacaagcctca gagagaaactctaaatccccaacagagtatcatgagcctgtatatgccaatcccttttac aggcctacaaccccacagagagaaacggtgacccctggaccaaactttcaagaaaggata aagattaaaactaatggactgggtattggtgtaaatgaatccatacacaatatgggcaat ggtctttcagaggaaaggggaaacaacttcaatcacatcagtcccattccgccagtgcct catccccgatcagtgattcaacaagcagaagagaagcttcacaccccgcaaaaaaggcta atgactccttgggaagaatcgaatgtcatgcaggacaaagatgcaccctctccaaagcca aggctgagccccagagagacaatatttgggaaatctgaacaccagaattcttcacccact tgtcaggaggacgaggaagatgtcagatataatatcgttcattccctgcctccagacata aatgatacagaaccggtgacaatgattttcatggggtatcagcaggcagaagacagtgaa gaagataagaagtttctgacaggatatgatgggatcatccatgctgagctggttgtgatt gatgatgaggaggaggaggatgaaggagaagcagagaaaccgtcctaccaccccatagct ccccatagtcaggtgtaccagccagccaaaccaacaccacttcctagaaaaagatcagaa gctagtcctcatgaaaacacaaatcataaatccccccacaaaaattccatatctctgaaa gagcaagaagaaagcttaggcagccctgtccaccattccccatttgatgctcagacaact ggagatgggactgaggatccatccttaacagactttcctcgtatttggatgaccttgaca gttctgaaaattgctgagtatgtattttgtaccagggtacaaaccatgggttataatcca acacaacttgctgctcaagttggtccagctttggctgttgaaagctcattcagtaggctc ctgtgcccctttgacatatccctaattctgcccaagatcaaattcacaaagcctgacatc tctattgaagatgtggcaaaaaagctggatgagatgtggaataacttaaatgacagtgaa aaggagccttaccttactgcattagtccagtctcatgctgctatgaagaaaaacctgaga tggggtgagattgagttggggtcacagagccaaaccatatcaatcaccaaggcagcaaag ctgaaagagaagtatgagaaggatgttgctgactacaagtctagaaagcttgatggtgca aaaggtcctgctaaagttgcccaaaaacaagtggaagaggaagatgaagaagacgaggag gaagatggtcaaaacaaaaagataccaatgttggacagcaagtcagggaaggagctagaa cttgataaactagcttttgcagacacacaagcagctggacgtggagaggaacacatcgca gaagacacaagcagctggacagcgagaggatgtggatgggagcacgccagaggaagagca cacgacagatcccagcatgccgacaggccacgactggcggaaggaggaacagttggcgga gtttggccggggcaatcagaggagagcccaggcagtctagcggcccaactccaggggaaa accatctcccttctggctcccccatctgctgagagctacttctactcagtaaaaccttgc actcgttcttcaagcccagatttgagcagtggggcactgaagaagccagccacatcccca ttccacaccctgcaagggggacaagggaacatttcccgtttcactgagaagggcattcca ggctggtaa >gi568815597f:99546318_99794059|GENSCAN_predicted_peptide_7|469_aa MAVSGFTLGTCILLLHISYVANYPNGKVTQSCHGMIPEHGHSPQSVPVHDIYVSQMTFRP GDQIEVTLSGHPFKGFLLEARNAEDLNGPPIGSFTLIDSEVSQLLTCEDIQGSAVSHRSA SKKTEIKVYWNAPSSAPNHTQFLVTVVEKYKIYWVKIPGPIISQPNAFPFTTPKATVVPL PTLPPVSHLTKPFSASDCGNKKFCIRSPLNCDPEKEASCVFLSFTRDDQSVMVEMSGPSK GYLSFALSHDQWMVGRIYKHSQQPLITYEKYDVTDSPKNIGGSHSVLLLKVHGALMFVAW MTTVSIGVLVARFFKPVWSKAFLLGEAAWFQVHRMLMFTTTVLTCIAFVMPFIYRGGWSR SISFHTSLLVVAAMFLGMDLPGLNLPDSWKTYAMTGFVAWHVGTEVVLEVHAYRLSRKVE ILDDDRIQILQSFTAVETEGHAFKKAVLAIYVCGNVTFLIIFLSAINHL >gi568815597f:99546318_99794059|GENSCAN_predicted_CDS_7|1410_bp atggcagtttctggatttactcttggtacctgcatacttctgttgcacattagttatgtg gctaattatcccaatggaaaagtaacacagtcatgccatggaatgattcctgaacatggt catagtccacagtctgttcctgttcatgacatttacgtgagtcagatgacattcaggcca ggagatcagattgaagttactttgtcagggcatccatttaaaggctttctcctagaagcg cgtaatgctgaggatctgaatggccctcctattggctccttcacattgattgacagtgaa gtgtcacaacttttgacctgtgaagatatacagggatcagcagtgagtcacagaagtgca tctaaaaaaacagaaattaaagtctactggaatgctccaagcagtgctccaaatcacaca cagtttctagtcacagttgttgagaagtataaaatctactgggtgaagattcctggtcct ataatttcacaaccaaatgcatttccttttacaacacctaaagctacagtagtacctttg ccaacgttacctcccgtttcccacttaaccaaaccattcagtgcctcagattgtgggaac aagaagttctgtattaggagtcctttgaactgtgacccagagaaggaggcttcctgtgtc ttcttgtccttcacaagagatgaccaatcggtgatggttgaaatgagcggccccagtaaa ggctatttatcctttgcattgtctcatgatcagtggatggttggtcgaatttacaagcac tctcagcaacctttgattacctatgaaaaatatgatgtgacagactctccaaagaacata ggaggatcccattctgtactccttctgaaggttcatggtgccttaatgtttgtggcatgg atgactactgttagcataggtgtactggttgcccggttcttcaagccagtttggtcaaaa gctttcttgcttggtgaagcagcttggtttcaggtgcatcggatgctcatgttcaccaca actgtcctcacctgcattgcttttgttatgccgtttatatacaggggaggctggagtagg tccatctccttccataccagcctcctggttgtggcagcgatgttcctgggaatggattta ccaggactgaatcttcctgattcatggaaaacctatgcaatgaccggattcgtagcctgg catgttgggactgaggttgttctggaggtacatgcttatcggctctctcgcaaagttgaa atattggatgatgacagaattcagatccttcagtcatttactgcagtggaaacagagggt catgcttttaaaaaggcagtgttggcaatttatgtctgtgggaatgttacttttctcatc atatttttatctgcaatcaaccatctatga >gi568815597f:99546318_99794059|GENSCAN_predicted_peptide_8|88_aa MIKMLNAPMDKVDGMQEQMHRDGNSQNQKEMLEIKNTLKEVKYGLVDIKVPGPGALKPQD LHQQSFPQPKALRPEQEVRYGFPRFLGL >gi568815597f:99546318_99794059|GENSCAN_predicted_CDS_8|267_bp atgattaagatgctaaatgctccaatggataaagtagatggcatgcaagaacaaatgcac agagatggaaattctcagaaccaaaaagaaatgctagagatcaaaaacactttaaaagaa gtgaagtatgggcttgtggacatcaaggttcctggtcctggggccttaaaaccacaggac ttacaccagcaatcctttccccaacccaaggccctcagacctgaacaggaggtgcgctat gggttcccccggttcttaggcctttag >gi568815597f:99546318_99794059|GENSCAN_predicted_peptide_9|97_aa MAQIYRRSNSTIHRVEAGSSEQLQSNPPPVAMVPRVFIKPLVPFRTLPLVTPYEGLACDQ PEAEVETWPAVNQKLKWKLLSSYHRSADVACILPNLA >gi568815597f:99546318_99794059|GENSCAN_predicted_CDS_9|294_bp atggcacagatttatcgaagaagcaatagtacaattcacagagtggaagcaggctcaagc gagcagctccagagcaacccccctccggttgcaatggtccccagggtttttataaagccc ttggtgccctttagaacccttccattggttacaccctatgaaggactggcctgcgaccaa ccagaggctgaagtggagacttggcctgcagtcaatcagaagctgaagtggaaacttctg tcttcttatcacaggagtgctgatgtggcctgtatactgcctaatcttgcctag