GENSCAN 1.0    Date run: 4-Nov-116    Time: 20:05:31

Sequence gi568815586f:118276692_118514430 : 237739 bp : 41.82% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   1455   1450    6                               1.05
 1.02 Term -   6229   5990  240  1  0   66   47   205 0.752   9.24
 1.01 Init -  18551  18549    3  2  0   98   95     0 0.066   1.75
 1.00 Prom -  63494  63455   40                              -1.75

 2.00 Prom +  64827  64866   40                              -5.45
 2.01 Init + 100001 100142  142  1  1   92  113   351 0.986  36.34
 2.02 Intr + 103471 103540   70  2  1   57   79   112 0.846   4.52
 2.03 Intr + 107321 107376   56  2  2  102   81    58 0.554   4.20
 2.04 Intr + 109423 109494   72  2  0   39  110    64 0.737   2.16
 2.05 Intr + 112504 112610  107  2  2   35   53   105 0.759   0.71
 2.06 Intr + 114435 114591  157  2  1   82   75   271 0.859  23.86
 2.07 Intr + 126721 126826  106  2  1   46   80   122 0.171   5.55
 2.08 Intr + 134382 134466   85  0  1   98   59    55 0.128   2.50
 2.09 Term + 137644 137742   99  0  0  120   38    86 0.234   3.95
 2.10 PlyA + 138285 138290    6                               1.05

 3.03 PlyA - 138462 138457    6                               1.05
 3.02 Term - 142987 142899   89  2  2   78   43   102 0.482   1.54
 3.01 Init - 146590 146545   46  1  1   62   98    45 0.389   3.80
 3.00 Prom - 146735 146696   40                              -3.75

 4.00 Prom + 158833 158872   40                              -5.45
 4.01 Init + 165905 165923   19  1  1   78  115    21 0.214   4.06
 4.02 Term + 176217 176302   86  1  2   70   54    96 0.450   1.24
 4.03 PlyA + 176614 176619    6                               1.05

 5.00 Prom + 177497 177536   40                              -3.65
 5.01 Init + 184276 184440  165  0  0   80  116    68 0.765   8.50
 5.02 Intr + 187625 187658   34  1  1   75  106    35 0.487   0.88
 5.03 Intr + 190961 191102  142  0  1   49   84    53 0.383  -0.51
 5.04 Intr + 192416 192574  159  2  0   61   87    63 0.698   1.68
 5.05 Term + 195310 195397   88  1  1   44   48   158 0.935   3.55
 5.06 PlyA + 195464 195469    6                               1.05

 6.08 PlyA - 197023 197018    6                               1.05
 6.07 Term - 201689 201362  328  1  1   46   50   132 0.634  -1.60
 6.06 Intr - 201911 201759  153  1  0   -5  109   137 0.504   4.77
 6.05 Intr - 205793 205567  227  2  2  126   32    64 0.264   0.46
 6.04 Intr - 210877 210394  484  0  1   97   80   207 0.831  12.90
 6.03 Intr - 211843 211774   70  2  1  133   83    52 0.964   6.52
 6.02 Intr - 215752 215666   87  2  0   51   90    51 0.572   0.62
 6.01 Init - 229258 229177   82  1  1   77  102    40 0.230   5.48
 6.00 Prom - 229934 229895   40                              -4.45

 7.02 PlyA - 231007 231002    6                               1.05
 7.01 Term - 237265 237110  156  1  0   76   52   112 0.534   3.35

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  96563  96508   56  2  2   91   55    61 0.923   3.01
S.002 Term + 116002 116168  167  2  2   70   48   103 0.847   1.60

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:118276692_118514430|GENSCAN_predicted_peptide_1|80_aa
MVEASVLPSAVDTAKGVQVDIPSSVAWQLSGGHVTLVNQILPPRTLNLASSGHSRDDCGS
CEDDKSPVEQNRMMMASTAS

>gi568815586f:118276692_118514430|GENSCAN_predicted_CDS_1|243_bp
atggtggaagccagtgttcttccttctgctgtggatacagcgaaaggagtacaggtcgat
attccatcttcagttgcctggcagctttcaggcgggcatgtgactttggtcaatcagata
ctcccacccagaactttgaatcttgcaagctcagggcacagcagagatgattgtggcagt
tgtgaagatgataagagtccagtagagcagaatcgaatgatgatggcatcaacagcatcc
taa

>gi568815586f:118276692_118514430|GENSCAN_predicted_peptide_2|297_aa
MSAAGLLAPAPAQAGAPPAPEYYPEEDEELESAEDDERSCRGRESDEDTEDASETDLAKH
DEEDYVEMKEQMYQDKLASLKRQLQQLQEGTLQEYQKRMKKLDQQYKERIRNAGCGPLGC
FPPPAACPPTLPQPVGELSSTELVPGAKKTEQVERNYIKEKKAAVKEFEDKKVELKENLI
AELEEKKKMIENEKLTMELTGASPSSPEHLPATPAESPAQRFEARIEDGKLYYDKRWYHK
SQAIYLESKDNQKLSCVISSVGANEIWVRKTSDSTKMRIYLGQLQRGLFVIRRRSAA

>gi568815586f:118276692_118514430|GENSCAN_predicted_CDS_2|894_bp
atgagtgccgcggggctgctggccccggccccggcccaggctggagcgccgccggccccc
gagtactaccccgaggaggatgaagagctggagagcgccgaggacgacgagcgcagctgt
cggggccgcgagtcggacgaagacactgaggatgctagtgaaactgacctggcaaagcat
gatgaagaagactatgtagaaatgaaggaacagatgtatcaggacaaactggcttctctc
aagaggcagttgcaacaactgcaagaaggtacattacaggaatatcagaagagaatgaaa
aaactagatcagcagtacaaagagaggatacggaatgcaggctgcggaccactgggctgc
tttcccccacctgccgcctgcccccctaccctgccccagcccgtgggagaattgtcttcc
accgaactggtccctggtgccaaaaagactgaacaagtggaacgaaattacattaaagaa
aagaaggcagcagtgaaagaatttgaagacaagaaggttgagctgaaagagaacctgatt
gctgagctagaagaaaagaagaaaatgattgaaaatgaaaagctgacaatggaactgact
ggagcatctccatcctctcctgagcacttgcctgcaacacccgcggaatctccagcccag
aggttcgaagctcggatagaagatggcaaactgtactatgacaaaagatggtaccacaag
agccaggccatctatctggagtcaaaggacaaccagaaactgagctgcgtgatcagttct
gtaggagccaatgagatctgggtgaggaagacaagtgacagcaccaagatgaggatctac
ctgggccagcttcagcgcgggctcttcgtgatccgccggcgctcagctgcttga

>gi568815586f:118276692_118514430|GENSCAN_predicted_peptide_3|44_aa
MFERRGLIKVGERESVPLQEGMPWKGPVTSEHQDAEGPTPYRHF

>gi568815586f:118276692_118514430|GENSCAN_predicted_CDS_3|135_bp
atgtttgagagaagaggactgatcaaagtaggagaaagggaatcagtgcctctacaggag
ggcatgccatggaaaggtccagtgacatcagagcatcaagatgctgaaggcccaactcca
taccgacatttttag

>gi568815586f:118276692_118514430|GENSCAN_predicted_peptide_4|34_aa
MKQLAQATVTVDAPVRTQAMGAEPRPPTVDMECE

>gi568815586f:118276692_118514430|GENSCAN_predicted_CDS_4|105_bp
atgaagcaactcgcccaagccacagtgacagttgatgctccagttagaactcaggcaatg
ggagcagagccccggcctcctacagtggacatggagtgtgagtga

>gi568815586f:118276692_118514430|GENSCAN_predicted_peptide_5|195_aa
MGTGRKWSRETGQMTLTLSEGRVWEGFEKRFRGGKVWLRKCLDLGIRKRKEVKVKVFGKQ
VVFGYMVPLQLPWENYQASMSDGERPVLWSQVVPVMIQARAILDQPTARQPPDMLCMTTL
NPWVDTNHSASSIMIMAHNYIKLIRCRYCPKCFTRINSFSLHKNPMRGCGPVETTSAFSF
VTVDDNYQIINRSVT

>gi568815586f:118276692_118514430|GENSCAN_predicted_CDS_5|588_bp
atggggacaggaaggaagtggagcagggagactggccagatgacactgacattgtcagag
ggaagagtatgggaaggatttgagaaacgtttcagaggtggaaaggtttggctcaggaag
tgcctggatttagggataagaaagagaaaggaggtgaaagtgaaggtttttgggaaacag
gtggtgtttggttacatggttcctctgcaattgccatgggaaaattaccaggctagcatg
tcagacggtgagaggccagtcttgtggagccaagttgtcccagtcatgatccaagctagg
gccatcctagatcagccaacagctagacaaccaccagacatgctctgtatgactacacta
aacccatgggtggacaccaatcattctgcttcatcaatcatgataatggcacacaattac
ataaagcttattagatgccgatactgtcctaaatgctttacacgtattaactcatttagt
cttcacaaaaaccctatgaggggatgtggccccgtggaaacaacaagtgcgttcagcttt
gtgactgtggatgataattatcaaattattaaccggagcgttacgtga

>gi568815586f:118276692_118514430|GENSCAN_predicted_peptide_6|476_aa
MKPNTSCEHIQGTRYNFVLGPTDRDREGHAKVSLNWIGKALANLTSLGYKNEIGVRAPVQ
EALCQFQPEHPTLDTNPAVWEFLHITQSFNLTWYDIYIILTSTLTSDEKECIWHSAETHA
DELHNQAPIQNPVANGAVPCRDPDWTYQQGDNGISRRDHMVTCLLAVMDKSAHKAVNYEK
LREITQEPQENPALFLSCLTDAMLKYTNLYPESKEDQTFLHLQFISICPRNQEKKLQNLG
QDPLPFTSPESWRKKKTNPRKPLMSLSLPNFHHFLTNLVTDLQWYPYETPITHPEQLLTV
LWDLWLQGTFQDLKLPKVSASLQDEIGSRKKGSPEITKPPADAHKKARQPNPRASAEMGR
SLQIAAHSLANQSSLLGKRRALRRNCEEKNSNRTRTVGQRKEKVRTQAEEGSSERTHLKV
LGHIFDYLVKTTEDESLETGSLISYPSMISPQKSIGNLTSVQNKSKSVTVKSHPKF

>gi568815586f:118276692_118514430|GENSCAN_predicted_CDS_6|1431_bp
atgaagccaaacacttcatgtgaacatattcagggaacacgctacaactttgtgctgggg
cctacagacagagacagagaaggacatgccaaggtatcgttaaactggattggcaaggca
ttagccaatctcacatctctgggctacaaaaatgagattggggttagagctccagtccag
gaggccctttgccaattccaaccagaacatccaacattggacactaatccagccgtgtgg
gaatttctacacataacccaatcctttaatttaacttggtatgacatttatataattcta
acctccaccctcacttctgatgaaaaagagtgcatctggcattcagctgaaacacatgca
gatgaactccataatcaagcccctatacaaaatccagtggccaatggtgctgtcccctgt
agagacccagattggacttaccaacagggagacaatggcatcagccgaagagaccacatg
gttacctgtctgctcgctgtcatggacaaaagtgctcataaggcagttaactatgaaaaa
cttagagaaattacacaagagccccaggaaaatcctgcccttttcttatcatgcctcact
gacgctatgttaaaatataccaatttgtacccagaatctaaagaagatcaaacttttctc
caccttcagtttatttcaatctgccccagaaatcaggaaaagaaattacaaaacttaggc
caggacccacttcccttcacctcacctgaatcctggaggaaaaagaagacaaatccacgt
aagccacttatgtctctttctctcccaaactttcatcacttcctaaccaaccttgttaca
gacctccagtggtacccttacgaaactcccattacacatcctgagcaactccttactgtt
ctatgggacctatggcttcaaggaactttccaggacttgaagttacccaaagtatcagca
tctttgcaagacgaaatagggagtagaaaaaagggctcccctgagatcaccaaaccacca
gctgatgctcataagaaagcaaggcagcccaacccgagagcatcagctgaaatggggagg
agtctgcagattgcagctcacagtctagcaaatcaaagctcccttctagggaaaaggcgt
gcactgaggagaaactgtgaagagaagaattcaaacaggacaaggacagtggggcagagg
aaagagaaggtccggacacaagcagaagagggaagtagtgaaagaacacatctgaaagta
ttaggccatatttttgattacttagtaaaaacaactgaagatgaatctctagagacagga
agtttgataagttatcctagtatgatttctccacagaaaagtataggaaatctcacttca
gtacaaaacaagtcaaaaagtgtcactgtcaaatctcatccaaaattctga

>gi568815586f:118276692_118514430|GENSCAN_predicted_peptide_7|51_aa
TCTPSPTVKEHSDPMGHTCETTDDFSIASSAEVHFSASHRDGPGCISTALA

>gi568815586f:118276692_118514430|GENSCAN_predicted_CDS_7|156_bp
acttgcactcccagccccactgtcaaagaacacagtgaccccatgggacatacatgtgag
actacagatgacttttccatcgccagctcagctgaagtgcatttctctgcatcacacaga
gatggccctggctgtatcagcacagctctcgcttga