GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:04:36 Sequence gi568815596f:202165942_202403057 : 237116 bp : 43.14% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5467 5575 109 0 1 38 93 153 0.807 9.21 1.02 Intr + 13354 13569 216 2 0 11 8 187 0.086 1.48 1.03 Intr + 18812 18902 91 0 1 96 84 44 0.534 3.85 1.04 Intr + 20992 21157 166 1 1 43 84 12 0.353 -3.74 1.05 Intr + 22211 22325 115 1 1 80 91 124 0.965 11.92 1.06 Intr + 24233 24552 320 0 2 103 97 60 0.815 4.08 1.07 Intr + 27360 27562 203 2 2 42 94 184 0.795 12.48 1.08 Intr + 28249 28421 173 1 2 68 93 217 0.999 19.79 1.09 Term + 30859 31109 251 2 2 117 50 317 0.939 26.67 1.10 PlyA + 31597 31602 6 1.05 2.06 PlyA - 32608 32603 6 1.05 2.05 Term - 35852 35439 414 2 0 74 55 268 0.619 17.36 2.04 Intr - 44865 44794 72 0 0 82 81 33 0.609 1.60 2.03 Intr - 48493 48416 78 1 0 96 79 33 0.759 2.95 2.02 Intr - 54165 54091 75 0 0 59 109 63 0.881 5.21 2.01 Init - 57942 57895 48 0 0 70 93 -7 0.541 -1.05 2.00 Prom - 66937 66898 40 -7.86 3.00 Prom + 68231 68270 40 -2.96 3.01 Init + 72509 72557 49 1 1 77 102 85 0.603 8.02 3.02 Intr + 72609 72849 241 1 1 100 29 202 0.988 12.11 3.03 Term + 73582 73771 190 1 1 7 54 182 0.956 3.62 3.04 PlyA + 74180 74185 6 1.05 4.00 Prom + 88516 88555 40 -5.86 4.01 Init + 100001 100045 45 1 0 74 100 39 0.898 4.40 4.02 Intr + 109172 109248 77 1 2 64 101 56 0.952 2.81 4.03 Intr + 112009 112061 53 1 2 95 103 37 0.981 4.45 4.04 Intr + 116410 116531 122 2 2 48 98 111 0.966 8.31 4.05 Intr + 118404 118540 137 2 2 83 94 -17 0.600 -2.23 4.06 Intr + 121719 121783 65 0 2 92 78 41 0.651 1.86 4.07 Intr + 124382 124516 135 0 0 62 97 63 0.970 5.14 4.08 Intr + 125184 125329 146 1 2 60 99 123 0.999 10.60 4.09 Intr + 126836 126962 127 1 1 84 64 139 0.995 11.35 4.10 Intr + 129733 129896 164 2 2 94 93 13 0.973 2.09 4.11 Intr + 131438 131572 135 1 0 39 78 102 0.965 5.06 4.12 Intr + 131904 131965 62 2 2 74 113 47 0.905 3.33 4.13 Intr + 134293 134426 134 1 2 40 96 133 0.907 9.59 4.14 Intr + 136980 137116 137 1 2 45 34 205 0.909 11.09 4.15 Term + 169990 170130 141 0 0 82 36 111 0.744 3.23 4.16 PlyA + 171235 171240 6 1.05 5.00 Prom + 179597 179636 40 -8.26 5.01 Init + 181836 181888 53 2 2 56 84 64 0.729 3.43 5.02 Term + 187950 188058 109 0 1 92 45 116 0.829 5.68 5.03 PlyA + 190672 190677 6 1.05 6.04 PlyA - 190855 190850 6 1.05 6.03 Term - 193952 193937 16 1 1 110 32 12 0.020 -4.49 6.02 Intr - 206450 206357 94 0 1 74 101 75 0.373 6.42 6.01 Init - 210654 210489 166 0 1 51 63 145 0.659 7.20 6.00 Prom - 235302 235263 40 -1.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:202165942_202403057|GENSCAN_predicted_peptide_1|547_aa MASIAPLRVPAASERLLPSASLLREVVPALQKELLADGGARYGFSLTTFSPSDKLVRIEY ALAAVAGGALSVGIKAANGVVLATEKKQKFILYDERSVDKVEPITKHIDPEPRSMTLDSP RASRTEHIQTPEADIVQKVGRDYDVHHLHRGLLGYGPESPERLSAVYTSLLPREREGKAE PRLFSQETSANISHERDLINEAKRKEKPKKDKTKGPKSEREGKVYGQAEAAIGKSKDSKA KKKLEKKTRPQRKRTQKERNLEIAAELSGPDVSYEETEDTSNRGSFASDSFVEDPWLSPK YDAQESQVSLDGRSSPSQIATVTGNMESKEERRCEDPSKALLTKREQEKASWDRLRAERA EMRWLEVEKKRREQEEQRQLQQEQLERAKKMEEELELEQQRRTEEIRLRKQRLQEEQQRQ EEEERKQQLRLKAAQERARQQQEEFRRKLRELQRKKQQEEAERAEAEKQRQEELEMQLEE EQKHLMEMAEEERLEYQRRKQEAEEKARLEAEERRQKEEEAARLALEEATKQAQEQARYW IFGQQLP >gi568815596f:202165942_202403057|GENSCAN_predicted_CDS_1|1644_bp atggcttcaatagcgccgctccgggtccccgcggcgtcggagaggcttctgcctagcgcc tctctgctgcgcgaggtcgtcccggccttacaaaaggagctcctcgctgatggcggagcg cggtacggcttttcgctgactacattcagcccgtctgataaacttgtccggattgaatat gctttggctgctgtagctggaggagccctgtcagtgggaattaaagcggcaaatggtgtg gtattagcaaccgagaaaaaacagaaattcattctgtatgatgagcgaagtgtagacaaa gtggaaccaattaccaagcacatagacccagagccaaggagtatgacccttgactctccc agggcttcccggactgagcacatccagaccccagaagcagatattgtgcaaaaagtgggc agagattatgatgtacaccacctacacagaggacttctgggatacgggcctgagtcaccc gagaggttgagtgctgtgtatacatctcttcttccaagagaaagagaagggaaggctgaa ccaagactgtttagccaggagacatcagccaacatcagtcatgagagggatttgattaac gaggccaagagaaaggaaaaacccaagaaagacaaaaccaaaggacccaaaagcgagaga gaaggaaaggtctacgggcaagcagaggctgccattggaaagtcaaaggactcaaaggct aaaaaaaaattagaaaaaaaaacaagaccccaaaggaaaaggacacagaaggaaagaaat ctggagatagcggcagagctgagcgggcctgatgtcagctatgaggaaacagaagacacc tcaaatagaggttcctttgcctcagactcctttgtagaggacccttggctttctcccaaa tatgatgcccaggaaagccaagtttctctagatggaagatcatcaccctctcagattgca actgtcactggcaacatggaatctaaagaagagagaagatgtgaggacccttccaaggcc ctcctcactaagagggagcaggagaaggcttcctgggacaggcttcgagcagaaagagcc gagatgaggtggctggaggtggagaagaagagaagggagcaggaagagcaaaggcagctc cagcaggagcagctggagagagcaaaaaagatggaggaggagctggagctggagcagcag agacgtacagaagagatccgcttgaggaaacagagactccaagaagaacagcagcggcag gaggaggaggagagaaagcagcagctccggttgaaagcagcccaggagagagcccggcaa cagcaagaggagtttcggaggaaactgcgagaactacagagaaaaaagcagcaggaggaa gccgagagggccgaggcagagaagcaaaggcaagaggaattggaaatgcagttagaagaa gaacaaaaacacctgatggaaatggctgaagaggaacgactggagtaccagcggcggaaa caggaagcagaagagaaggctcggctggaggcagaggagaggaggcaaaaagaagaggaa gcagcaagactggctctggaagaagccacgaaacaagcccaggaacaagccaggtactgg atatttgggcaacagttgccatag >gi568815596f:202165942_202403057|GENSCAN_predicted_peptide_2|228_aa MRREPNLGSQRWRVFKEAKPSTEDLGDKKEGEYIKLKVIGQDSSEIHFKVKMTTHLKKLK ESYCQRQGVPMNSLRFLFEGQRIADNHTPKEALPYTDAPTAYSELYHPSFVHPRGCHSPH HVGRFPGASLYLPMAQSVAVGSLGSTIPMAYYPVGPIYPPGSTVLVEGGYDAGARFGAGA TAGNILPPPPGCPPSAAQLAVMQAANVLVTQWKGNFFMGESDGGYTIW >gi568815596f:202165942_202403057|GENSCAN_predicted_CDS_2|687_bp atgaggagagaacctaatcttggcagccaaaggtggagagttttcaaggaggcaaaacct tcaactgaggacttgggggataagaaggaaggtgaatatattaaactcaaagtcattgga caggatagcagtgagattcacttcaaagtgaaaatgacaacacatctcaagaaactcaaa gaatcatactgtcaaagacagggtgttccaatgaattcactcaggtttctctttgagggt cagagaattgctgataatcatactccaaaagaagctctaccctatactgatgctccgact gcctactcagagctctatcatccaagctttgtgcaccccaggggctgccacagtccccac catgtcggccgctttcctggagcctctctgtatcttcccatggcccagtctgtggctgtt gggtctttaggttccacaatccccatggcttattatccagttggtcccatctatccacct ggctccacagtgctggtggaaggcgggtatgatgcaggtgccagatttggagctggggct actgctggcaacattcttcctccacctcctggatgccctcccagtgctgctcagcttgca gtcatgcaggcagccaatgtcctcgtaactcagtggaaggggaacttcttcatgggtgaa tcagatggtggctacaccatctggtga >gi568815596f:202165942_202403057|GENSCAN_predicted_peptide_3|159_aa MMTVASPGGLRTAAASAQHRGCSNFRYASRHFRFAQGGGRERREERQGGKWVGEQPRGGA GCGAALPVVARVNGSFVSSFKDQVLRISTRAIPQEQRAAAALCALLALPTNIMLQFLLGF TLGNVVGMYLAQNYDIPNLAKKLEEIKKDLDAKKKPPTS >gi568815596f:202165942_202403057|GENSCAN_predicted_CDS_3|480_bp atgatgacggtggcttcacccgggggtctccgcacagcagcggcctcggcacaacaccgc ggctgcagtaacttccgctacgcctcgcgtcacttccgcttcgcgcagggaggagggaga gagagaagagaggaaagacaaggcgggaaatgggtgggggagcagccaaggggaggggca ggctgtggagctgctttgccggtggtcgcgcgggtgaacggcagttttgtcagctccttc aaggaccaggtcctgcgcatcagtactagggccatcccccaggagcaaagagctgcagca gccctttgcgccctcctcgccctccccaccaacatcatgctccaattcctgcttggattt acattgggcaacgtggttggaatgtatctggctcagaactatgacataccaaacctggct aaaaaacttgaagaaattaaaaaggacttggatgccaagaagaaaccccctacttcatga >gi568815596f:202165942_202403057|GENSCAN_predicted_peptide_4|559_aa MLVLFETSVGYAIFKVLNEKKLQEVDSLWKEFETPEKANKIVKLKHFEKFQDTAEALAAF TALMEGKINKQLKKVLKKIVKEAHEPLAVADAKLGGVIKEKLNLSCIHSPVVNELMRGIR SQMDGLIPGVEPREMAAMCLGLAHSLSRYRLKFSADKVDTMIVQAISLLDDLDKELNNYI MRCREWYGWHFPELGKIISDNLTYCKCLQKVGDRKNYASAKLSELLPEEVEAEVKAAAEI SMGTEVSEEDICNILHLCTQVIEISEYRTQLYEYLQNRMMAIAPNVTVMVGELVGARLIA HAGSLLNLAKHAASTVQILGAEKALFRALKSRRDTPKYGLIYHASLVGQTSPKHKGKISR MLAAKTVLAIRYDAFGEDSSSAMGVENRAKLEARLRTLEDRGIRKISGTGKALAKTEKYE HKSEVKTYDPSGDSTLPTCSKKRKIEQVDKEDEITEKKAKKAKIKVKVEEEEEEKVAEEE ETSVKKKKKRGKKKHIKEEPLSEEEPCTSTAIAEAHGEAQTSLGMKPWQSDKACFLQATY SAMNERDDVTTLFRHNGFI >gi568815596f:202165942_202403057|GENSCAN_predicted_CDS_4|1680_bp atgttggtgctgtttgaaacgtctgtgggttacgccatctttaaggttctaaatgagaag aaacttcaagaggttgatagtttatggaaagaatttgaaactccagagaaagcaaacaaa atagtaaagctaaaacattttgagaaatttcaggatacagcagaagcattagcagcattc acagctctgatggagggcaaaatcaataagcagctgaaaaaagttctgaagaaaatagta aaagaagcccatgaaccgctggcagtagctgatgctaaactaggaggggtcataaaggaa aagctgaatctcagttgtatccatagtcctgttgttaatgaacttatgagaggaattcgt tcacaaatggatggattaatccctggggtagaaccacgtgaaatggcagctatgtgtctt ggattggctcacagcctgtctcgatatagattgaagtttagcgctgataaagtagacaca atgattgttcaggcaatttccttgttagatgacttggataaagaactaaacaactacatt atgcgatgtagagaatggtatggctggcatttccctgaattaggaaaaattatttcagat aatttaacatactgcaagtgtttacagaaagttggcgataggaagaactatgcctctgcc aagctttctgagttgctgccagaagaagttgaagcagaagtgaaagcagctgcagagata tcaatgggaacagaggtttcagaagaagatatttgcaatattctgcatctttgcacccag gtgattgaaatctctgaatatcgaacccagctctatgaatatctacaaaatcgaatgatg gccattgcacccaatgttacagtcatggttggggaattagttggagcacggcttattgct catgcaggttctcttttaaatttggccaagcatgcagcttctaccgttcagattcttgga gctgaaaaggcacttttcagagccctcaaatctagacgggatacccctaagtatggtctc atttatcatgcttcactcgtgggccagacaagtcccaaacacaaaggaaagatttctcga atgctggcagccaaaaccgttttggctatccgttatgatgcttttggtgaggattcaagt tctgcaatgggagttgagaacagagccaaattagaggccaggttgagaactttggaagac agagggataagaaaaataagtggaacaggaaaagcattagcaaaaacagaaaaatatgaa cacaaaagtgaagtgaagacttacgatccttctggtgactccacacttccaacctgttct aaaaaacgcaaaatagaacaggtagataaagaggatgaaattactgaaaagaaagccaaa aaagccaagattaaagttaaagttgaagaagaggaagaagaaaaagtggcagaagaagaa gaaacatctgtgaagaagaagaagaaaaggggtaaaaagaaacacattaaggaagaacca ctttctgaggaagaaccatgtaccagcacagcaattgctgaagctcatggggaagcacag acatccttgggtatgaagccttggcaatcagacaaggcttgtttcctgcaggccacttac tcagccatgaatgagagagatgatgtcaccactctcttccgccataatggcttcatctag >gi568815596f:202165942_202403057|GENSCAN_predicted_peptide_5|53_aa MTAQTEVQSEGQYRPPDIPNAEILCDIETSFGTLMDSLPLIIRFIIVVAVTVA >gi568815596f:202165942_202403057|GENSCAN_predicted_CDS_5|162_bp atgacagcacaaactgaggtccagtctgaggggcagtatcgaccaccagacatccctaat gctgaaatcctttgtgatatagagaccagctttggcactctaatggattctttaccattg attatccgtttcatcatcgtggttgctgtaactgttgcatag >gi568815596f:202165942_202403057|GENSCAN_predicted_peptide_6|91_aa MAPVSESQPPEEAAAAAAAAAAAAAAAASFSRSSGLLTELGRRGPRAGGGGRADEVGTWL FTHSEKEHQRTELRLGKKPVHDKRQTMLKPA >gi568815596f:202165942_202403057|GENSCAN_predicted_CDS_6|276_bp atggctcctgtgagcgaatcacaaccccccgaggaagccgctgctgctgccgccgccgcc gccgccgccgccgccgccgccgcctcgttctcccgcagctctgggctcctgacggaactc gggcgtcgcggaccgagggcggggggcggcgggagggcggatgaagttgggacatggctg ttcactcacagtgagaaagaacatcaaagaacagaattaagacttggcaagaaacctgtc cacgacaaaaggcagacaatgctaaagccagcataa