GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:55:00

Sequence gi568815596r:202107256_202314436 : 207181 bp : 43.30% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Intr +   2333   2534  202  1  1  111   84    90 0.619   9.76
1.02  Intr +   6081   6191  111  1  0   50  121    29 0.209   2.75
1.03  Intr +  17935  18027   93  2  0   87  121    -2 0.385   2.94
1.04  Intr +  28922  29047  126  0  0   29  101   107 0.734   6.75
1.05  Intr +  31177  31253   77  2  2   89   80    36 0.461   2.13
1.06  Intr +  47418  47555  138  2  0   93   53   140 0.644  11.66
1.07  Intr +  58029  58101   73  2  1   58   87    67 0.171   2.58
1.08  Intr +  64100  64261  162  0  0   35   93   110 0.234   6.15
1.09  Intr +  72040  72255  216  2  0   11    8   187 0.086   1.48
1.10  Intr +  77498  77588   91  0  1   96   84    44 0.534   3.85
1.11  Intr +  79678  79843  166  1  1   43   84    12 0.353  -3.74
1.12  Intr +  80897  81011  115  1  1   80   91   124 0.965  11.92
1.13  Intr +  82919  83238  320  0  2  103   97    60 0.815   4.08
1.14  Intr +  86046  86248  203  2  2   42   94   184 0.795  12.48
1.15  Intr +  86935  87107  173  1  2   68   93   217 0.999  19.79
1.16  Term +  89545  89795  251  2  2  117   50   317 0.939  26.67
1.17  PlyA +  90283  90288    6                               1.05

2.06  PlyA -  91294  91289    6                               1.05
2.05  Term -  94538  94125  414  2  0   74   55   268 0.619  17.36
2.04  Intr - 103551 103480   72  0  0   82   81    33 0.609   1.60
2.03  Intr - 107179 107102   78  1  0   96   79    33 0.759   2.95
2.02  Intr - 112851 112777   75  0  0   59  109    63 0.881   5.21
2.01  Init - 116628 116581   48  0  0   70   93    -7 0.541  -1.05
2.00  Prom - 125623 125584   40                              -7.86

3.00  Prom + 126917 126956   40                              -2.96
3.01  Init + 131195 131243   49  1  1   77  102    85 0.603   8.02
3.02  Intr + 131295 131535  241  1  1  100   29   202 0.988  12.11
3.03  Term + 132268 132457  190  1  1    7   54   182 0.956   3.62
3.04  PlyA + 132866 132871    6                               1.05

4.00  Prom + 147202 147241   40                              -5.86
4.01  Init + 158687 158731   45  1  0   74  100    39 0.898   4.40
4.02  Intr + 167858 167934   77  1  2   64  101    56 0.952   2.81
4.03  Intr + 170695 170747   53  1  2   95  103    37 0.981   4.45
4.04  Intr + 175096 175217  122  2  2   48   98   111 0.966   8.31
4.05  Intr + 177090 177226  137  2  2   83   94   -17 0.600  -2.23
4.06  Intr + 180405 180469   65  0  2   92   78    41 0.651   1.86
4.07  Intr + 183068 183202  135  0  0   62   97    63 0.970   5.14
4.08  Intr + 183870 184015  146  1  2   60   99   123 0.999  10.60
4.09  Intr + 185522 185648  127  1  1   84   64   139 0.995  11.35
4.10  Intr + 188419 188582  164  2  2   94   93    13 0.973   2.09
4.11  Intr + 190124 190258  135  1  0   39   78   102 0.965   5.06
4.12  Intr + 190590 190651   62  2  2   74  113    47 0.905   3.33
4.13  Intr + 192979 193112  134  1  2   40   96   133 0.907   9.59
4.14  Intr + 195666 195802  137  1  2   45   34   205 0.909  11.09
4.15  Term + 196131 196181   51  2  0   67   42    57 0.429  -3.57
4.16  PlyA + 196383 196388    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:202107256_202314436|GENSCAN_predicted_peptide_1|838_aa
AFSPFVFKDDDAPPHDVAPPLDLLPPIKGKKSPESQKGVDSPRTSDHNSPPSLPNMRVPR
RALPAAQEDSSDPTLGHFLLGPDGEKVCLSLPGHTQTEALPSGKGKISYCFSAYESVNSN
ISHEEEGPSSQHFLKGVKLQTFMVSVTGLKGSTSGVVRSFRWVRGLTGLKSEAADLRANT
EPRANLHMNLYETSPLTQTTEKQGAQQSLEAAAQKTGEPQSCINKALICSNRKEFYTRKL
HIDMTPFLKESGNALDYQEEAGRPLRETHHNDQGAPQQARWEQLLSGPAAFMASIAPLRV
PAASERLLPSASLLREVVPALQKELLADGGARYGFSLTTFSPSDKLVRIEYALAAVAGGA
LSVGIKAANGVVLATEKKQKFILYDERSVDKVEPITKHIDPEPRSMTLDSPRASRTEHIQ
TPEADIVQKVGRDYDVHHLHRGLLGYGPESPERLSAVYTSLLPREREGKAEPRLFSQETS
ANISHERDLINEAKRKEKPKKDKTKGPKSEREGKVYGQAEAAIGKSKDSKAKKKLEKKTR
PQRKRTQKERNLEIAAELSGPDVSYEETEDTSNRGSFASDSFVEDPWLSPKYDAQESQVS
LDGRSSPSQIATVTGNMESKEERRCEDPSKALLTKREQEKASWDRLRAERAEMRWLEVEK
KRREQEEQRQLQQEQLERAKKMEEELELEQQRRTEEIRLRKQRLQEEQQRQEEEERKQQL
RLKAAQERARQQQEEFRRKLRELQRKKQQEEAERAEAEKQRQEELEMQLEEEQKHLMEMA
EEERLEYQRRKQEAEEKARLEAEERRQKEEEAARLALEEATKQAQEQARYWIFGQQLP

>gi568815596r:202107256_202314436|GENSCAN_predicted_CDS_1|2517_bp
gctttttccccttttgtttttaaagatgatgatgccccacctcacgatgtggccccacca
ttggatcttctacccccgattaaaggaaaaaaaagtcctgagagccagaagggcgtggac
agccctaggacatcagaccacaacagccccccaagtctcccgaacatgagagtgcccagg
agggcactgccagcagctcaagaagattccagcgaccctacactgggacacttcttgctg
ggtccagatggagaaaaagtctgcctgtccctcccagggcatacccaaaccgaggcgctt
ccatcgggtaaaggtaaaatatcttactgtttttcagcctatgaatctgtcaattcaaat
atcagccatgaagaggaagggcctagtagtcagcatttcctaaaaggagtgaagctgcag
accttcatggtgagtgttacaggtcttaaaggcagcacgtctggagttgttcgttctttc
cggtgggttcgtggtcttactggcctcaagagtgaagctgcagaccttcgcgcaaacact
gaacctagagccaatcttcacatgaacctttatgaaacctcaccgttgacccaaacaaca
gagaagcagggagcccagcagtccttggaggcagcagctcagaagacaggagagcctcaa
agttgtataaataaagcgctgatatgttcaaacagaaaagaattttacacgcgcaagctg
cacatcgacatgacgccgttcctgaaggaaagtggaaatgcactggactatcaggaagaa
gcagggagacccctcagagaaactcaccacaacgaccaaggggccccgcagcaggcgcgc
tgggagcagcttctctcggggcctgccgcgtttatggcttcaatagcgccgctccgggtc
cccgcggcgtcggagaggcttctgcctagcgcctctctgctgcgcgaggtcgtcccggcc
ttacaaaaggagctcctcgctgatggcggagcgcggtacggcttttcgctgactacattc
agcccgtctgataaacttgtccggattgaatatgctttggctgctgtagctggaggagcc
ctgtcagtgggaattaaagcggcaaatggtgtggtattagcaaccgagaaaaaacagaaa
ttcattctgtatgatgagcgaagtgtagacaaagtggaaccaattaccaagcacatagac
ccagagccaaggagtatgacccttgactctcccagggcttcccggactgagcacatccag
accccagaagcagatattgtgcaaaaagtgggcagagattatgatgtacaccacctacac
agaggacttctgggatacgggcctgagtcacccgagaggttgagtgctgtgtatacatct
cttcttccaagagaaagagaagggaaggctgaaccaagactgtttagccaggagacatca
gccaacatcagtcatgagagggatttgattaacgaggccaagagaaaggaaaaacccaag
aaagacaaaaccaaaggacccaaaagcgagagagaaggaaaggtctacgggcaagcagag
gctgccattggaaagtcaaaggactcaaaggctaaaaaaaaattagaaaaaaaaacaaga
ccccaaaggaaaaggacacagaaggaaagaaatctggagatagcggcagagctgagcggg
cctgatgtcagctatgaggaaacagaagacacctcaaatagaggttcctttgcctcagac
tcctttgtagaggacccttggctttctcccaaatatgatgcccaggaaagccaagtttct
ctagatggaagatcatcaccctctcagattgcaactgtcactggcaacatggaatctaaa
gaagagagaagatgtgaggacccttccaaggccctcctcactaagagggagcaggagaag
gcttcctgggacaggcttcgagcagaaagagccgagatgaggtggctggaggtggagaag
aagagaagggagcaggaagagcaaaggcagctccagcaggagcagctggagagagcaaaa
aagatggaggaggagctggagctggagcagcagagacgtacagaagagatccgcttgagg
aaacagagactccaagaagaacagcagcggcaggaggaggaggagagaaagcagcagctc
cggttgaaagcagcccaggagagagcccggcaacagcaagaggagtttcggaggaaactg
cgagaactacagagaaaaaagcagcaggaggaagccgagagggccgaggcagagaagcaa
aggcaagaggaattggaaatgcagttagaagaagaacaaaaacacctgatggaaatggct
gaagaggaacgactggagtaccagcggcggaaacaggaagcagaagagaaggctcggctg
gaggcagaggagaggaggcaaaaagaagaggaagcagcaagactggctctggaagaagcc
acgaaacaagcccaggaacaagccaggtactggatatttgggcaacagttgccatag

>gi568815596r:202107256_202314436|GENSCAN_predicted_peptide_2|228_aa
MRREPNLGSQRWRVFKEAKPSTEDLGDKKEGEYIKLKVIGQDSSEIHFKVKMTTHLKKLK
ESYCQRQGVPMNSLRFLFEGQRIADNHTPKEALPYTDAPTAYSELYHPSFVHPRGCHSPH
HVGRFPGASLYLPMAQSVAVGSLGSTIPMAYYPVGPIYPPGSTVLVEGGYDAGARFGAGA
TAGNILPPPPGCPPSAAQLAVMQAANVLVTQWKGNFFMGESDGGYTIW

>gi568815596r:202107256_202314436|GENSCAN_predicted_CDS_2|687_bp
atgaggagagaacctaatcttggcagccaaaggtggagagttttcaaggaggcaaaacct
tcaactgaggacttgggggataagaaggaaggtgaatatattaaactcaaagtcattgga
caggatagcagtgagattcacttcaaagtgaaaatgacaacacatctcaagaaactcaaa
gaatcatactgtcaaagacagggtgttccaatgaattcactcaggtttctctttgagggt
cagagaattgctgataatcatactccaaaagaagctctaccctatactgatgctccgact
gcctactcagagctctatcatccaagctttgtgcaccccaggggctgccacagtccccac
catgtcggccgctttcctggagcctctctgtatcttcccatggcccagtctgtggctgtt
gggtctttaggttccacaatccccatggcttattatccagttggtcccatctatccacct
ggctccacagtgctggtggaaggcgggtatgatgcaggtgccagatttggagctggggct
actgctggcaacattcttcctccacctcctggatgccctcccagtgctgctcagcttgca
gtcatgcaggcagccaatgtcctcgtaactcagtggaaggggaacttcttcatgggtgaa
tcagatggtggctacaccatctggtga

>gi568815596r:202107256_202314436|GENSCAN_predicted_peptide_3|159_aa
MMTVASPGGLRTAAASAQHRGCSNFRYASRHFRFAQGGGRERREERQGGKWVGEQPRGGA
GCGAALPVVARVNGSFVSSFKDQVLRISTRAIPQEQRAAAALCALLALPTNIMLQFLLGF
TLGNVVGMYLAQNYDIPNLAKKLEEIKKDLDAKKKPPTS

>gi568815596r:202107256_202314436|GENSCAN_predicted_CDS_3|480_bp
atgatgacggtggcttcacccgggggtctccgcacagcagcggcctcggcacaacaccgc
ggctgcagtaacttccgctacgcctcgcgtcacttccgcttcgcgcagggaggagggaga
gagagaagagaggaaagacaaggcgggaaatgggtgggggagcagccaaggggaggggca
ggctgtggagctgctttgccggtggtcgcgcgggtgaacggcagttttgtcagctccttc
aaggaccaggtcctgcgcatcagtactagggccatcccccaggagcaaagagctgcagca
gccctttgcgccctcctcgccctccccaccaacatcatgctccaattcctgcttggattt
acattgggcaacgtggttggaatgtatctggctcagaactatgacataccaaacctggct
aaaaaacttgaagaaattaaaaaggacttggatgccaagaagaaaccccctacttcatga

>gi568815596r:202107256_202314436|GENSCAN_predicted_peptide_4|529_aa
MLVLFETSVGYAIFKVLNEKKLQEVDSLWKEFETPEKANKIVKLKHFEKFQDTAEALAAF
TALMEGKINKQLKKVLKKIVKEAHEPLAVADAKLGGVIKEKLNLSCIHSPVVNELMRGIR
SQMDGLIPGVEPREMAAMCLGLAHSLSRYRLKFSADKVDTMIVQAISLLDDLDKELNNYI
MRCREWYGWHFPELGKIISDNLTYCKCLQKVGDRKNYASAKLSELLPEEVEAEVKAAAEI
SMGTEVSEEDICNILHLCTQVIEISEYRTQLYEYLQNRMMAIAPNVTVMVGELVGARLIA
HAGSLLNLAKHAASTVQILGAEKALFRALKSRRDTPKYGLIYHASLVGQTSPKHKGKISR
MLAAKTVLAIRYDAFGEDSSSAMGVENRAKLEARLRTLEDRGIRKISGTGKALAKTEKYE
HKSEVKTYDPSGDSTLPTCSKKRKIEQVDKEDEITEKKAKKAKIKVKVEEEEEEKVAEEE
ETSVKKKKKRGKKKHIKEEPLSEEEPCTSTAIASPEKKKKKKKKRENED

>gi568815596r:202107256_202314436|GENSCAN_predicted_CDS_4|1590_bp
atgttggtgctgtttgaaacgtctgtgggttacgccatctttaaggttctaaatgagaag
aaacttcaagaggttgatagtttatggaaagaatttgaaactccagagaaagcaaacaaa
atagtaaagctaaaacattttgagaaatttcaggatacagcagaagcattagcagcattc
acagctctgatggagggcaaaatcaataagcagctgaaaaaagttctgaagaaaatagta
aaagaagcccatgaaccgctggcagtagctgatgctaaactaggaggggtcataaaggaa
aagctgaatctcagttgtatccatagtcctgttgttaatgaacttatgagaggaattcgt
tcacaaatggatggattaatccctggggtagaaccacgtgaaatggcagctatgtgtctt
ggattggctcacagcctgtctcgatatagattgaagtttagcgctgataaagtagacaca
atgattgttcaggcaatttccttgttagatgacttggataaagaactaaacaactacatt
atgcgatgtagagaatggtatggctggcatttccctgaattaggaaaaattatttcagat
aatttaacatactgcaagtgtttacagaaagttggcgataggaagaactatgcctctgcc
aagctttctgagttgctgccagaagaagttgaagcagaagtgaaagcagctgcagagata
tcaatgggaacagaggtttcagaagaagatatttgcaatattctgcatctttgcacccag
gtgattgaaatctctgaatatcgaacccagctctatgaatatctacaaaatcgaatgatg
gccattgcacccaatgttacagtcatggttggggaattagttggagcacggcttattgct
catgcaggttctcttttaaatttggccaagcatgcagcttctaccgttcagattcttgga
gctgaaaaggcacttttcagagccctcaaatctagacgggatacccctaagtatggtctc
atttatcatgcttcactcgtgggccagacaagtcccaaacacaaaggaaagatttctcga
atgctggcagccaaaaccgttttggctatccgttatgatgcttttggtgaggattcaagt
tctgcaatgggagttgagaacagagccaaattagaggccaggttgagaactttggaagac
agagggataagaaaaataagtggaacaggaaaagcattagcaaaaacagaaaaatatgaa
cacaaaagtgaagtgaagacttacgatccttctggtgactccacacttccaacctgttct
aaaaaacgcaaaatagaacaggtagataaagaggatgaaattactgaaaagaaagccaaa
aaagccaagattaaagttaaagttgaagaagaggaagaagaaaaagtggcagaagaagaa
gaaacatctgtgaagaagaagaagaaaaggggtaaaaagaaacacattaaggaagaacca
ctttctgaggaagaaccatgtaccagcacagcaattgctagtccagagaaaaagaagaaa
aagaaaaaaaagagagagaacgaggattaa