GENSCAN 1.0	Date run: 4-Nov-116	Time: 17:56:38

Sequence gi568815578r:35853748_36054284 : 200537 bp : 44.61% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4555   4634   80  2  2   76   94    66 0.874   5.27
 1.02 Intr +   9266   9653  388  1  1   63  105   334 0.997  26.96
 1.03 Intr +  17208  17387  180  1  0   43  113   145 0.964  12.34
 1.04 Intr +  17903  18082  180  0  0  122   22    56 0.332   2.24
 1.05 Intr +  44607  44630   24  1  0   85  101    27 0.595   1.70
 1.06 Intr +  45623  45901  279  0  0   72  108   150 0.955  12.85
 1.07 Intr +  59502  59600   99  1  0   66   51   123 0.982   6.48
 1.08 Intr +  60286  60450  165  2  0   70   82    57 0.891   3.23
 1.09 Intr +  63737  63915  179  0  2   96   96   321 0.996  33.34
 1.10 Intr +  74033  74132  100  1  1   72   80    55 0.678   2.78
 1.11 Intr +  77502  77697  196  1  1   99   92   206 0.977  20.57
 1.12 Intr +  84950  85361  412  2  1   78   95   242 0.645  18.09
 1.13 Intr +  87117  87300  184  2  1   89   54   228 0.980  18.86
 1.14 Term +  93738  93880  143  1  2   90   47   177 0.995  11.89
 1.15 PlyA +  93948  93953    6                               1.05

 2.05 PlyA -  94246  94241    6                               1.05
 2.04 Term - 100592  99998  595  1  1   66   53   815 0.902  69.80
 2.03 Intr - 101071 100701  371  1  2   33  115   398 0.973  30.90
 2.02 Intr - 101355 101239  117  0  0   33   64    99 0.833   2.66
 2.01 Init - 106749 106744    6  0  0   82   76    10 0.736  -0.49
 2.00 Prom - 107506 107467   40                              -9.06

 3.00 Prom + 108102 108141   40                              -1.86
 3.01 Init + 115016 115066   51  1  0   99   81    39 0.918   5.47
 3.02 Intr + 118882 119019  138  0  0   93   69    92 0.984   8.46
 3.03 Intr + 122202 122255   54  2  0  131   69    30 0.954   4.58
 3.04 Intr + 126712 126875  164  0  2  120   73   259 0.756  26.37
 3.05 Intr + 130235 130391  157  2  1   36   86   138 0.594   8.41
 3.06 Intr + 130880 131031  152  1  2   84   61   202 0.973  16.06
 3.07 Intr + 133648 133786  139  1  1   63   62   129 0.914   8.27
 3.08 Intr + 141291 141405  115  2  1   97   91    85 0.997   9.72
 3.09 Intr + 154550 154727  178  0  1   81   74    17 0.029  -1.42
 3.10 Intr + 157390 157510  121  1  1   41   97   148 0.269  11.50
 3.11 Intr + 169855 170024  170  0  2   41   82    83 0.032   1.74
 3.12 Term + 176610 176901  292  0  1   65   36   320 0.925  19.32
 3.13 PlyA + 176933 176938    6                               1.05

 4.05 PlyA - 178180 178175    6                               1.05
 4.04 Term - 181128 181032   97  2  1   90   37    48 0.133  -2.66
 4.03 Intr - 187726 187495  232  2  1   71   85   150 0.049   9.83
 4.02 Intr - 190178 190138   41  1  2  110   55    45 0.023   1.27
 4.01 Intr - 196739 196633  107  2  2   53  105    69 0.012   4.11

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:35853748_36054284|GENSCAN_predicted_peptide_1|869_aa
XTYTVKFYDGVVQTVKHIHVKAFSKDQNIVGNARPKETDHKSLSSSPDKREKFKEQRKAT
VNVKKDKEDKPLKTEKRPKQPDKEGKLICSEKGKVSEKSLPKNEKEDKENISENDREYSG
DAQVDKKPENDIVKSPQENLREPKRKRGRPPSIAPTAQEKSKNYSENTDKDLSRRRSSRL
STNGTHEILDPDLVVSDLVDTDPLQDTLSSTKESEEGQLKSALEAGQVSSALTCHSFGDG
SGAAGLELNCPSMGENTMKTEPTSPLVELQEISTVEDRYQDGSKLTNTFKKTDDFGSSNA
PAVDLDHKFRCKVVDCLKFFRKAKLLHYHMKYFHGMEKSLEPEESPGKRHVQTRGPSASD
KPSQETLTRKRVSASSPTTKDKEKNKEKKFKEFVRVKPKKKKKKKKKTKPECPCSEEISD
TSQEPSPPKAFAVTRCGSSHKPGVHMSPQLHGPESGHHKGKVKALEEDNLSESSSESFLW
SDDEYGQDVDVTTNPDEELDGDDRYDFEVVRCICEVQEENDFMIQCEECQCWQHGVCMGL
LEENVPEKYTCYVCQDPPGQRPGFKYWYDKEWLSRGHMHGLAFLEENYSHQNAKKIVATH
QLLGDVQRVIEVLHGLQLKMSILQSREHPDLPLWCQPWKQHSGEGRSHFRNIPVTDTRSK
EEAPSYRTLNGAVEKPRPLALPLPRSVEESYITSEHCYQKPRAYYPAVEQKLVVETRGSA
LDDAVNPLHENGDDSLSPRLGWPLDQDRSKGDSDPKPGSPKVKEYVSKKALPEEAPARKL
LDRGGEGLLSSQHQWQFNLLTHVESLQDEVTHRMDSIEKELDVLESWLDYTGELEPPEPL
ARLPQLKHCIKQLLMDLGKVQQIALCCST
>gi568815578r:35853748_36054284|GENSCAN_predicted_CDS_1|2610_bp
ngtacttacactgtgaaattttatgatggagtagttcagactgtcaaacatattcatgtc
aaagctttttccaaagatcagaatattgtgggtaatgctaggcctaaagaaacagatcac
aaaagtctttcatcatctcctgataaacgagagaagtttaaagaacagagaaaagcaaca
gtgaatgtgaagaaagacaaagaagataaacccttaaagacagaaaagcgacccaagcag
cctgataaagaaggaaagttaatctgttctgaaaaggggaaagtgtcagagaaaagtctt
cccaagaacgagaaggaagacaaggaaaacatttccgaaaatgacagagagtattctgga
gatgcccaagtggataagaaacctgaaaatgacattgtgaagagtccacaagaaaacttg
agggaacccaaaagaaaacgaggcagacccccttccatagctcctactgcccaggaaaag
tcaaaaaactactcggaaaacactgacaaagacttatcgaggagacgttcctccaggctg
tccactaatgggacccatgagatcctagatcctgacttggttgtatcagatttggttgat
acggatcctttgcaagacacgttgtctagtaccaaggaatctgaagaaggtcagttgaag
tctgctttggaagctggccaggtctcatctgcactgacttgccactcctttggggatgga
tccggggctgcaggcttggagttgaactgcccatcaatgggagaaaacacgatgaaaaca
gaaccgacttctccccttgtggaattacaagagatttcgactgtggaagataggtaccag
gatggtagtaaattaacaaatacttttaagaaaacagatgattttgggtcatctaatgca
ccagctgtcgacctagaccataagtttagatgcaaagttgtggactgtttaaaatttttc
cgcaaagccaaactgttgcactatcacatgaagtatttccatggaatggagaagtcactg
gagccagaagagagcccgggaaagaggcatgtccaaaccaggggcccttcagcttcagac
aagcccagccaggagaccctgaccaggaagcgggtctctgccagttccccaactacaaaa
gacaaggaaaagaataaagagaagaaattcaaggagtttgtgagagtgaagccaaagaag
aaaaagaaaaagaaaaagaaaaccaaacctgaatgcccctgcagtgaggagatcagtgac
acctcccaggaaccttctccacccaaggcatttgctgttaccaggtgtgggtcctcacac
aagccaggggtccatatgagcccgcagcttcatggcccagaatctggacaccacaaaggg
aaagtgaaagcattggaggaggataatttgagtgagtcctcttctgagagctttctctgg
agtgatgatgagtatggccaagatgtggatgtgaccaccaacccagatgaggaacttgat
ggggatgaccgctatgacttcgaggtggtccgctgcatctgtgaggtccaggaggaaaat
gacttcatgattcagtgtgaagagtgccagtgctggcagcatggggtctgcatgggatta
ctggaagaaaatgtgcccgagaaatacacctgttatgtttgccaagaccctccaggtcag
aggcctggcttcaagtactggtatgacaaggagtggctgagcaggggacatatgcatggc
ctggcatttctagaagagaactactcccatcagaatgccaagaagatcgtggccacccac
cagcttcttggtgatgtgcagagagtgattgaggttctgcatggcctgcagctcaagatg
agcatcttgcaaagccgggagcatcctgatctgccgctgtggtgccagccttggaaacag
cactcaggggaggggagatctcatttcagaaacatccctgtcactgacaccaggagcaag
gaggaagctccaagctatagaactttgaacggggcagtggagaagcccaggcccctggcc
ctgcccctgccgcgttctgtggaggaatcctatatcaccagtgagcattgctaccagaag
ccccgcgcctattaccctgccgtggagcagaagctggtggtggagacgaggggctctgcc
ctcgacgatgcggtcaaccccctccatgagaacggcgatgattccctttccccgcgcctg
ggctggcctctagaccaagacaggagcaagggggacagtgaccccaaacccggctcccca
aaggtgaaggaatatgtctccaaaaaggccctaccagaagaagcccctgctcggaagctg
ctggacagaggtggagaggggctgctgagctcccagcaccagtggcagtttaacctgctg
acccatgtggaatctcttcaggatgaagttacgcacaggatggactccattgagaaggag
ttggatgtgttggagagctggctggactacactggggaactggagccccctgagccgctg
gccaggcttccgcagctcaagcattgtatcaagcagctgctgatggacctgggcaaggtg
cagcagatcgccctctgctgctcaacatga
>gi568815578r:35853748_36054284|GENSCAN_predicted_peptide_2|362_aa
MANPQSRISLGFGGHAKNNNSKGFSSISFAPDGVLYFVIVAEGPSELQLIRVNQKGRPHA
KVTCKPKRRTDKVRDLKSISKHSGSSSDSSSNWRSEAPPFGRSPGHAQLALPHREGSSRL
MRLENARGRRFREPASGFRVLARRPGRREARDFRFRLPQALRWCRRSAEHTATGQRVTPG
AGVMAATEPILAATGSPAAVPPEKLEGAGSSSAPERNCVGSSLPEASPPAPEPSSPNAAV
PEAIPTPRAAASAALELPLGPAPVSVAPQAEAEARSTPGPAGSRLGPETFRQRFRQFRYQ
DAAGPREAFRQLRELSRQWLRPDIRTKEQIVEMLVQEQLLAILPEAARARRIRRRTDVRI
TG
>gi568815578r:35853748_36054284|GENSCAN_predicted_CDS_2|1089_bp
atggcgaacccacagagccgcatttcccttggtttcggaggacacgcgaaaaataacaac
agcaaagggttctcgagcatctccttcgcgccagacggtgtgttgtactttgtcatcgtc
gcagaggggccatcagagctacagctcatcagggtcaatcagaaggggcgcccgcacgca
aaggtcacctgcaaaccgaagcggcgcacggataaggtccgggacttgaagtctatctcc
aaacattcaggctcctcctccgactcctcctcgaactggcgctcagaggcgccaccattc
ggaaggagcccaggccacgcgcagctcgccctcccgcaccgcgagggcagcagccgcctc
atgcgactcgagaacgctagagggcgccgcttccgggagcccgcaagcggcttccgggtg
ctcgcgcgccgacctggacgcagagaagccagagactttcgcttccggctgccgcaggcg
cttcgctggtgcagacgcagtgctgagcacacagctaccggacaaagagtgacgcccgga
gctggagttatggcggctacggagccgatcttggcggccactgggagtcccgcggcggtg
ccaccggagaaactggaaggagccggttcgagctcagcccctgagcgtaactgtgtgggc
tcctcgctgccagaggcctcaccgcctgcccctgagccttccagtcccaacgccgcggtc
cctgaagccatccctacgccccgagctgcggcctccgcggccctggagctgcctctcggg
cccgcacccgtgagcgtagcgcctcaggccgaagctgaagcgcgctccacaccaggcccc
gccggctctagactcggtcccgagacgttccgccagcgtttccggcagttccgctaccag
gatgcggcgggtccccgggaggctttccggcagctgcgggagctgtcccgccagtggctg
cggcctgacatccgcaccaaggagcagatcgtggagatgctggtgcaagagcagctgctc
gccatcctgcccgaggcggctcgggcccggcggatccgccgccgcacggatgtgcgcatc
actggctga
>gi568815578r:35853748_36054284|GENSCAN_predicted_peptide_3|576_aa
MRRHMVTYAWQLLKKELGLYQLAMDIIIMIRVCKMFRQGLRGFREYQIIETAHWKHPIFS
FWDKKMQSRVTFDTMDFIAEEGHFPPKAIQIMQKKPSWRTEDEIQAVCNILQVLDSYRNY
AEPLQLLLAKVMRFERFGRRRVIIKKGQKGNSFYFIYLGTVAITKDEDGSSAFLDPHPKL
LHKGSCFGEMDVLHASVRRSTIVCMEETEFLVVDREDFFANKLDQEVQKDAQYRFEFFRK
MELFASWSDEKLWQLVAMAKIERFSYGQLISKDFGESPFIMFISKGSCEVLRLLDLGASP
SYRRWIWQHLELIDGRPLKTHLSEYSPMERFKEFQIKSYPLQDFSSLKLPHLKKAWGLQG
TSFSRKIRTSGDTLPKMLGPKIQSRPAQSIKCAMINIKPGELPKEAAVGAYVKVHTVEQG
EILGLHQAFLPEGECDTRPLILMSLGNELIRIRKEIFYELIDNDDEMIKKLLKLNIAFPS
DEDMCQKFLQQNSWNIFRKDLLQLLVEPCQSQLFTPNRPKKREIYNPKSVVLDLCSINKT
TKPRYPIFMAPQKYLPPLRIVQAIKAPRYKIRELLA
>gi568815578r:35853748_36054284|GENSCAN_predicted_CDS_3|1731_bp
atgaggagacatatggtaacttatgcctggcagctcctgaagaaggaactgggactgtac
cagctcgccatggatatcatcataatgatccgagtgtgtaaaatgttccgccaaggcctc
aggggattccgggaatatcaaatcattgagactgctcactggaagcaccctatcttctcc
ttctgggataaaaagatgcaaagccgagtcacatttgataccatggacttcattgcagag
gagggtcactttcctccaaaggccattcagatcatgcagaagaagccttcctggagaaca
gaggatgagatccaggccgtctgtaacatcttgcaggttctggatagctatcggaactac
gcagagcccctgcagctgctcctggccaaagtcatgcgctttgaacggtttggtcgcagg
cgtgtgatcatcaagaaggggcagaagggcaacagcttttatttcatctacctgggcaca
gttgcaataaccaaggacgaggatggcagcagtgccttcctagatccccacccgaaattg
ctgcacaagggtagctgttttggggaaatggacgttctgcatgcttcagtgaggaggtcc
accatcgtctgtatggaagaaacggagttcctggttgttgaccgggaggacttctttgct
aataagctggaccaggaagttcagaaggatgctcagtatcggtttgaattttttaggaag
atggagctgtttgcatcatggtctgatgagaagctctggcagctggtagccatggcgaag
atagagaggttctcgtatgggcagctgatctcaaaagattttggagagtcacccttcatc
atgtttatcagcaagggcagctgtgaagtcctgcggctgttggaccttggggcctcccct
tcctaccgtagatggatctggcagcacctggagctgatagatggcagacctctgaagacc
cacctgagtgaatactctcctatggaaagatttaaggaattccagatcaaatcatatcct
ctgcaagactttagctccttgaaacttccacatctcaaaaaagcctgggggctacagggg
acaagcttcagcaggaagatcagaacctcaggagacactctccccaagatgctgggcccg
aagatccaatccaggcctgctcagtcgatcaaatgtgccatgatcaatatcaagcctggt
gagctccccaaggaggctgcagtgggggcctacgtgaaggtgcacactgtggagcaggga
gaaattttgggtcttcaccaggccttccttccagagggtgaatgcgacacacgacccttg
atcctgatgagcctgggaaatgagttgatacggataaggaaggaaatattttatgaactg
attgacaatgatgacgagatgataaaaaagttgttaaagctcaatattgcattccccagt
gatgaagatatgtgccagaagttcctccagcagaacagctggaatatctttcggaaggac
ctgttgcagctgctcgtggagccttgccaaagtcaactgttcactccaaaccggcccaag
aagagagagatctacaaccctaagtctgtggtcctggatttgtgcagcatcaacaagacg
actaaacctcgttatcctatttttatggcaccccagaaatacctccccccattgaggatt
gtccaagccatcaaagcacctcggtacaaaatccgagaactcttggcttag
>gi568815578r:35853748_36054284|GENSCAN_predicted_peptide_4|158_aa
ATEPRRSGSRVTTLSPLGETYLVLTTTDKGLKGPGRSDAGPFKAMSFIGGFRFSDASSPR
PPCFMEMPRSLKVRGLRWTEDKRCLEGGEQVAANGLQDASRRGGKPHELMIIEADEDVDN
GAMCECCTPPTQCLVWFAALSPLSPERLPVHVPMSSFT
>gi568815578r:35853748_36054284|GENSCAN_predicted_CDS_4|477_bp
gccacggagccgcgcagatccggttcccgggtgaccactctgtcgccattgggcgagacc
tacctagtcctgacgacaacggacaaaggccttaaggggcctggaagatcagatgctggg
cctttcaaagccatgagtttcatcgggggttttagattcagtgatgccagctctcctagg
cctccctgtttcatggagatgcccaggagcctgaaggtgaggggactgagatggacagaa
gacaagcgctgtttggagggaggggagcaggtagcagccaatgggctccaggatgcttct
cgcagagggggcaaaccccatgagctgatgatcatcgaagctgatgaggatgtggataat
ggagcaatgtgtgaatgctgtaccccccccacacagtgcttggtatggtttgctgccctg
tctccactctcccctgagaggctgcccgttcatgttcccatgagctccttcacctag