GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:05:30 Sequence gi568815587f:7887353_8096019 : 208667 bp : 44.33% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 13777 13850 74 2 2 93 35 109 0.549 4.17 1.02 PlyA + 13965 13970 6 1.05 2.04 PlyA - 16275 16270 6 1.05 2.03 Term - 18464 18273 192 2 0 65 47 185 0.789 9.52 2.02 Intr - 28047 27903 145 2 1 45 62 71 0.158 0.48 2.01 Init - 33990 33572 419 0 2 71 53 129 0.129 3.60 2.00 Prom - 34044 34005 40 -4.46 3.03 PlyA - 34161 34156 6 1.05 3.02 Term - 34657 34205 453 1 0 19 48 199 0.376 4.16 3.01 Init - 36155 35487 669 2 0 66 41 280 0.361 16.31 3.00 Prom - 46126 46087 40 -6.16 4.06 PlyA - 46508 46503 6 1.05 4.05 Term - 47313 47140 174 0 0 97 41 45 0.029 -1.54 4.04 Intr - 52096 51959 138 1 0 45 105 91 0.004 7.16 4.03 Intr - 61061 60919 143 0 2 58 71 82 0.014 3.57 4.02 Intr - 73970 72528 1443 0 0 63 61 1075 0.021 91.90 4.01 Init - 76143 75855 289 0 1 104 101 285 0.872 26.68 4.00 Prom - 81249 81210 40 -3.16 5.00 Prom + 84610 84649 40 -1.76 5.01 Init + 96699 96774 76 2 1 73 83 56 0.449 3.71 5.02 Intr + 99648 100021 374 1 2 42 100 258 0.450 17.18 5.03 Intr + 100071 100273 203 2 2 24 25 256 0.712 10.98 5.04 Intr + 100279 100364 86 1 2 41 92 142 0.976 9.36 5.05 Intr + 104429 104499 71 0 2 96 81 77 0.998 6.70 5.06 Intr + 104732 104811 80 1 2 102 119 71 0.999 9.95 5.07 Intr + 105535 105672 138 1 0 78 60 246 0.998 20.38 5.08 Intr + 107074 107165 92 1 2 108 105 151 0.997 18.44 5.09 Intr + 107630 107766 137 0 2 44 85 144 0.999 9.99 5.10 Intr + 107902 108015 114 0 0 64 82 185 0.999 16.14 5.11 Term + 108593 108670 78 1 0 140 42 86 0.998 6.96 5.12 PlyA + 108790 108795 6 1.05 6.08 PlyA - 108841 108836 6 1.05 6.07 Term - 115956 115897 60 0 0 93 43 43 0.107 -1.90 6.06 Intr - 125435 125309 127 1 1 53 109 48 0.054 4.08 6.05 Intr - 130456 130344 113 1 2 71 84 41 0.006 1.18 6.04 Intr - 144384 144216 169 2 1 119 15 135 0.373 9.25 6.03 Intr - 146668 146596 73 2 1 56 74 33 0.436 -2.84 6.02 Intr - 149168 149120 49 2 1 89 89 18 0.211 0.25 6.01 Init - 149880 149797 84 0 0 68 75 62 0.514 3.72 6.00 Prom - 155110 155071 40 -0.26 7.03 PlyA - 155448 155443 6 1.05 7.02 Term - 175610 175520 91 1 1 117 45 97 0.471 5.49 7.01 Init - 182739 182645 95 0 2 85 49 72 0.388 2.85 7.00 Prom - 186734 186695 40 -3.46 8.00 Prom + 189622 189661 40 -7.36 8.01 Init + 194059 194196 138 0 0 72 101 148 0.999 12.64 8.02 Intr + 195094 195266 173 0 2 67 36 202 0.419 11.64 8.03 Intr + 202258 202309 52 1 1 88 109 72 0.935 8.21 8.04 Intr + 202717 202879 163 0 1 57 69 293 0.989 23.85 8.05 Intr + 206694 206837 144 1 0 18 82 89 0.584 1.55 8.06 Intr + 208146 208313 168 1 0 74 84 151 0.640 13.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 73970 72292 1679 0 2 63 40 1104 0.968 92.31 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:7887353_8096019|GENSCAN_predicted_peptide_1|24_aa XIMVLEVVYENKYQKYEKDLHENE >gi568815587f:7887353_8096019|GENSCAN_predicted_CDS_1|75_bp nggataatggtcctggaagttgtgtatgagaacaaatatcagaaatatgaaaaggatctg catgagaacgagtaa >gi568815587f:7887353_8096019|GENSCAN_predicted_peptide_2|251_aa MGKDFMSKTPKAMATKDKIDKWGLIKLKSFCTAKETTIRVNRQPTKREKIFATYSSDKEL ISRIYNELKQIYKKKTNNPIKKWVKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMEIKTT MRYHLTPVRMAIIKKSGNNRSPQTITDAELQVTLTVEDKPIPFLISTEATHSTLPSFQGP VSLASITVVHAGYIVVPQTVTQLGSQQVVTQNRDAKDANKWRLNTGLRRCEQKRSAGAEK LKLTRQTGTPG >gi568815587f:7887353_8096019|GENSCAN_predicted_CDS_2|756_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaattgac aaatggggtctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aataggcaacctacaaaacgggagaaaatttttgcaacctactcatctgacaaagagcta atatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggtgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaa aagcacatgaaaaaatgctcatcatcactggccatcagagaaatggaaatcaaaaccaca atgagataccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaga agcccccagaccatcacggacgctgagcttcaggtaactctcacagtggaagataagccc atccccttcttaatcagtacggaggctacccactccacattaccttcttttcaagggcct gtttcccttgcctccataactgttgtccatgcaggttacatagtggtgccccaaacagtg acacaattgggttctcaacaagtggtgactcagaacagggacgccaaggatgccaacaag tggcgtctgaacacgggacttcgacgatgtgaacaaaaaagatctgctggagcagagaag ctgaaattgacaaggcaaacggggaccccaggatga >gi568815587f:7887353_8096019|GENSCAN_predicted_peptide_3|373_aa MKAEIKMFFETNENKDTTYQDLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELE KQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESKSWFFERINKIDRPLARLIKKKR EKNQIDTIKNDKGDMTTDPTEIQTTIREYYKHLYAKKLENLEEMDKFLNTYILPRLNQEE VESLNRPITGAEIVAIINSLPIKKSPGPDGFTAKFSQRYKEELRIKYLGIQLTRDVKDLF NENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMALLPKVIYRFNAIPIKLPMTFFTELE KTTLKFIWNQKRAHFAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQNRA LGNNTAYLQLSDL >gi568815587f:7887353_8096019|GENSCAN_predicted_CDS_3|1122_bp atgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacacaacataccag gatctctgggacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaagaactagaa aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaaatcagagca gaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatccaagagctgg ttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaagaaaaaaaga gagaagaatcaaatagacacaataaaaaatgataaaggggatatgaccactgatcccaca gaaatacaaactaccatcagagaatactacaaacacctctatgcaaaaaaactagaaaat ctagaagaaatggataaattcctcaacacatacattctcccaagactaaaccaggaagaa gttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatcaatagctta ccaatcaaaaagagtccaggaccagatggattcacagccaaattctcccagaggtacaag gaggaactgagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttc aatgagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaac attccatgctcatgggtaggaagaatcaatatcgtgaaaatggccttactgcccaaggta atttatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaa aaaactaccttaaagttcatatggaaccaaaaaagagcccactttgccaagtcaatccta agccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggct acagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcagaacagagcc ctcggaaataacaccgcatatctacaactatctgatctttga >gi568815587f:7887353_8096019|GENSCAN_predicted_peptide_4|728_aa MAMAKARKPREALLWALSDLEENDFKKLKFYLRDMTLSEGQPPLARGELEGLIPVDLAEL LISKYGEKEAVKVVLKGLKVMNLLELVDQLSHICLHDYREVYREHVRCLEEWQEAGVNGR YNQVLLVAKPSSESPESLACPFPEQELESVTVEALFDSGEKPSLAPSLVVLQGSAGTGKT TLARKMVLDWATGTLYPGRFDYVFYVSCKEVVLLLESKLEQLLFWCCGDNQAPVTEILRQ PERLLFILDGFDELQRPFEEKLKKRGLSPKESLLHLLIRRHTLPTCSLLITTRPLALRNL EPLLKQARHVHILGFSEEERARYFSSYFTDEKQADRAFDIVQKNDILYKACQVPGICWVV CSWLQGQMERGKVVLETPRNSTDIFMAYVSTFLPPDDDGGCSELSRHRVLRSLCSLAAEG IQHQRFLFEEAELRKHNLDGPRLAAFLSSNDYQLGLAIKKFYSFRHISFQDFFHAMSYLV KEDQSRLGKESRREVQRLLEVKEQEGNDEMTLTMQFLLDISKKDSFSNLELKFCFRISPC LAQDLKHFKEQMESMKHNRTWDLEFSLYEAKIKNLVKVGAKPLFSDIDQLVGTDAPGEAV LLGADSSGLSGTRKIFIPEDSSFLKVQLFGVFLVIYVVTLMGNAIITVIISLNQSLHVPM YLFLLNLSVVEKLFSLIRFHLSILAFVAIAFGVILIPKPGRDTTKKDNFRPISLMNIDAK ILNKILAN >gi568815587f:7887353_8096019|GENSCAN_predicted_CDS_4|2187_bp atggccatggccaaggccagaaagccccgggaggcattgctctgggccttgagtgacctt gaggagaacgatttcaagaagttaaagttctacttacgggatatgaccctgtctgagggc cagcccccactggccagaggggagttggagggcctgattccggtggacctggcagaatta ctgatttcaaagtatggagaaaaggaggctgtgaaagttgtcctcaagggcttgaaggtc atgaacctgttggaacttgtggaccagctcagccatatttgtctgcatgattacagagaa gtataccgagagcatgtgcgctgcctagaggaatggcaggaagcaggagtcaatggcaga tacaaccaggtgctcctggtggccaagcccagctcagagagcccagaatcacttgcctgc cccttcccggagcaggagctggagtctgtcacggtggaggctctatttgattcaggggaa aagccctcactggccccatccttagttgtgctacaggggtcggctggcactggaaagaca actctcgccagaaaaatggtgttggactgggccaccggtactctgtacccaggccggttt gattatgtcttttatgtaagctgcaaagaagtggtcctgctgctggagagcaaactggag cagctccttttctggtgctgcggggacaatcaagcccctgtcacagagattctgaggcag ccagagcggctcctgttcatcctggatggctttgatgagctgcagaggccctttgaagaa aagttgaagaagaggggtttgagtcccaaggagagcctgctgcaccttctaattaggaga catacactccccacgtgctcccttctcatcaccacccggcccctggctttgaggaatctg gagcccttgctgaaacaagcacgtcatgtccatatcctaggcttctctgaggaggagagg gcgaggtacttcagctcctatttcacggatgagaagcaagctgaccgtgccttcgacatt gtacagaaaaatgacattctctacaaagcgtgtcaggttccaggcatttgctgggtggtc tgctcctggctgcaggggcagatggagagaggcaaagttgtcttagagacacctagaaac agcactgacatcttcatggcttacgtctccacctttctgccgcccgatgatgatgggggc tgctccgagctttcccggcacagggtcctgaggagtctgtgctccctagcagctgaaggg attcagcaccagaggttcctatttgaagaagctgagctcaggaaacataatttagatggc cccaggcttgccgctttcctgagtagtaacgactaccaattgggacttgccatcaagaag ttctacagcttccgccacatcagcttccaggacttttttcatgccatgtcttacctggtg aaagaggaccaaagccggctggggaaggagtcccgcagagaagtgcaaaggctgctggag gtaaaggagcaggaagggaatgatgagatgaccctcactatgcagtttttactggacatc tcgaaaaaagacagcttctcgaacttggagctcaagttctgcttcagaatttctccctgt ttagcgcaggatctgaagcattttaaagaacagatggaatctatgaagcacaacaggacc tgggatttggaattctccctgtatgaagctaaaataaagaatctggtaaaagtgggggcc aagcccttattctctgacattgatcagttagtaggtacagatgccccaggtgaagcagtc cttttaggggctgacagctcaggcctttctggcactcgcaaaatcttcattcctgaagat tcttcattcttgaaggtgcagctctttggggttttcctagttatttatgtggtgaccctg atgggaaatgccatcattacagtcatcatctccttaaaccagagcctccacgttcccatg tacctgttcctcctgaacctatctgtggtggagaagctctttagtttaattagattccat ttgtcaattttggcttttgttgccattgcttttggtgtcatcctgataccaaaacctggc agagacacaacaaaaaaagacaatttcaggccaatatccctgatgaacatcgatgcaaaa atcctcaataaaatactggcaaactga >gi568815587f:7887353_8096019|GENSCAN_predicted_peptide_5|482_aa MGRQHQRVGGSITVHAAALPHEGSLSTLTSIISFKSKTRISTSALLSGSTWRQLHKVFLF ERRPKFYPELLSPGFPECSSGQKQSAQIPEEITNGGTISVLREVSPVTNQQKALAHVVYI QPIKLLIFMNAVISASASFFLDKMATPAVPPQFQRQRQHRLRLRFPLRLQPHPQTLRQQR LQLRLLARPRPQRKLQRRPQRPLCLVLLFQGPSPAAAWLHPVILASIVDSYERRNEGAAR VIGTLLGTVDKHSVEVTNCFSVPHNESEDEVAVDMEFAKNMYELHKKVSPNELILGWYAT GHDITEHSVLIHEYYSREAPNPIHLTVDTSLQNGRMSIKAYVSTLMGVPGRTMGVMFTPL TVKYAYYDTERIGVDLIMKTCFSPNRVIGLSSDLQQVGGASARIQDALSTVLQYAEDVLS GKVSADNTVGRFLMSLVNQVPKIVPDDFETMLNSNINDLLMVTYLANLTQSQIALNEKLV NL >gi568815587f:7887353_8096019|GENSCAN_predicted_CDS_5|1449_bp atggggcggcagcatcagagggtgggcggcagcatcacagtgcatgcggcagctctgccc catgagggaagccttagcactttgacatccattatctcatttaagtctaaaacacgaatt tcgacctccgcgctgctttctggttccacctggcggcagctgcacaaagtgtttctcttc gaacgccgtcccaagttttacccggaactacttagcccagggtttcctgagtgctcctcg ggccaaaagcagagcgcacaaattccagaagaaattaccaacggcgggactatttcagtg ctccgagaagtatcaccagtaaccaatcagcaaaaagctctagctcacgtggtgtatatt cagcctatcaagctcctgatattcatgaatgctgtcatttccgcttccgcctccttcttt ctcgacaagatggccacaccggcggtaccacctcagttccagcgccaacgccagcaccgg ctgcggctccggttcccgctgcggctccagcctcatcctcagaccctgcggcagcagcgg ctgcaactgcggctcctggccagaccccggcctcagcgcaagctccagcgcagaccccag cgcccgctctgcctggtcctgctcttccagggcccttccccggcggccgcgtggctgcac ccagtcattttggcctccattgtggacagctacgagagacgcaacgagggtgctgcccga gttatcgggaccctgttgggaactgtcgacaaacactcagtggaggtcaccaattgcttt tcagtgccgcacaatgagtcagaagatgaagtggctgttgacatggaatttgctaagaat atgtatgaactgcataaaaaagtttctccaaatgagctcatcctgggctggtacgctacg ggccatgacatcacagagcactctgtgctgatccacgagtactacagccgagaggccccc aaccccatccacctcactgtggacacaagtctccagaacggccgcatgagcatcaaagcc tacgtcagcactttaatgggagtccctgggaggaccatgggagtgatgttcacgcctctg acagtgaaatacgcgtactacgacactgaacgcatcggagttgacctgatcatgaagacc tgctttagccccaacagagtgattggactctcaagtgacttgcagcaagtaggaggggca tcagctcgcatccaggatgccctgagtacagtgttgcaatatgcagaggatgtactgtct ggaaaggtgtcagctgacaatactgtgggccgcttcctgatgagcctggttaaccaagta ccgaaaatagttcccgatgactttgagaccatgctcaacagcaacatcaatgaccttttg atggtgacctacctggccaacctcacacagtcacagattgcactcaatgaaaaacttgta aacctgtga >gi568815587f:7887353_8096019|GENSCAN_predicted_peptide_6|224_aa MPQSCTASLKGVSARGTRTDILHAGTCQNMTLQSVTVLCISALNDKAGYAPKFYLETNAY SPALPVSLRCGHCQQIAHGQLGLVPGLSSYTKSETKEDGIEEVQAHPTLMRPRIAPQPLT SRQTELLSLPFLCNTAISITTEVPNLLIELPKRIINQYLPRASLVSENAIESSGLPLRPI KGLRGQMDPNGLQCLGEEDAKHMSQDTFTAHLDDSNPSFRLPVV >gi568815587f:7887353_8096019|GENSCAN_predicted_CDS_6|675_bp atgccccaaagctgcacagccagcctcaagggagtcagtgcaaggggcacacgcacagac atcctgcatgccggcacctgccagaacatgactcttcagtcggtcacagtgctttgcatt agtgcactgaatgacaaggccgggtatgccccaaaattctacctggaaacaaacgcctat tctccggccctccctgtatccctgcggtgcggccactgccagcagatcgcgcatggccag ctcggactggtccctgggctctccagttacactaaaagtgagaccaaggaagatggtatc gaggaggtgcaggctcatccaacattgatgcggccccggattgccccacagcccctcacc tcacggcagaccgagcttttaagtttaccatttctgtgcaatacagccatctctataacc actgaagtacccaacttgctcatagagctacccaaacgcatcattaaccaatatttacca agggccagcttggtgtcagaaaatgccatagaaagttctgggctccccttgaggccgatt aaagggctaagaggacagatggacccaaatgggcttcaatgtttgggtgaggaagatgca aagcacatgagccaggacacctttactgctcatttggacgactccaatccatctttcaga ctgccagtggtgtga >gi568815587f:7887353_8096019|GENSCAN_predicted_peptide_7|61_aa MVCERGLCDKKASVAMLPRWLVSISNTTSIFPKRLPKSMSASPSHESADDLNTFQRQRLS V >gi568815587f:7887353_8096019|GENSCAN_predicted_CDS_7|186_bp atggtttgtgaacgtgggctctgtgataagaaagccagtgtggccatgttgcctcgctgg ctagtgtcgatcagcaacaccacctccattttcccgaaaaggttaccgaagagtatgtcg gcctctccttcccacgagagtgcagatgacctgaacaccttccagagacaacgtctgagt gtctaa >gi568815587f:7887353_8096019|GENSCAN_predicted_peptide_8|280_aa MRPGAARELSRAPAPAPPGPGLQSRSHRPAPERHDFQAAFRLDSLQGRELIAMAAPDSWP PIGSLVAMGYGAVSVENGCWTPKAHCVIVDPGARASGRQDEILSVLDDEGRNLRQQKLDR QRALLEQKQKKKRQEPLMVQANADGRPRSRRARQSEEQAPLVESYLSSSGSTSYQVQEAD SLASVQLGATRPTAPASAKRTKAAATAGGQGGAARKEKKGKHKGTSGPAALAEDKSEAQG PVQILTVGQSDHAQDAGETAAGGGERPSGQDLRATMQRKX >gi568815587f:7887353_8096019|GENSCAN_predicted_CDS_8|840_bp atgcggcccggggcggcccgggagctgagcagggcccccgcgccggcccctccgggcccc ggcctccagagccgcagccaccgccccgcccccgagagacatgacttccaagccgcattc cgactggattccctacagggaagggagctcatcgccatggcagcgccagattcttggcca cccattggctccttggttgccatgggctacggggctgtctctgtggaaaatgggtgctgg actcctaaggcccactgtgttattgtcgacccaggtgcacgtgctagcggacggcaggat gagatcctcagtgtcttagatgatgagggcagaaacctgaggcagcagaagcttgatcgg cagcgggccctgctggagcagaagcagaagaagaagcgccaggagcccctgatggtgcag gccaatgcagatgggcggccccggagccggcgggcccggcagtcagaggaacaagccccc ctggtggagtcctacctcagcagcagtggcagcaccagctaccaagttcaagaggccgac tcactcgccagtgtgcagctgggagccacgcgcccaacagcaccagcttcagccaagaga accaaggcggcagctacagcagggggccagggtggcgccgctaggaaggagaagaaggga aagcacaaaggcaccagcgggccagcagcactggcagaagacaagtctgaggcccaaggc ccagtgcagattctgactgtgggccagtcagaccacgcccaggacgcaggggagacggca gctggtgggggcgaacggcccagcgggcaggatctccgtgccacgatgcagaggaaggnn