GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:37:33

Sequence gi568815596r:80062451_80263211 : 200761 bp : 40.25% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   9878   9988  111  1  0   86   32    89 0.069   2.76
 1.02 Term +  16457  16606  150  1  0   69   40   108 0.508   1.03
 1.03 PlyA +  16759  16764    6                               1.05

 2.04 PlyA -  17504  17499    6                               1.05
 2.03 Term -  30362  30272   91  1  1   42   47   160 0.958   3.51
 2.02 Intr -  30866  30783   84  1  0   93   75    31 0.513   0.32
 2.01 Init -  31632  31214  419  0  2   71   53   135 0.242   4.25
 2.00 Prom -  31686  31647   40                              -6.15

 3.02 PlyA -  31803  31798    6                               1.05
 3.01 Sngl -  33800  33150  651  2  0   69   44   314 0.798  21.02
 3.00 Prom -  34264  34225   40                              -6.15

 4.02 PlyA -  34433  34428    6                               1.05
 4.01 Sngl -  35676  35287  390  0  0   79   54   305 0.750  22.17
 4.00 Prom -  37804  37765   40                              -8.45

 5.00 Prom +  38512  38551   40                              -3.45
 5.01 Init +  43547  43588   42  1  0   68  110    -3 0.090   0.37
 5.02 Term +  45082  45255  174  0  0   85   54   140 0.337   7.08
 5.03 PlyA +  46606  46611    6                               1.05

 6.05 PlyA -  46686  46681    6                               1.05
 6.04 Term -  60641  60550   92  0  2   55   53   143 0.560   4.40
 6.03 Intr -  63717  63689   29  2  2  109   93    23 0.342   1.74
 6.02 Intr -  66723  66657   67  1  1  126   89    22 0.460   3.14
 6.01 Init -  76228  76081  148  1  1   65   79    72 0.171   4.30
 6.00 Prom -  77100  77061   40                              -7.45

 7.00 Prom +  78130  78169   40                              -7.85
 7.01 Init +  83653  83736   84  0  0   77   78    80 0.817   6.77
 7.02 Intr +  85818  85921  104  2  2   50   80   110 0.748   4.45
 7.03 Intr +  89335  89533  199  1  1   31   -7   155 0.510  -1.27
 7.04 Term +  90609  90740  132  2  0  119   49   144 0.830  10.71
 7.05 PlyA +  91999  92004    6                               1.05

 8.09 PlyA -  92485  92480    6                               1.05
 8.08 Term - 100699 100098  602  2  2    6   34   240 0.112   4.10
 8.07 Intr - 114513 114277  237  1  0   85   88   146 0.650  10.96
 8.06 Intr - 115151 114945  207  0  0   59  -34   181 0.270   1.13
 8.05 Intr - 116848 116491  358  1  1   67   81   269 0.967  18.00
 8.04 Intr - 117695 117235  461  0  2   67   36   315 0.805  16.18
 8.03 Intr - 135871 135796   76  1  1   55   93    40 0.008  -0.63
 8.02 Intr - 141375 141091  285  0  0  -24   39   230 0.068   3.51
 8.01 Init - 146208 146125   84  0  0   41   87    46 0.217   0.68
 8.00 Prom - 148248 148209   40                              -2.85

 9.02 PlyA - 149014 149009    6                               1.05
 9.01 Sngl - 151379 150486  894  2  0   49   42   323 0.870  19.68
 9.00 Prom - 159414 159375   40                              -2.65

10.03 PlyA - 160659 160654    6                               1.05
10.02 Term - 180510 180205  306  0  0  101   55   136 0.772   5.73
10.01 Init - 181556 181431  126  2  0   77   47   106 0.833   5.70
10.00 Prom - 183031 182992   40                              -5.75

11.03 PlyA - 183301 183296    6                               1.05
11.02 Term - 186680 186577  104  0  2   44   41   142 0.061   2.56
11.01 Init - 192725 192650   76  2  1   71   68    80 0.745   5.60
11.00 Prom - 194679 194640   40                              -2.65

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_1|86_aa
YVRCQKFQALLWFCQFLGAQLSLLISVALKLGKGPSGVLEETGVLFDSSFERTAAMGDFL
IDAKKLCFALPLRAFTIESLLTLPVN
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_1|261_bp
tatgtccgctgccaaaagttccaagccctgctctggttctgccagtttcttggtgctcaa
ctctctctcctgatttctgtagcacttaagcttggaaaagggccttctggggttttagaa
gaaacaggagtcttatttgattcctcctttgaaagaactgctgccatgggtgatttcctc
attgatgcaaagaaactatgctttgccctacctttgagagcatttacaatagaaagtcta
cttactcttccagtgaactga
>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_2|197_aa
MGKDFMSKTPKAMATKAKIEKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFATYSSDKGL
ISRIYNELKQIYKKKTNNPIKKWVKDINRHFSKEDIYAAKKHMKKCSSSLAIREIQIKTT
MRYHLTPVRMAIIKKSGKNRDMDEAGNHHSQQTIARTKNQTPHVLTHREPLDGIEAVMAM
GAEEEDAFSSMRDLGSS
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_2|594_bp
atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgaa
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg
aacaggcaacctacagaatgggagaaaatttttgcaacctactcatctgacaaagggcta
atatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatc
aaaaagtgggtgaaggatatcaacagacacttctcaaaagaagacatttatgcagccaaa
aaacacatgaaaaaatgctcatcatcactggccatcagggaaatacaaatcaaaaccaca
atgagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaaaaacagg
gacatggatgaagctggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaa
acacctcatgttctcactcacagggaacctctggatggaatagaagctgtcatggcaatg
ggagctgaagaagaagatgcattctcctcaatgagagaccttggcagcagctag
>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_3|216_aa
MKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELE
KQEQTHSKASRRQEITKIRAELKEIETQKTLQKINVSRSWFFEKINKINRSLARLIKKKR
EKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEE
VESLNRPVTGSEIEATINSLPTKKSPGPDGFTAKFY
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_3|651_bp
atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag
aatctctgggacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac
aagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaagaactagag
aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaagatcagagca
gaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgtatccaggagctgg
ttttttgaaaagatcaacaaaattaatagatcgctagcaagactaataaagaagaaaaga
gagaagaatcaaattgacgcaataaaaaatgataaaggggatatcaccaccgatcccaca
gaaatacaaactaccatcagagaatactataaacacctctatgcaaataaactagaaaat
ctagaagaaatggataaattcctcgacacatacaccctcccaagactaaaccaggaagaa
gttgaatctctgaatagaccagtaacaggctctgaaattgaagcaacaattaatagctta
ccaaccaaaaaaagtccaggaccagacggattcacagccaaattctactag
>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_4|129_aa
MRKKQSRKTGNSKNQSASPPPKECSSSPAMEQSWMETNFDRLREEGFRRSNYSELKEEVR
TNGKEVKNLEKKLGAWLTRITNAEKYLKDLMELKTMARELRDECISLSRQCDQLEERVSA
MEDEMNEMK
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_4|390_bp
atgaggaaaaaacagagcagaaaaactggaaactctaaaaatcagagtgcctctcctcct
ccaaaggaatgcagctcctcaccagcaatggaacaaagctggatggagactaactttgac
aggttgagagaagaaggtttcagaagatcgaactactccgagctaaaggaggaagttcga
accaatggcaaagaagttaaaaaccttgaaaaaaaattaggtgcatggctaactagaata
accaatgcagagaagtacttaaaggacctgatggagctgaaaaccatggcacgagaacta
cgtgacgaatgcataagcctcagtaggcaatgcgatcaactggaagaaagggtatcagcg
atggaagacgaaatgaatgaaatgaagtga
>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_5|71_aa
MKEDLTRGFYYLPQPFPSLNKILCLTILQVSGQPHSSWMPNRAQDSRSVVPNKGCHTGTL
PLLAEGRHPTG
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_5|216_bp
atgaaggaggatttaacaagggggttttattacttgccacagccatttccatcacttaat
aaaattctctgcctcaccatccttcaagtgtctgggcaacctcattcttcttggatgccg
aacagagctcaggactcacggagtgtggtgcccaacaaaggctgtcacactggcactttg
cccttgctggcagagggccgtcaccccaccggatga
>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_6|111_aa
MKPEYELRQSSCSTCVLNLHAYHVPALCCDGYLGRCPQKAHDTCSQEAHNVRLASWSEAL
PNIMSTPIIIYRKKRGKTQKKVKDDRRLDELIPRGKEVVGPTPYHPQRQQK
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_6|336_bp
atgaaaccagaatatgaactcaggcagtccagctgctcaacctgtgttcttaaccttcat
gcatatcatgtgccagccctgtgttgtgatgggtacttgggaagatgcccacaaaaagca
catgatacctgttctcaagaagctcacaatgttaggcttgcatcatggtctgaggctctc
ccaaacatcatgtctactcccatcatcatttacagaaaaaagaggggaaaaacacagaag
aaagtgaaagatgaccggagactggatgaacttatccccagaggaaaggaggtggtggga
cccacaccatatcaccctcagagacagcagaagtga
>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_7|172_aa
MQRSTEVLQWTALAVSSNSREPSLAAMEGVTKVASAAKTAEYREVGKHHVKDKELDIYKS
TVRVAMPDVWGPAAAASAASSAANPTLFILLLLPPTFLLPSPKLQMLPALQLCFPPAVLL
LHCHGHQARNTVVKFAAPSGAKFSIQVTGATESVATDACAGKGTSPLLQCPC
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_7|519_bp
atgcagaggagcactgaggtgctccaatggacagccctagctgtctcttcaaacagcagg
gagccttcactggcagccatggagggggttacaaaagtagcatcagcagcaaagactgct
gagtacagggaagttgggaaacatcatgtgaaagacaaggaactggatatttacaagtca
acagtcagggttgccatgcctgatgtctggggtccagctgctgctgccagtgctgcctct
tctgctgccaaccccactcttttcatcctcctgctgcttccaccaacatttctgctgcca
tcgccaaaattacagatgctgccagcactgcagctttgcttcccccctgctgtcttgttg
ctgcattgccatgggcatcaggcaaggaacaccgtggtgaaatttgcagcaccatccggt
gccaaatttagcatccaggtcactggtgccacggaaagcgttgccactgatgcttgtgcc
gggaaaggcacctctcctctgcttcagtgtccatgctaa
>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_8|769_aa
MVREVSALNALCPETEMQNFCLPDHSHKRTSAKAVRKGNVELEPPHRVPTGAPPSGAVRR
GPPSSRNQSGNSTNSLHCAPGKAADTQNQPVKAARREAISCKATGVEVPKTIGTYHLHQH
DLDSHVPINLKEQRPQTRVRAEDTPDVTGDLQPFTRVTVHWGKGNDQNFRGLLDPGFELM
LIPRDTKHHYSSPIKVGSYVCQVINEVLAQVQLTVGPVSPQTHPVVTSPVPECIIGIDIL
NSWQNPHIGYLTGKMTASMEGKAKWKPLELPLPRKIVNQKQYCISGGIAEISAIIKDLKY
AKKTDGSWRMTVDYHKFNQVVTPTASAVPDVVSLLEQINTSLGNWYAATDSANAIPFHKA
HQKQSAFSCQDQQYTFSVLPQGYINSLALYHDLVHKDLDRFSLPQDIILVHYIDDITLIG
SITPVIAQWAHEKSGHGNRDGGYAWAQQHGLPLTKADLAMTTTECPICWQQRPTLSPQYG
NIPWVINQLLARIHGSRNQGMEMEASFLTITRRHNNDSIKLEVKIATWPLWAPPTSKSTG
NKGRKEVIGLAGVIDPDYQDKISLLLHSGGWASNKGEIPKDKDGKPKQFAFVNFKHEVSV
PYAMNLLNGIKLYGRPIKIQFRSGSSHASQDVSLPYPQYHVGNSSPTSTTPSRYERTRDN
MTSSAQIIQRSFSSPENFQRQAVMNSALRQMSYGGKFASSPLDQSGFSPSVQSHSHSFNQ
SSSSQWRQGTPSSQRKVRMNSHPYLADRHYSREQRYTDHGSDHHYRGKR
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_8|2310_bp
atggtacgcgaggtctcagccctaaatgctctgtgccccgagactgagatgcagaatttc
tgcctgcctgatcactcccataagagaacctctgctaaggcagtgcggaagggaaatgtg
gagttggagcccccacacagagtccctactggggcaccacctagtggagctgtgagaaga
gggccaccatcctccagaaaccagagtggtaactccaccaacagcttgcactgtgcacct
ggaaaagctgcagacacccaaaaccagcctgtgaaagcagccaggagggaagcaatatcc
tgcaaagccacaggagtggaggtgccaaagactataggaacctaccacttgcatcagcat
gacctggattcccatgttcctataaatctgaaagagcaaagacctcaaacaagggtaaga
gcagaagacacccctgacgtaactggagatctccagccttttaccagggtaactgtgcac
tggggaaagggaaatgatcagaactttcggggactactggaccctggctttgagctgatg
ttgattccaagggacacaaaacatcattatagttctccaattaaagtggggtcttatgtt
tgtcaagtaattaatgaagttttagctcaggtccaacttacagtgggtccagtgagtccc
cagactcatcctgtggtcacttctccagtgccagaatgcataattggcatagacatactt
aacagctggcagaacccccacattggctacctgactggtaagatgacggctagtatggag
ggaaaggccaaatggaagccattagagctgcctctacctagaaagatagtaaatcaaaaa
caatattgcatctctggagggattgcagagatcagtgccatcatcaaggacttgaaatat
gcaaagaagacagatggatcttggagaatgacagtggattatcataagtttaaccaagtg
gtgactccaactgcatctgctgtaccagatgtggtttcattgcttgagcaaattaacaca
tctcttggtaactggtatgcagccactgattcggcaaatgccattccttttcataaggcc
caccagaagcaatctgccttcagctgtcaagaccagcaatataccttctctgtcctacct
caggggtacatcaactctctggcattgtatcatgatcttgttcacaaagatcttgatcgc
ttttcccttccacaagatatcatactggtccattacattgatgatattacgctgattgga
tccatcacccctgtcattgcccagtgggcccatgaaaaaagtggccatggtaacagagat
ggaggttatgcatgggctcagcaacacggacttccactcaccaaggctgacctggctatg
accaccactgagtgcccaatttgctggcagcagagaccaacactgagccctcaatatggc
aacattccttgggtaatcaaccagctacttgccaggattcatggttccaggaatcaaggg
atggaaatggaagcgtcatttctcaccatcaccagaagacacaacaacgattctattaaa
ctggaagttaagattgccacttggccactttgggctccccctacctctaagtcaacaggc
aacaaaggaagaaaggaagttatagggttggctggggtgattgacccagattatcaagat
aaaatcagtttactactccacagtggaggctgggccagtaataaaggtgaaattccaaaa
gataaggatggtaaaccaaagcagtttgcgtttgtgaatttcaaacatgaagtatctgtt
ccttatgcaatgaatctacttaatggaatcaaactttatggaaggcctatcaaaattcaa
tttagatcaggaagtagtcatgcctcgcaagatgtcagtttgccatatccccaatatcat
gttggaaattcaagccctacctccacaactcctagcaggtacgaaaggactagggataac
atgacttcatcagcacagataattcagagatctttctcttctccagaaaattttcagaga
caagcagtgatgaacagtgctttgagacaaatgtcatatggtggaaaatttgcttcttca
cctctggatcaatcaggattttcaccatcagttcaatcacacagtcatagttttaatcag
tcttcaagctcccagtggcgccaaggtacaccatcatcacagcgtaaagtcagaatgaat
tctcatccctacctagcagatagacattatagccgggaacagcgttacactgatcatggg
tctgaccatcattacagaggaaagagatga
>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_9|297_aa
MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSAEYTFFSAPHHTYS
KIDHTVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNCSTTWKLNNVLLNDYWV
HNEMKAEIKMFFETNKNKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
ELEKQEQTHSKTSRRQEITKIRGEVKEIETQKTLQKINESSSWFFEKINKIDGPLARQIK
KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLCANKLENLEEMDKFLDTYTLP
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_9|894_bp
atgggagactttaacaccccactgtcaacattagacagatcaacaagacagaaagttaac
aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctataga
actctccaccccaaatcagcagaatatacattcttttcagcaccacaccacacttattcc
aaaattgaccacacagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt
ataacaaactgtctctcagaccacagtgcaatcaaattagaactcaggattaagaaactc
actcaaaactgctcaactacgtggaaactgaacaatgtgctcctgaatgactactgggta
cataatgaaatgaaggcagaaataaaaatgttctttgaaaccaataagaacaaagacaca
acataccagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa
gaactagagaagcaagagcaaacacattcaaaaactagcagaaggcaagaaataactaag
atcagaggagaagtgaaggagatagagacacaaaaaacccttcaaaaaatcaatgaatcc
agtagctggttttttgaaaagatcaacaaaattgatgggccgctagcaagacaaataaag
aagaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccact
gatcccacagagatacaaactaccatcagagaatactataaacacctctgtgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgacacatacaccctcccatga
>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_10|143_aa
MRPRLMQMALQAAALMSPSRNCSKSQPGCGGIGTLSQWEYKMKLFLALCCSLRLSPPVVL
VKTHSLDPGGSIAILLHLAGSLQNEENTPPRVTIKKREQTQAGRGTARARLLFSCPIWIR
EASGEMEPNSEWEMPLFELTLRQ
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_10|432_bp
atgaggccacgtttgatgcagatggccctgcaggctgctgcattgatgtcccctagtaga
aactgcagcaaaagccagccaggatgtggaggaattgggactctcagtcagtgggaatat
aaaatgaaactgtttctggccctttgctgctccctccgcctctctcctccagtggtccta
gtgaaaacccatagcctggaccctggtggcagcatagctattctgctccatctagcagga
tccctgcagaatgaagaaaacacaccacccagagttaccatcaagaagagagaacagacc
caggctggcagaggaacagctcgggccagactgctgttcagctgccctatctggataagg
gaggcatctggggagatggagccaaactctgagtgggaaatgccattgtttgagttgaca
cttagacagtga
>gi568815596r:80062451_80263211|GENSCAN_predicted_peptide_11|59_aa
MKGLGNQFAQTQISGKTNIYRHRKAEIKACCGNETTSSPFLLEEPADDGGEKGTAGSFY
>gi568815596r:80062451_80263211|GENSCAN_predicted_CDS_11|180_bp
atgaaaggattaggaaatcaatttgctcaaacacagatcagtggaaagacgaacatctat
cgacacaggaaggctgaaataaaagcatgctgtggaaatgaaaccacatcatcacctttt
cttctggaagagcctgctgatgatggaggagagaagggcactgcaggctctttctactaa