GENSCAN 1.0	Date run: 3-Nov-116	Time: 15:10:53

Sequence gi568815595r:12484517_12718721 : 234205 bp : 44.27% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   4684   4723   40                              -2.16
 1.01 Init +   5285   5473  189  1  0   65  106   197 0.625  18.21
 1.02 Intr +   7620   7701   82  2  1   57   98    23 0.456  -0.59
 1.03 Intr +  12608  12725  118  0  1  104   77    22 0.487   2.22
 1.04 Intr +  18746  19268  523  2  1   98  100   282 0.766  23.35
 1.05 Intr +  34543  34681  139  0  1   37  102   142 0.734  10.54
 1.06 Intr +  44372  44408   37  0  1  121   58    41 0.992   1.72
 1.07 Intr +  45246  45357  112  0  1  105   87    81 0.991  10.08
 1.08 Intr +  49743  49782   40  2  1   60   90    64 0.565   1.60
 1.09 Term +  49964  49974   11  0  2  102   39     7 0.422  -4.44
 1.10 PlyA +  51956  51961    6                               1.05

 2.03 PlyA -  52153  52148    6                               1.05
 2.02 Term -  55917  55677  241  2  1   74   54   283 0.991  19.00
 2.01 Init -  57417  57344   74  0  2   78   86    75 0.668   6.84
 2.00 Prom -  58649  58610   40                              -5.86

 3.00 Prom +  69979  70018   40                              -4.96
 3.01 Init +  72553  72716  164  0  2   75   78   117 0.790   6.62
 3.02 Intr +  78190  78378  189  1  0   60  101    76 0.793   4.80
 3.03 Intr +  84359  84481  123  2  0  109  -14    92 0.303   0.80
 3.04 Intr +  85555  85736  182  1  2   76   81    37 0.466   1.31
 3.05 Intr +  87553  87788  236  2  2   91   85   359 0.350  33.31
 3.06 Intr +  90276  90490  215  2  2   82   97   143 0.960  12.11
 3.07 Intr +  92115  92225  111  0  0   82   98    59 0.981   5.79
 3.08 Intr +  97292  97436  145  2  1  134   86    96 0.967  14.28
 3.09 Term +  97600  97737  138  0  0   66   43   206 0.999  11.86
 3.10 PlyA +  98830  98835    6                               1.05

 4.15 PlyA -  99106  99101    6                               1.05
 4.14 Term - 100141  99998  144  1  0   59   33   130 0.981   2.51
 4.13 Intr - 100465 100331  135  1  0   98  110    36 0.982   7.56
 4.12 Intr - 100737 100606  132  0  0   20  111   195 0.756  15.84
 4.11 Intr - 103121 102998  124  1  1  118  -22    13 0.093  -6.11
 4.10 Intr - 106458 106282  177  2  0  130   64   156 0.266  16.43
 4.09 Intr - 107276 107192   85  0  1   70  101    58 0.995   4.18
 4.08 Intr - 115292 115175  118  2  1   69  115    53 0.881   6.14
 4.07 Intr - 115763 115636  128  0  2   39   68    58 0.515  -0.70
 4.06 Intr - 115899 115872   28  0  1  113   80    35 0.829   2.89
 4.05 Intr - 119773 119620  154  0  1   82   75    52 0.745   3.37
 4.04 Intr - 124407 124250  158  0  2   90   61   184 0.846  14.71
 4.03 Intr - 124819 124717  103  0  1   63   78    60 0.818   2.68
 4.02 Intr - 127546 127434  113  1  2   87   90    39 0.807   3.18
 4.01 Init - 134205 133999  207  0  0   64   68   243 0.999  18.72
 4.00 Prom - 136784 136745   40                              -3.56

 5.00 Prom + 137132 137171   40                              -0.86
 5.01 Init + 154548 154686  139  2  1   39   72    89 0.430   2.70
 5.02 Intr + 154837 155239  403  2  1   52   49   187 0.012   4.69
 5.03 Intr + 179188 179370  183  1  0   56   76    62 0.091   0.70
 5.04 Term + 179578 179746  169  1  1   59   43   202 0.442  10.15
 5.05 PlyA + 180768 180773    6                               1.05

 6.04 PlyA - 183888 183883    6                               1.05
 6.03 Term - 215712 215664   49  2  1   67   55    68 0.098  -1.82
 6.02 Intr - 225668 225556  113  2  2   -9  107    93 0.350   0.68
 6.01 Init - 225856 225725  132  1  0   67   94   156 0.845  12.29

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 154837 155243  407  2  2   52   49   193 0.942   7.15

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:12484517_12718721|GENSCAN_predicted_peptide_1|416_aa
MAEAVFHAPKRKRRVYETYESPLPIPFGQDHGPLKEFKIFRAEMINNNVIVRNAEDIEQL
YGKGYFGKGILSRSRPSFTISDPKLVAKWKALFGVQHFPHVSYVGLDCRCLESRDQALLL
SAVPHFVVPWYQHSVEWAAELMRRQGQDESTVRRILKDYTKPLEHPPVKRNEEAQVHDKL
NSGMVSNMEGTAGGERPSVVNGDSGKSGGVGDPREPLGCLQEGSGCHPTTESFEKSVRED
ASPLPHVCCCKQDALILQRGLHHEDGSQHIGLLHPGDRGPDHEYVLVEEAECAMSEREAA
PNEEEPLTIVKLWKAFTVVQPTFRTTYMAYHYFRSKGWVPKVGLKYGTDLLLYRKGPPFY
HASYSVIIELVDDHFEGSLRRPLSWKSLAALSRVSVNVSKCQKDSMTTTYGSKACG

>gi568815595r:12484517_12718721|GENSCAN_predicted_CDS_1|1251_bp
atggcagaagcagttttccatgccccaaagaggaaaagaagagtgtatgagacttacgag
tctccattgccaatcccttttggtcaggaccatggtcctctgaaagaattcaagatattc
cgtgctgaaatgattaacaacaatgtgattgtgaggaatgcggaggacattgagcagctc
tatgggaaaggttattttggaaaaggtattctttcaagaagccgtccaagcttcacaatt
tcagatcctaaactggttgctaaatggaaagccctcttcggcgtgcagcactttcctcat
gtgtcatacgtgggtttagactgtcggtgcttggagagcagagatcaggcccttctcctc
tctgcagtgccacactttgtggtcccttggtatcagcatagtgttgagtgggcagcagag
ctgatgcgtagacaggggcaggatgagagtacagtgcgcagaatcctcaaggattacacg
aaaccgcttgagcatcctcctgtgaaaaggaatgaagaggctcaagtgcatgacaagctt
aactctggaatggtttccaacatggaaggcacagcagggggagagagaccttctgtggta
aacggggactctggaaagtcaggtggtgtgggtgatccccgtgagccattaggctgcctg
caggagggctctggctgccacccaacaacagagagctttgagaaaagcgtgcgagaggat
gcctcacctctgccccatgtctgttgctgcaaacaagatgctctcatcctccagcgtggc
cttcatcatgaagacggcagccagcacatcggcctcctgcatcctggggacagagggcct
gaccatgagtacgtgctggtcgaggaagcggagtgtgccatgagcgagagggaggctgcc
ccaaatgaggaagagcctttaacgatagtgaagctctggaaagctttcactgtagttcag
cccacgttcagaaccacctacatggcctaccattactttcgaagcaagggctgggtgccc
aaagtgggactcaagtacgggacagatttactgctatatcggaaaggccctccattttac
catgcaagttattctgtcattatcgagctagttgatgaccattttgaaggctctctccgc
aggcctctcagttggaagtccctggctgccttgagcagagtttccgttaatgtctctaag
tgtcagaaagattccatgacgaccacctatggatccaaggcatgtggctaa

>gi568815595r:12484517_12718721|GENSCAN_predicted_peptide_2|104_aa
MYGMMEQWDKYLEDFSTSGAWLPHRYEDNHHNCYSYALTFINCVLMAEGRQQLDKGEFTE
KYVVPRTRLASKFITLYRAIREHGFYVTDCPQQQAQPPEGGGLC

>gi568815595r:12484517_12718721|GENSCAN_predicted_CDS_2|315_bp
atgtatggaatgatggagcaatgggacaagtacctggaagacttctccacctcgggggcc
tggctgcctcacaggtatgaagacaaccaccataactgctactcttacgcactcacgttc
attaactgcgttctgatggcagaaggtagacagcaactggacaagggtgaatttacggag
aagtacgtggtcccgcggacaaggctggcatccaagttcatcacactctaccgggcgata
cgggagcatggcttctacgtcactgactgtccccagcagcaggcacaaccccctgagggc
ggcggtttgtgctga

>gi568815595r:12484517_12718721|GENSCAN_predicted_peptide_3|500_aa
MGRARAKAEAAAAARGGGTTTVPQPSHHEHQADHLQVSALEPGASGRSPRPQGGRSAAAL
DSNRSTNPIVSCACEGSTLCASYENLMPDELSLSPMTPRWDHLVAEKQAQCSTDSTLWYF
MHGVCREGSQCLFSHDLANSKPSTICKYYQKGYCAYGTRYDHTRPSAAAGGAVGTMAHSV
PSPAFHSPHPPSEVTASIVKTNSHEPGKREKRTLVLRDRNLSGMAERKTQPSMVSNPGSC
SDPQPSPEMKPHSYLDAIRSGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHGEIC
MLTFEHEMEKAFAFQASQDKVCSICMEVILEKASASERRFGILSNCNHTYCLSCIRQWRC
AKQFENPIIKSCPECRVISEFVIPSVYWVEDQNKKNELIEAFKQGMGKKACKYFEQGKGT
CPFGSKCLYRHAYPDGRLAEPEKPRKQLSSQGTVRFFNSVRLWDFIENRESRHVPNNEDV
DMTELGDLFMHLSGVESSEP

>gi568815595r:12484517_12718721|GENSCAN_predicted_CDS_3|1503_bp
atgggccgggccagggccaaggccgaggcggcagcggctgcgagaggcggcggcacgacg
acggtccctcagcccagccaccatgagcaccaagcagatcacttgcaggtcagtgcgctg
gagccaggagcttcgggccgctcccccaggccgcaggggggccgatcagcagcagcatta
gattctaatagaagcactaaccctattgtcagctgtgcatgtgagggatctacgttgtgt
gcttcttatgagaatctaatgcctgatgaactgtcactgtctcccatgacccccagatgg
gaccatctagttgcagaaaaacaagctcagtgttccactgattctacactatggtatttt
atgcatggtgtgtgtcgggaaggaagtcagtgcctattctcacatgacttggcaaacagc
aaaccgtccaccatctgcaagtactaccagaagggctactgtgcctatggaactcgatat
gaccacacgaggccctctgctgcagctggaggtgctgtgggcaccatggcccacagtgtg
ccctccccagctttccacagtcctcaccctccttccgaggtcactgcatccattgtgaaa
actaactcacatgaacccggaaagcgtgaaaagagaacattggttcttagagaccgaaat
ctctctggcatggctgaaaggaagacccagccgagcatggtgagtaatccaggcagctgc
agcgacccccagcccagccccgagatgaagccgcattcctacctggatgccatcaggagt
ggccttgatgacgtggaggccagcagctcctacagcaacgagcagcagctgtgcccctac
gcagctgctggggagtgccggtttggggatgcctgtgtctacctgcacggggagatctgc
atgttgacgttcgaacacgagatggaaaaggcctttgccttccaggcaagccaggacaaa
gtgtgcagtatctgcatggaagtgatcctggagaaggcctctgcttctgagaggagattt
gggattctctccaattgcaatcacacgtactgtttgtcctgcatccggcagtggcggtgt
gccaaacagtttgaaaacccaatcattaagtcttgtccagaatgccgtgtgatatcagag
tttgtaattccaagtgtgtattgggtggaagatcagaataaaaagaacgagttgattgaa
gctttcaaacaggggatggggaaaaaagcctgtaaatactttgagcaaggcaaggggacc
tgcccatttggaagcaaatgtctttatcgccatgcttaccccgatgggcggctagcagag
cctgagaaacctcggaaacagctcagttctcaaggcactgtgaggttctttaattcagtg
cggctctgggatttcatcgagaaccgagaaagccggcatgtccccaacaatgaagatgtc
gacatgacagagctcggggacctcttcatgcacctttctggagtggaatcatcagaaccc
taa

>gi568815595r:12484517_12718721|GENSCAN_predicted_peptide_4|601_aa
MEHIQGAWKTISNGFGFKDAVFDGSSCISPTIVQQFGYQRRASDDGKLTDPSKTSNTIRV
FLPNKQRTVVNVRNGMSLHDCLMKALKVRGLQPECCAVFRLLHEHKGKKARLDWNTDAAS
LIGEELQVDFLDHVPLTTHNFARKTFLKLAFCDICQKFLLNGFRCQTCGYKFHEHCSTKV
PTMCVDWSNIRQLFSQHRYSTPHAFTFNTSSPSSEGSLSQRQRSTSTPNVHMVSTTLPVD
SRMIEDAIRSHSESASPSALSSSPNNLSPTGWSQPKTPVPAQRERAPVSGTQEKNKIRPR
GQRDSSYYWEIEASEVMLSTRIGSGSFGTVYKGKWHGDVAVKILKVVDPTPEQFQAFRNE
VAVLRKTRHVNILLFMGYMTKDNLAIVTQWCEGSSLYKHLHVQETKFQMFQLIDIARQTA
QGMDYLHAKNIIHRDMKSNSILWLLSSFDCSVLNLGKQKGGFLSQAPEVIRMQDNNPFSF
QSDVYSYGIVLYELMTGELPYSHINNRDQIIFMVGRGYASPDLSKLYKNCPKAMKRLVAD
CVKKVKEERPLFPQILSSIELLQHSLPKINRSASEPSLHRAAHTEDINACTLTTSPRLPV
F

>gi568815595r:12484517_12718721|GENSCAN_predicted_CDS_4|1806_bp
atggagcacatacagggagcttggaagacgatcagcaatggttttggattcaaagatgcc
gtgtttgatggctccagctgcatctctcctacaatagttcagcagtttggctatcagcgc
cgggcatcagatgatggcaaactcacagatccttctaagacaagcaacactatccgtgtt
ttcttgccgaacaagcaaagaacagtggtcaatgtgcgaaatggaatgagcttgcatgac
tgccttatgaaagcactcaaggtgaggggcctgcaaccagagtgctgtgcagtgttcaga
cttctccacgaacacaaaggtaaaaaagcacgcttagattggaatactgatgctgcgtct
ttgattggagaagaacttcaagtagatttcctggatcatgttcccctcacaacacacaac
tttgctcggaagacgttcctgaagcttgccttctgtgacatctgtcagaaattcctgctc
aatggatttcgatgtcagacttgtggctacaaatttcatgagcactgtagcaccaaagta
cctactatgtgtgtggactggagtaacatcagacaactcttttctcagcacagatattct
acacctcacgccttcacctttaacacctccagtccctcatctgaaggttccctctcccag
aggcagaggtcgacatccacacctaatgtccacatggtcagcaccaccctgcctgtggac
agcaggatgattgaggatgcaattcgaagtcacagcgaatcagcctcaccttcagccctg
tccagtagccccaacaatctgagcccaacaggctggtcacagccgaaaacccccgtgcca
gcacaaagagagcgggcaccagtatctgggacccaggagaaaaacaaaattaggcctcgt
ggacagagagattcaagctattattgggaaatagaagccagtgaagtgatgctgtccact
cggattgggtcaggctcttttggaactgtttataagggtaaatggcacggagatgttgca
gtaaagatcctaaaggttgtcgacccaaccccagagcaattccaggccttcaggaatgag
gtggctgttctgcgcaaaacacggcatgtgaacattctgcttttcatggggtacatgaca
aaggacaacctggcaattgtgacccagtggtgcgagggcagcagcctctacaaacacctg
catgtccaggagaccaagtttcagatgttccagctaattgacattgcccggcagacggct
cagggaatggactatttgcatgcaaagaacatcatccatagagacatgaaatccaacagt
atcctttggttgttgagttcatttgactgctcggttctaaatttagggaaacagaaggga
ggctttctatcacaagccccagaggtgatccgaatgcaggataacaacccattcagtttc
cagtcggatgtctactcctatggcatcgtattgtatgaactgatgacgggggagcttcct
tattctcacatcaacaaccgagatcagatcatcttcatggtgggccgaggatatgcctcc
ccagatcttagtaagctatataagaactgccccaaagcaatgaagaggctggtagctgac
tgtgtgaagaaagtaaaggaagagaggcctctttttccccagatcctgtcttccattgag
ctgctccaacactctctaccgaagatcaaccggagcgcttccgagccatccttgcatcgg
gcagcccacactgaggatatcaatgcttgcacgctgaccacgtccccgaggctgcctgtc
ttctag

>gi568815595r:12484517_12718721|GENSCAN_predicted_peptide_5|297_aa
MQGWFNIHNPSYKQNQNQNYVIISIDAEKAFNKIQQPFMLKTLNKPVSEVLARAIRQEKE
IKGIQLGKEEVKLSLFADDMIVYLENLIVSAQNLLKLTGNFSKVSGYKINVQKSQAFLYT
NNRQTESQIMSELSFTIASKRIKYLGIRLTRDVKDLFKENYKPLLNKIKEDTNGRTFHAH
GRGAAPSAPPRPGRPGSRSLSPAARLPRHPRPGPATYLREPGRPNVLSFGGSFSPAPPPR
GGWRPKRPRLRLSGGPDERALDSVLVLPTCDATGSPILREPYWQSTSNLARASIALA

>gi568815595r:12484517_12718721|GENSCAN_predicted_CDS_5|894_bp
atgcaaggctggttcaacatacataatccatcatataaacagaaccaaaatcaaaactac
gtgattatctcaatagatgcagaaaaggccttcaacaaaattcaacagcccttcatgcta
aaaactctcaataaaccagtgtcggaagttctggccagagcaatcaggcaagagaaagaa
ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatg
attgtatatttagaaaacctcatcgtctcagcccaaaatctccttaagctgacaggcaac
ttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcccaagcattcctatacacc
aataacagacaaacagagagccaaatcatgagtgaactctcattcacaattgcttcaaag
agaataaaatacctaggaatccgacttacaagggatgtgaaggacctcttcaaggagaac
tacaaaccactgctcaacaaaataaaagaggacacaaatggaagaacattccatgctcat
ggccgcggggccgctccatcagcgccacccaggcccggtcgccctggaagccgatccctc
agccccgccgcccggctcccccggcatccacgacccggtcctgccacctacctgagggag
ccaggccgccccaacgtcctgtcgttcggcggcagcttctcgcccgctcctcctccccgc
ggcggatggcggcccaagcgcccgcgattaagactctcgggcggcccagacgagcgagcc
ctcgactcggtgctcgtcctcccgacctgcgacgccaccggctctccgattctgcgcgag
ccctactggcagtcgacttctaacttggctcgggcatccatcgctctggcctga

>gi568815595r:12484517_12718721|GENSCAN_predicted_peptide_6|97_aa
MVAKTPMSSPSVTPACLLRLTLAAAFGLPYDQPANRKKAKLDPQPHSAGTLKDTNEKESS
QQAELWAVHMLNHFVAKRNNLRPPGSTSGSYRKVTVG

>gi568815595r:12484517_12718721|GENSCAN_predicted_CDS_6|294_bp
atggtggccaagacccccatgtcatcccccagtgtgacacctgcatgcctccttcggctc
accctggcagctgcttttgggctcccttatgaccagccagcaaacaggaaaaaggccaag
cttgatccacagcctcactcagcagggacccttaaagacactaatgagaaggaatcttcc
caacaggccgaactttgggcagtgcacatgctcaaccacttcgtggccaagagaaataac
ctgagacctccagggtcaactagtgggtcctaccgcaaggtcactgttggctga