GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:31:18 Sequence gi568815578f:47522248_47753072 : 230825 bp : 44.78% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 5077 5116 40 -3.06 1.01 Sngl + 32339 32896 558 1 0 60 42 247 0.981 13.43 1.02 PlyA + 34795 34800 6 1.05 2.04 PlyA - 35064 35059 6 1.05 2.03 Term - 50720 50704 17 0 2 119 53 6 0.709 -1.40 2.02 Intr - 51361 51316 46 1 1 93 111 8 0.728 1.58 2.01 Init - 57390 57334 57 0 0 60 95 26 0.565 1.81 2.00 Prom - 71660 71621 40 -1.56 3.00 Prom + 85056 85095 40 -4.46 3.01 Init + 100001 100083 83 1 2 64 73 72 0.987 3.74 3.02 Intr + 101664 101836 173 0 2 85 85 98 0.568 8.79 3.03 Intr + 103134 103238 105 1 0 115 14 73 0.243 2.79 3.04 Intr + 105314 105502 189 0 0 48 116 33 0.356 1.66 3.05 Intr + 111249 111389 141 1 0 100 98 20 0.933 4.52 3.06 Intr + 113644 113733 90 2 0 77 70 62 0.797 3.27 3.07 Intr + 113773 113931 159 2 0 91 78 50 0.647 4.26 3.08 Intr + 114067 114515 449 2 2 90 76 254 0.717 17.47 3.09 Intr + 115401 115536 136 2 1 69 86 31 0.949 1.14 3.10 Intr + 117678 117804 127 1 1 97 47 45 0.964 1.04 3.11 Intr + 119978 120137 160 2 1 46 96 184 0.961 14.99 3.12 Intr + 124826 125119 294 1 0 79 91 90 0.882 5.61 3.13 Intr + 126758 126850 93 1 0 64 99 42 0.476 3.06 3.14 Intr + 128738 129029 292 1 1 45 54 377 0.735 26.71 3.15 Intr + 130159 130333 175 2 1 47 103 70 0.935 3.40 3.16 Intr + 130684 130825 142 1 1 71 80 77 0.912 5.56 3.17 Term + 132671 132703 33 1 0 109 41 21 0.606 -2.91 3.18 PlyA + 133535 133540 6 1.05 4.29 PlyA - 133629 133624 6 1.05 4.28 Term - 134196 134165 32 1 2 87 48 17 0.216 -4.38 4.27 Intr - 135130 135005 126 2 0 42 111 29 0.387 1.25 4.26 Intr - 138787 138617 171 2 0 35 99 98 0.525 5.51 4.25 Intr - 139649 139526 124 2 1 68 92 168 0.978 15.36 4.24 Intr - 140965 140823 143 2 2 90 92 176 0.982 18.27 4.23 Intr - 141375 141206 170 2 2 83 68 351 0.990 32.19 4.22 Intr - 143046 142952 95 0 2 90 94 82 0.978 7.76 4.21 Intr - 143706 143610 97 2 1 66 105 247 0.547 24.21 4.20 Intr - 144241 144013 229 2 1 111 109 332 0.998 34.53 4.19 Intr - 150146 149951 196 2 1 44 89 328 0.993 27.49 4.18 Intr - 154376 154247 130 1 1 93 110 219 0.994 25.30 4.17 Intr - 156557 156429 129 1 0 114 85 298 0.999 31.91 4.16 Intr - 160922 160747 176 2 2 103 84 339 0.998 33.74 4.15 Intr - 162334 162184 151 0 1 75 78 364 0.725 34.26 4.14 Intr - 168048 167879 170 0 2 117 77 206 0.853 21.14 4.13 Intr - 178306 178241 66 1 0 79 94 27 0.729 1.50 4.12 Intr - 180423 180272 152 1 2 105 121 219 0.958 26.78 4.11 Intr - 184132 184014 119 0 2 88 59 9 0.513 -1.89 4.10 Intr - 186289 186213 77 1 2 103 68 58 0.802 3.61 4.09 Intr - 187391 187256 136 1 1 16 82 80 0.488 0.77 4.08 Intr - 193301 193153 149 2 2 97 38 73 0.131 2.23 4.07 Intr - 199643 199553 91 1 1 42 75 88 0.524 2.90 4.06 Intr - 201779 201652 128 2 2 108 60 60 0.467 4.68 4.05 Intr - 204745 204614 132 1 0 89 59 59 0.345 3.94 4.04 Intr - 213592 213498 95 2 2 45 93 30 0.336 -1.12 4.03 Intr - 214695 214456 240 1 0 110 99 468 0.959 47.62 4.02 Intr - 225783 225666 118 0 1 10 90 93 0.501 1.74 4.01 Intr - 227233 227129 105 1 0 78 58 64 0.592 2.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:47522248_47753072|GENSCAN_predicted_peptide_1|185_aa MSELPFIIASKRIKYLGIQLTRDVKDLFKESYKPLLNEIKEDTNKWKNVPCSWVGRINIV KKAILPKVIYRFNAIPIELPMTFFTELEKTTLKFIWNQKGAHIAKSILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMLHIYKYLIFDKADKNKQWGKDSLFT LLKFQ >gi568815578f:47522248_47753072|GENSCAN_predicted_CDS_1|558_bp atgagtgaactcccattcataattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaagacctcttcaaggagagctacaaaccactgctcaatgaaataaaa gaggatacaaacaaatggaagaacgttccatgctcgtgggtaggaagaatcaatatcgtg aaaaaggccatactgcccaaggtaatttatagattcaatgccatccccatcgagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaagga gcccacattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagaccaatggaacagaacagagccctcagaaataatgctgcatatctacaag tatctgatctttgacaaagctgacaaaaacaagcaatggggaaaggattccctatttaca ctcttgaaatttcaatga >gi568815578f:47522248_47753072|GENSCAN_predicted_peptide_2|39_aa MIGLPTRNLERSTTIYQDPILRFLQIEVCGSPASGLLIP >gi568815578f:47522248_47753072|GENSCAN_predicted_CDS_2|120_bp atgattggcttgcctaccaggaacctggaaaggagtactactatataccaggatcctata ttgcgctttttacaaattgaggtttgtggcagccctgcttcaggcctccttattccctga >gi568815578f:47522248_47753072|GENSCAN_predicted_peptide_3|946_aa MSGLGENLDPLASDSRKRKLPCDTPGQGLTCSGEKRRREQESKYIEELAELISANLSDID NFNVKPDKCAILKETVRQIRQIKEQGKTISNDDDVQKADVSSTGQGVIDKDSLGPLLLQA INGVSWTNETQRQKSHTFNCRMLMKTPHDILEDINASPEMRQRYETMQCFALSQPRAMME EGEGKVVNIDTNSLRSSMRPGFEDIIRRCIQRFFSLNDGQSWSQKRHYQEGVHSPMASSG NTGNHSFSSSSLSALQAISEGPKLDNSPNMNITQPSKVSNQDSKSPLGFYCDQNPVESSM CQSNSRDHLSDKENSSCKESSVSVTSPSGVSSSTSGGVSSTSNMHGSLLQEKHRILHKLL QNGNSPAEVAKITAEATGKDTSSITSCGDGNVVKQEQLSPKKKENNALLRYLLDRDDPSD ALSKELQPQVEGVDNKMSQCTSSTIPSSSQEKDPKIKTETSEEGSGDLDNLDAILGDLTS SDFYNNSISSNGSHLGTKQQVFQGTNSLGPGEIPMGMGANPYGQAAASNQLGSWPDGMLS MEQVSHGTQNRNSLDDLVGPPSNLEGQSDERALLDQLHTLLSNTDATGLEEIDRALGIPE LVNQGQALEPKQDAFQGQEAAVMMDQKAGLYGQTYPAQGPPMQGGFHLQGQSPSFNSMMN QMNQQGNFPLQGMHPRANIMRPRTNTPKQLRMQLQQRLQGQQFLNQSRQALELKMENPTA GGAAVMRPMMQPQGFLNAQMVAQRSRELLSHHFRQQRVAMMMQQQQQQQQQQQQQQQQQQ QQQQQQQQQQQTQAFSPPPNVTASPSMDGLLAGPTMPQAPPQQFPYQPNYGMGQQPDPAF GRVSSPPNAMMSSRMGPSQNPMMQHPQAASIYQSSEMKGWPSGNLARNSSFSQQQFAHQG NPAVYSMVHMNGSSGHMGQMNMNPMPMSGMPMGPDQERFQILVAKT >gi568815578f:47522248_47753072|GENSCAN_predicted_CDS_3|2841_bp atgagtggattaggagaaaacttggatccactggccagtgattcacgaaaacgcaaattg ccatgtgatactccaggacaaggtcttacctgcagtggtgaaaaacggagacgggagcag gaaagtaaatatattgaagaattggctgagctgatatctgccaatcttagtgatattgac aatttcaatgtcaaaccagataaatgtgcgattttaaaggaaacagtaagacagatacgt caaataaaagagcaaggaaaaactatttccaatgatgatgatgttcaaaaagccgatgta tcttctacagggcagggagttattgataaagactccttaggaccgcttttacttcaggca attaatggagtttcctggacaaatgagacccaaagacaaaaaagccatacatttaattgc cgtatgttgatgaaaacaccacatgatattctggaagacataaacgccagtcctgaaatg cgccagagatatgaaacaatgcagtgctttgccctgtctcagccacgagctatgatggag gaaggggaaggaaaggttgtcaatatagatacaaattcactgagatcctccatgaggcct ggctttgaagatataatccgaaggtgtattcagagattttttagtctaaatgatgggcag tcatggtcccagaaacgtcactatcaagaaggtgtgcactctcccatggcatcttctggc aatactgggaaccacagcttttccagcagctctctcagtgccctgcaagccatcagtgaa ggccccaaattggataactctcccaatatgaatattacccaaccaagtaaagtaagcaat caggattccaagagtcctctgggcttttattgcgaccaaaatccagtggagagttcaatg tgtcagtcaaatagcagagatcacctcagtgacaaagaaaattcaagttgtaaagaatct tctgttagtgtcaccagcccctctggagtctcctcctctacatctggaggagtatcctct acatccaatatgcatgggtcactgttacaagagaagcaccggattttgcacaagttgctg cagaatgggaattcaccagctgaggtagccaagattactgcagaagccactgggaaagac accagcagtataacttcttgtggggacggaaatgttgtcaagcaggagcagctaagtcct aagaagaaggagaataatgcacttcttagatacctgctggacagggatgatcctagtgat gcactctctaaagaactacagccccaagtggaaggagtggataataaaatgagtcagtgc accagctccaccattcctagctcaagtcaagagaaagaccctaaaattaagacagagaca agtgaagagggatctggagacttggataatctagatgctattcttggtgatctgactagt tctgacttttacaataattccatatcctcaaatggtagtcatctggggactaagcaacag gtgtttcaaggaactaattctctggggcctggtgaaatccccatgggaatgggggctaat ccctatggccaagcagcagcatctaaccaactgggttcctggcccgatggcatgttgtcc atggaacaagtttctcatggcactcaaaataggaattccctggatgatcttgttgggcca ccttccaacctggaaggccagagtgacgaaagagcattattggaccagctgcacactctt ctcagcaacacagatgccacaggcctggaagaaattgacagagctttgggcattcctgaa cttgtcaatcagggacaggcattagagcccaaacaggatgctttccaaggccaagaagca gcagtaatgatggatcagaaggcaggattatatggacagacatacccagcacaggggcct ccaatgcaaggaggctttcatcttcagggacaatcaccatcttttaactctatgatgaat cagatgaaccagcaaggcaattttcctctccaaggaatgcacccacgagccaacatcatg agaccccggacaaacacccccaagcaacttagaatgcagcttcagcagaggctgcagggc cagcagtttttgaatcagagccgacaggcacttgaattgaaaatggaaaaccctactgct ggtggtgctgcggtgatgaggcctatgatgcagccccagggttttcttaatgctcaaatg gtcgcccaacgcagcagagagctgctaagtcatcacttccgacaacagagggtggctatg atgatgcagcagcagcagcagcagcaacagcagcagcagcagcagcagcagcagcaacag caacagcaacagcaacagcagcaacagcagcaaacccaggccttcagcccacctcctaat gtgactgcttcccccagcatggatgggcttttggcaggacccacaatgccacaagctcct ccgcaacagtttccatatcaaccaaattatggaatgggacaacaaccagatccagccttt ggtcgagtgtctagtcctcccaatgcaatgatgtcgtcaagaatgggtccctcccagaat cccatgatgcaacacccgcaggctgcatccatctatcagtcctcagaaatgaagggctgg ccatcaggaaatttggccaggaacagctccttttcccagcagcagtttgcccaccagggg aatcctgcagtgtatagtatggtgcacatgaatggcagcagtggtcacatgggacagatg aacatgaaccccatgcccatgtctggcatgcctatgggtcctgatcaggagcgttttcag attttggttgctaagacctga >gi568815578f:47522248_47753072|GENSCAN_predicted_peptide_4|1248_aa PLITRGEKTCGCRLSQEEERLIRTHTGVQLAKHIPVKRVAQGQERKQGDQVVFAAVQVGD AGGTEQDGALQSERGSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGKYVHNH NTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTVLDLQPSRLGSFLSDTQCGLAGMLWC GEQASKLWSWVGSSVAALKISSMRSLPFPTMVSLKESPPAVGVETAAGSKNFQGPFLPQA LCMCSYLCLEHWPTTILVQATGFVEDSFSTDWGSQAKGDTLIVQANLMGLVQDSELERAS SEHASHYEVFVGSVVYPFIPQTFIEVSSLTKLCSRCEDTGDKGLRKPLPSWGWADSLQLA QLGSHAFASTNQFVHDHQGNAVLMGVKDLHTFLESTVKNVSKAVDLFPGKCTKGKIWHAM FQERKGLVTSAVYCPPRRRYGRAWREARSVTEVEPPRMEGSVRAFFGKYLNEYNGSYVPP GWKEWVGLLKNSRFYNYTLCRNGVKEKHGSDYSKGCFVGHVAVFHYGFTTKVFEDHDYLT DLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHITPSYNYA PNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGELDNTYIV YTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCLNPHIVLNIDLAPTILDIA GLDIPADMDGKSILKLLDTERPVNRKLLHKRDNDKVDAQEENFLPKYQRVKDLCQRAEYQ TACEQLGQKWQCVEDATGKLKLHKCKGPMRLGGSRALSNLVPKYYGQGSEACTCDSGDYK LSLAGRRKKLFKKKYKASYVRSRSIRSVAIEVDGRVYHVGLGDAAQPRNLTKRHWPGAPE DQDDKDGGDFSGTGGLPDYSAANPIKVTHRCYILENDTVQCDLDLYKSLQAWKDHKLHID HEIETLQNKIKNLREVRGHLKKKRPEECDCHKIRKGLQEKDKVWLLREQKRKKKLRKLLK RLQNNDTCSMPGLTCFTHDNQHWQTAPFWTLGPFCACTSANNNTYWCMRTINETHNFLFC EFATGFLEYFDLNTDPYQLMNAVNTLDRDVLNQLHVQLMELRSCKGYKQCNPRTRNMDLG GEWDFMGENSEDGPVIQGEMVVAWTRGGSSKDDGFQIQVVKPPDNMVMGGAGLGVPVFKI LPVNIFLEENALGLEDFLKRGVDGCRLTKKSTILQGTTAAPTSGSKNF >gi568815578f:47522248_47753072|GENSCAN_predicted_CDS_4|3747_bp ccgctgatcactagaggagaaaaaacatgtggatgcaggttatctcaggaagaagagagg ctcattcggacccacactggtgttcagcttgccaagcacattcctgtgaagagggtggcc cagggacaagagcggaagcagggagatcaggtggtgtttgcagcagtccaggtgggagat gctggtggtacggagcaggacggggcactgcagagcgagagaggttccatgcaggtgatg aacaagacccggcgcatcatggagcagggcggggcgcacttcatcaacgccttcgtgacc acacccatgtgctgcccctcacgctcctccatcctcactggcaagtacgtccacaaccac aacacctacaccaacaatgagaactgctcctcgccctcctggcaggcacagcacgagagc cgcacctttgccgtgtacctcaatagcactggctaccggacagtgctggacctgcagccc tcgaggctgggcagcttcctctcagatacacagtgtgggctggcagggatgctttggtgt ggggagcaagccagtaagctgtggtcctgggtggggagcagtgtggccgccctgaagatc agctccatgcgtagccttcccttcccaaccatggtttctctcaaggaatctccacctgca gtgggcgttgaaacagctgcagggtcaaaaaacttccaaggtccattcctgccccaggcc ctttgtatgtgcagttacctctgtctggagcactggcccacaaccattctggtacaagca accggtttcgtggaagacagtttttccacggactggggcagccaggcaaaaggagatacc ctgattgtccaggccaacctcatgggcctggtccaggattcagagctagaaagagcctca tcagaacatgcctcccattacgaggtgttcgttggttcagttgtttacccatttattcca caaacgttcatcgaggtctcgtcactgaccaagctctgttctagatgtgaggacacaggg gacaagggactcagaaagccgctgccctcatggggctgggctgactcgctgcagttggcc cagcttgggtcacatgcctttgcctcaactaaccagtttgtgcatgatcaccaggggaat gcggtcctcatgggagtcaaggatttgcacacattcctggagtcaacagtcaagaatgtg tcaaaagctgtggacttgtttccaggaaaatgcacaaaaggcaaaatctggcatgcaatg ttccaggagcggaaaggtttggtgacctctgctgtatattgtccccctcggaggaggtac gggagagcatggagagaggccaggagtgttacagaagtggagcccccccgcatggagggc agcgttagagctttcttcgggaagtatcttaatgaatacaacggctcctacgtgccaccc ggctggaaggagtgggtcggactccttaaaaactcccgcttttataactacacgctgtgt cggaacggggtgaaagagaagcacggctccgactactccaagggctgctttgttgggcat gtagcggtgtttcattatggttttaccacaaaggtttttgaggatcacgattacctcaca gacctcatcaccaatgacagcgtgagcttcttccgcacgtccaagaagatgtacccgcac aggccagtcctcatggtcatcagccatgcagccccccacggccctgaggattcagcccca caatattcacgcctcttcccaaacgcatctcagcacatcacgccgagctacaactacgcg cccaacccggacaaacactggatcatgcgctacacggggcccatgaagcccatccacatg gaattcaccaacatgctccagcggaagcgcttgcagaccctcatgtcggtggacgactcc atggagacgatttacaacatgctggttgagacgggcgagctggacaacacgtacatcgta tacaccgccgaccacggttaccacatcggccagtttggcctggtgaaagggaaatccatg ccatatgagtttgacatcagggtcccgttctacgtgaggggccccaacgtggaagccggc tgtctgaatccccacatcgtcctcaacattgacctggcccccaccatcctggacattgca ggcctggacatacctgcggatatggacgggaaatccatcctcaagctgctggacacggag cggccggtgaatcgcaagctgctacacaagagagacaatgacaaggtggacgcccaggag gagaactttctgcccaagtaccagcgtgtgaaggacctgtgtcagcgtgctgagtaccag acggcgtgtgagcagctgggacagaagtggcagtgtgtggaggacgccacggggaagctg aagctgcataagtgcaagggccccatgcggctgggcggcagcagagccctctccaacctc gtgcccaagtactacgggcagggcagcgaggcctgcacctgtgacagcggggactacaag ctcagcctggccggacgccggaaaaaactcttcaagaagaagtacaaggccagctatgtc cgcagtcgctccatccgctcagtggccatcgaggtggacggcagggtgtaccacgtaggc ctgggtgatgccgcccagccccgaaacctcaccaagcggcactggccaggggcccctgag gaccaagatgacaaggatggtggggacttcagtggcactggaggccttcccgactactca gccgccaaccccattaaagtgacacatcggtgctacatcctagagaacgacacagtccag tgtgacctggacctgtacaagtccctgcaggcctggaaagaccacaagctgcacatcgac cacgagattgaaaccctgcagaacaaaattaagaacctgagggaagtccgaggtcacctg aagaaaaagcggccagaagaatgtgactgtcacaaaatcaggaagggcctgcaagagaag gacaaggtgtggctgttgcgggagcagaagcgcaagaagaaactccgcaagctgctcaag cgcctgcagaacaacgacacgtgcagcatgccaggcctcacgtgcttcacccacgacaac cagcactggcagacggcgcctttctggacactggggcctttctgtgcctgcaccagcgcc aacaataacacgtactggtgcatgaggaccatcaatgagactcacaatttcctcttctgt gaatttgcaactggcttcctagagtactttgatctcaacacagacccctaccagctgatg aatgcagtgaacacactggacagggatgtcctcaaccagctacacgtacagctcatggag ctgaggagctgcaagggttacaagcagtgtaacccccggactcgaaacatggacctgggt ggagaatgggactttatgggagaaaacagcgaagatggcccagttatccagggagaaatg gtggtggcttggactagaggtgggagcagtaaagatgatggatttcagattcaggtggtg aagcccccagacaacatggtgatgggaggtgcgggattaggagtcccagtattcaagata ctaccagtcaacatctttttggaagaaaatgccttgggtttagaagattttcttaaaagg ggagtagatggttgtagattgactaaaaagtctaccatacttcaagggactacagctgca cccacctctggatcaaagaacttctag