GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:00:33

Sequence gi568815593f:69000806_69229614 : 228809 bp : 42.44% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4822   4972  151  0  1   90   92    68 0.565   7.66
 1.02 Intr +   5533   5635  103  2  1   28   74    23 0.127  -6.79
 1.03 Term +  10611  10776  166  0  1   29   48   202 0.447   6.71
 1.04 PlyA +  12012  12017    6                               1.05

 2.06 PlyA -  12796  12791    6                               1.05
 2.05 Term -  13454  13110  345  2  0    4   45   791 0.674  59.91
 2.04 Intr -  31523  31392  132  2  0   59  -12   140 0.005   1.02
 2.03 Intr -  33103  33018   86  2  2   79  107     2 0.089  -0.18
 2.02 Intr -  35545  35424  122  0  2   96   47    57 0.096   1.62
 2.01 Init -  45901  45837   65  1  2   77   65    43 0.008   1.57
 2.00 Prom -  60542  60503   40                              -6.55

 3.00 Prom +  60564  60603   40                              -6.05
 3.01 Init +  61654  61960  307  0  1   68   44   134 0.291   4.50
 3.02 Term +  68411  68523  113  0  2   62   54   125 0.600   4.24
 3.03 PlyA +  70435  70440    6                               1.05

 4.00 Prom +  72882  72921   40                              -4.65
 4.01 Init +  73529  73628  100  1  1   55   59    99 0.284   4.18
 4.02 Term +  73979  74562  584  0  2   20   47   377 0.990  20.47
 4.03 PlyA +  75622  75627    6                               1.05

 5.00 Prom +  81710  81749   40                              -4.95
 5.01 Init +  93451  93533   83  0  2   86   99   150 0.589  16.41
 5.02 Intr +  93675  93803  129  0  0   95   57    44 0.143   0.89
 5.03 Intr + 107544 107631   88  0  1   88   70    64 0.812   3.65
 5.04 Intr + 114432 114602  171  2  0   80   94    96 0.982   8.62
 5.05 Intr + 115121 115384  264  1  0   95   67   127 0.926   8.09
 5.06 Intr + 116434 116591  158  0  2   57   69    97 0.684   2.59
 5.07 Intr + 117694 117823  130  1  1   95   98     5 0.680   1.98
 5.08 Intr + 120889 121090  202  0  1   61   84    51 0.672  -0.06
 5.09 Intr + 122394 122620  227  1  2   94   79   123 0.972   8.78
 5.10 Intr + 127199 127327  129  1  0  100   89    65 0.976   7.77
 5.11 Intr + 128642 128808  167  1  2   73   47   100 0.034   2.24
 5.12 Intr + 151336 151462  127  1  1   56   64    61 0.011   0.26
 5.13 Intr + 166106 166383  278  1  2   71   28   156 0.080   3.29
 5.14 Intr + 166388 166478   91  2  1   47   84   110 0.941   5.58
 5.15 Intr + 167103 167273  171  2  0   86   88   189 0.993  17.92
 5.16 Intr + 167368 167538  171  0  0  103   25   172 0.998  11.62
 5.17 Intr + 170465 170647  183  1  0   73  107   181 0.896  17.56
 5.18 Intr + 173446 173604  159  0  0   73   99   105 0.932   9.36
 5.19 Intr + 174072 174308  237  2  0   69   77   156 0.930   9.49
 5.20 Intr + 174592 174732  141  0  0   36   94    97 0.810   4.73
 5.21 Intr + 176434 176544  111  0  0   42  101    66 0.871   2.96
 5.22 Intr + 176719 176820  102  0  0  127   39    66 0.932   5.05
 5.23 Intr + 180812 180896   85  1  1   47  101   104 0.954   6.07
 5.24 Intr + 181007 181137  131  0  2   49  117    77 0.549   6.19
 5.25 Intr + 188620 188704   85  0  1   28   35   113 0.182  -1.53
 5.26 Intr + 188762 188963  202  0  1   59    7   265 0.534  12.92
 5.27 Intr + 190990 191045   56  1  2  136  108    36 0.770   8.10
 5.28 Intr + 207391 207554  164  2  2   59   92   105 0.949   6.77
 5.29 Term + 208902 208994   93  2  0  111   38    81 0.981   2.25
 5.30 PlyA + 209533 209538    6                               1.05

 6.03 PlyA - 209682 209677    6                               1.05
 6.02 Term - 217597 217063  535  0  1   61   47   495 0.553  35.43
 6.01 Init - 220198 220080  119  1  2   68   74    68 0.785   3.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 128642 128812  171  1  0   73   50   118 0.952   3.34
S.002 Init + 165112 165190   79  0  1   53   86    39 0.843   1.47
S.003 Intr + 166074 166383  310  1  1   66   28   175 0.827   3.75

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:69000806_69229614|GENSCAN_predicted_peptide_1|139_aa
MARDPGVSQLTQDHHFSGCHLTSPNHSSTLPMSTNAMSTTSWVSHVVIPGARIENKFFLT
TCKSIVRLSNIQLTFKHIFGNPLDTRGIIPVRMTIEQPIAGAGFRTAFADRAVFCSLRSE
GDLSAGYKAAKPAGPREAL

>gi568815593f:69000806_69229614|GENSCAN_predicted_CDS_1|420_bp
atggccagggatcctggagtttctcagctcacacaggaccaccatttctcagggtgtcat
ttgaccagccccaaccactccagtactctgcccatgtccaccaatgccatgtccaccacc
tcatgggtgtcacatgtggttattcctggagctagaattgagaacaaattcttccttaca
acatgcaaatctattgttcgactttcaaacatccaactaactttcaaacacatttttggg
aacccattagatacgaggggcatcatccctgtgaggatgacaatcgagcagcccattgca
ggggctggctttcgcactgcctttgccgatagagctgtcttctgctctctgagatcagaa
ggagatctaagtgctggatataaagctgccaaaccagcaggcccaagagaggcactctaa

>gi568815593f:69000806_69229614|GENSCAN_predicted_peptide_2|249_aa
MQYTNARGCKSHGLAEVMLVVRVCSVPSGSSKNVDWLTITESAGLRKQEVVSLEEGSFSP
PPENEFQATPAYLCSSLFDSCVKCESPAKREDGEFSIEGGNCEPLAANTHSSQEMGALAS
SGDLGRATAAPPADGSETPLKKKKRKKRKKRKRKKRKRKKRKKRKKKKKEEEEEEEEEEE
EEEEEEEEEEEKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEERRRRRRRRRR
RRRRRRRRR

>gi568815593f:69000806_69229614|GENSCAN_predicted_CDS_2|750_bp
atgcagtacacaaatgcaagaggctgtaaaagccacggactagcagaagttatgttagtt
gtgagagtgtgttctgtacccagtggtagctcaaaaaatgttgactggttgacaatcact
gagtcagctgggttgaggaaacaagaagtggtgtccttggaggaaggcagcttttctcca
cctccagaaaatgaattccaagcaacacctgcctatttgtgttcctctctttttgatagt
tgtgtaaagtgtgagtcacctgcaaaacgggaggatggagaattctctatagaagggggc
aactgtgaaccattagctgctaacactcatagcagccaagagatgggtgcactggcttct
tcaggggatctaggccgagctacagcagcaccccctgcagatggcagtgagactccatta
aaaaaaaagaagaggaagaagaggaagaagaggaagaggaagaagaggaagaggaagaag
aggaagaagaggaagaagaagaagaaagaagaagaggaagaagaggaagaagaggaagag
gaagaagaggaagaagaggaagaagaggaagaaaaagaagaagaagaagaggaagaagaa
gaagaagaggaagaagaagaagaggaagaggaagaggaagaggaagaggaggaagaggaa
gaggaagaagaagaagaagaagaagaagaaagaagaagaagaagaagaagaagaagaaga
agaagaagaagaagaagaagaagaagatga

>gi568815593f:69000806_69229614|GENSCAN_predicted_peptide_3|139_aa
MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREQRVGGKVTDQQDPKAEEFFLVQNK
MKSLPCLPLSTQTRQPSDFSISSPPFPPFYSTKPPLSSWPVLTSTSLLSCCFTKEAPSPH
QTSTQAAPRAAEEGSSSGG

>gi568815593f:69000806_69229614|GENSCAN_predicted_CDS_3|420_bp
atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg
cccttaatccatttaaccctgagtggacacagcacatgtttcagagagcagagggttggg
ggtaaggtcacagatcaacaggatcccaaggcagaagaatttttcttagtacagaacaaa
atgaaaagtctcccatgtctacctctttctacacagacacggcaaccatccgatttctca
atctcttccccacctttcccccctttctattccacaaaaccgccattgtcatcatggccc
gttctcacaagcaccagtctcctcagctgctgcttcacaaaagaggcaccaagtccacac
caaacgagcacacaagcagccccaagagcagcagaagaaggaagcagcagtggtggatga

>gi568815593f:69000806_69229614|GENSCAN_predicted_peptide_4|227_aa
MFSKEKDKGSPLTMEESATGHARKAKDSRGMDGAGNFQQPGANRTCEGGSGSPEPSSREA
TFRERAAAQEKALPPTGVLLHSGTLPICGHPPTSSLSPPIPPAGDRHRGTVPSPRAPPQG
CHVAEEAGELTLGCSQAGLSSVAPPHPSYTHRPGCQPAGVSARGAFQHPPVLEELHLSPN
IFTRLLGAAFSALACMVHHLHLSEKLAWVPVEAFVGLQIQVNQSANP

>gi568815593f:69000806_69229614|GENSCAN_predicted_CDS_4|684_bp
atgttttccaaggaaaaggacaagggaagtccactcaccatggaagaaagtgccacaggg
catgctagaaaggccaaggacagcagagggatggatggagccggcaacttccagcagcca
ggagcaaacaggacttgtgaaggaggcagtggatctccagagcccagcagcagggaggct
actttccgggagagggcagcagcacaggagaaggccctgccgcctacaggggtcctactc
cactccgggaccctgcccatctgtggccatcctcccaccagcagcctgtcacctcccatc
cctcccgctggtgacaggcaccggggtaccgtgcccagcccccgggcacctccacagggc
tgccatgtggcagaggaagccggtgaactgacgcttggctgcagccaggcaggcctaagc
tctgttgccccgccgcatcccagctacacccacaggcctggatgccaaccagctggcgtc
agtgcccgcggtgccttccagcacccgcctgtcctggaggagctgcacctgtcccctaat
atcttcacccgcctcttaggggcagccttctcggccctggcgtgcatggtgcaccacctc
cacctctctgaaaagctggcctgggtgcccgtggaggccttcgtggggctgcagatccaa
gtgaaccaatccgccaacccgtga

>gi568815593f:69000806_69229614|GENSCAN_predicted_peptide_5|1444_aa
MEEKYGGDVLAGPGGGGGLGPVDVPSARLPPGSPGSSGACCRRKALKPKSFCLNFWVTRQ
TVSLRTARQFTTLLLFEHSDIVVISLLSVLFTSSGGGPAKGGVLLLVLALCCKVGFHTAS
RKLSVDVGGAKRLQALSHLVSVLLLCPWVIVLSVTTESKVESWFSLIMPFATVIFFVMIL
DFYVDSICSVKMEVSKCARYGSFPIFISALLFGNFWTHPITDQLRAMNKAAHQESTEHVL
SGGVVLFTFVELFYGVLTNSLGLISDGFHMLFDCSALVMGLFAALMSRWKATRIFSYGYG
RIEILSGFINGLFLIVIAFFVFMESVARLIDPPELDTHMLTPVSVGGLIVNLIGICAFSH
AHSHAHGASQGSCHSSDHSHSHHMHGHSDHGHGHSHGSAGGGMNANMRGVFLHVLADTLG
SIGVIVSTVLIEQFGWFIADPLCSLFIAILIFLSVVPLIKDACQVLLLRLPPEYEKELHI
ALEKIQKIEGLISYRDPHFWRHSASIVAGTIHIQVTSDVLEQRIVQQVTGILKDAGVNNL
TIQVEKEAYFQHMSGLSTGFHDVLAMTKQMESMKYCKDGTYIITFSVGLIHSPETILLLH
LEGYKSSCQPFGDWVGEGDWRSYHLVAAAAARERRRRGRPRESLARPSGLASLWPRPSRT
PSRDRPGNAFSATGSRQWEGSECHEQANKEGAVRGLNLRLGWLFSACCGGTAVGFCWVSL
AGRASGVLLLPAELLPGEEEAMALRVTRNSKINAENKAKINMAGAKRVPTAPAATSKPGL
RPRTALGDIGNKVSEQLQAKMPMKKEAKPSATGKVIDKKLPKPLEKVPMLVPVPVSEPVP
EPEPEPEPEPVKEEKLSPEPILVDTASPSPMETSGCAPAEEDLCQAFSDVILAVNDVDAE
DGADPNLCSEYVKDIYAYLRQLEEEQAVRPKYLLGREVTGNMRAILIDWLVQVQMKFRLL
QETMYMTVSIIDRFMQNNCVPKKMLQLVGVTAMFIASKYEEMYPPEIGDFAFVTDNTYTK
HQIRQMEMKILRALNFGLGRPLPLHFLRRASKIGEVDVEQHTLAKYLMELTMLDYDMVHF
PPSQIAAGAFCLALKILDNGEWTPTLQHYLSYTEESLLPVMQHLAKNVVMVNQGLTKHMT
VKNKYATSKHAKISTLPQLNSALVQDLAKAVAKRFPIVSEICSFRWVLGLVDFKNEATDP
RGVKPQTFDLCSECYSVTAHKDSTVLFSRWVHGLAGFRSEAADLPPSAVIFTSRARMKEE
PVVTTATEKQGGKVAGKATFSERVCLLSGSLSPQPAMEEQPQMQDADEPADSGGEGRAGG
PPQVAGAQAACSEDRMTLLLRLRAQTKQQLLEYKSMVDAKLKQASESKLLEIQTEKNKQK
IDLDSMENSERIKIIRQNLQMEIKITTVIQHVFQNLILGSKVNWAEDPALKEIVLQLEKN
VDMM

>gi568815593f:69000806_69229614|GENSCAN_predicted_CDS_5|4335_bp
atggaggagaaatacggcggggacgtgctggccggccccggcggcggcggcggccttggg
ccggtggacgtacccagcgctcggctgcctccgggctccccgggctcttcgggtgcctgc
tgcagaaggaaggcgctaaagccaaaaagcttctgcttgaatttctgggttacaaggcag
acagtcagtcttagaactgcccgccagttcacgactttgctgctatttgagcacagtgat
attgttgtcatttcactactcagtgttttgttcaccagttctggaggaggaccagcaaag
ggtggagtattattgctagtactggctttgtgttgtaaagttggttttcatacagcttcc
agaaagctctctgtcgacgttggtggagctaaacgtcttcaagctttatctcatcttgtt
tctgtgcttctcttgtgcccatgggtcattgttctttctgtgacaactgagagtaaagtg
gagtcttggttttctctcattatgccttttgcaacggttatcttttttgtcatgatcctg
gatttctacgtggattccatttgttcagtcaaaatggaagtttccaaatgtgctcgttat
ggatcctttcccatttttattagtgctctcctttttggaaatttttggacacatccaata
acagaccagcttcgggctatgaacaaagcagcacaccaggagagcactgaacacgtcctg
tctggaggagtggtactttttacctttgtggaattattctatggcgtgctgaccaatagt
ctgggcctgatctcggatggattccacatgctttttgactgctctgctttagtcatggga
ctttttgctgccctgatgagtaggtggaaagccactcggattttctcctatgggtacggc
cgaatagaaattctgtctggatttattaatggactttttctaatagtaatagcgtttttt
gtgtttatggagtcagtggctagattgattgatcctccagaattagacactcacatgtta
acaccagtctcagttggagggctgatagtaaaccttattggtatctgtgcctttagccat
gcccatagccatgcccatggagcttctcaaggaagctgtcactcatctgatcacagccat
tcacaccatatgcatggacacagtgaccatgggcatggtcacagccacggatctgcgggt
ggaggcatgaatgctaacatgaggggtgtatttctacatgttttggcagatacacttggc
agcattggtgtgatcgtatccacagttcttatagagcagtttggatggttcatcgctgac
ccactctgttctctttttattgctatattaatatttctcagtgttgttccactgattaaa
gatgcctgccaggttctactcctgagattgccaccagaatatgaaaaagaactacatatt
gctttagaaaagatacagaaaattgaaggattaatatcataccgagaccctcatttttgg
cgtcattctgctagtattgtggcaggaacaattcatatacaggtgacatctgatgtgcta
gaacaaagaatagtacagcaggttacaggaatacttaaagatgctggagtaaacaattta
acaattcaagtggaaaaggaggcatactttcaacatatgtctggcctaagtactggattt
catgatgttctggctatgacaaaacaaatggaatccatgaaatactgcaaagatggtact
tacatcataactttttctgttgggctgatccattccccagaaaccattctcttacttcac
ttagaagggtataagtccagctgccagccttttggggactgggtaggagaaggcgactgg
aggtcttaccatttggtggccgctgcagctgcccgagagcgcaggcgcagaggcagacca
cgtgagagcctggccaggccttccggcctagcctcactgtggccccgcccctctcgaacg
ccttcgcgcgatcgccctggaaacgcattctctgcgaccggcagccgccaatgggaaggg
agtgagtgccacgaacaggccaataaggagggagcagtgcggggtttaaatctgaggcta
ggctggctcttctcggcgtgctgcggcggaacggctgttggtttctgctgggtgtccttg
gctggtcgggcctccggtgttctgcttctccccgctgagctgctgcctggtgaagaggaa
gccatggcgctccgagtcaccaggaactcgaaaattaatgctgaaaataaggcgaagatc
aacatggcaggcgcaaagcgcgttcctacggcccctgctgcaacctccaagcccggactg
aggccaagaacagctcttggggacattggtaacaaagtcagtgaacaactgcaggccaaa
atgcctatgaagaaggaagcaaaaccttcagctactggaaaagtcattgataaaaaacta
ccaaaacctcttgaaaaggtacctatgctggtgccagtgccagtgtctgagccagtgcca
gagccagaacctgagccagaacctgagcctgttaaagaagaaaaactttcgcctgagcct
attttggttgatactgcctctccaagcccaatggaaacatctggatgtgcccctgcagaa
gaagacctgtgtcaggctttctctgatgtaattcttgcagtaaatgatgtggatgcagaa
gatggagctgatccaaacctttgtagtgaatatgtgaaagatatttatgcttatctgaga
caacttgaggaagagcaagcagtcagaccaaaatacctactgggtcgggaagtcactgga
aacatgagagccatcctaattgactggctagtacaggttcaaatgaaattcaggttgttg
caggagaccatgtacatgactgtctccattattgatcggttcatgcagaataattgtgtg
cccaagaagatgctgcagctggttggtgtcactgccatgtttattgcaagcaaatatgaa
gaaatgtaccctccagaaattggtgactttgcttttgtgactgacaacacttatactaag
caccaaatcagacagatggaaatgaagattctaagagctttaaactttggtctgggtcgg
cctctacctttgcacttccttcggagagcatctaagattggagaggttgatgtcgagcaa
catactttggccaaatacctgatggaactaactatgttggactatgacatggtgcacttt
cctccttctcaaattgcagcaggagctttttgcttagcactgaaaattctggataatggt
gaatggacaccaactctacaacattacctgtcatatactgaagaatctcttcttccagtt
atgcagcacctggctaagaatgtagtcatggtaaatcaaggacttacaaagcacatgact
gtcaagaacaagtatgccacatcgaagcatgctaagatcagcactctaccacagctgaat
tctgcactagttcaagatttagccaaggctgtggcaaagagatttcctattgtgtccgaa
atttgttccttccggtgggttcttggtctcgttgacttcaagaatgaagccacggaccct
cgcggagtgaagccgcagaccttcgacctttgcagtgagtgttacagtgttacagctcat
aaagacagcactgttctcttctcccggtgggttcatggtctcgctggcttcaggagtgaa
gctgcagaccttccgccgtctgcagttattttcaccagtagagcccggatgaaagaggag
cccgtagtaaccacggcaaccgaaaaacaaggcggaaaggtggcgggaaaagcgaccttt
tctgagcgcgtttgcctgttgagtggtagcctttcccctcaaccagcaatggaggagcag
ccccagatgcaagacgccgacgagcccgcggactccggaggggaaggccgggcaggcggg
ccaccgcaggtcgccggcgcccaggcggcgtgcagcgaggaccgcatgaccctgctcctc
aggctgagagcacagacaaaacaacaactcttagaatataaatcaatggttgatgcaaaa
ttaaaacaagcttcagaaagtaagcttttagaaatacagactgaaaagaacaaacagaag
attgatttggacagtatggaaaactcagagaggataaagatcatacgacaaaacctacag
atggagataaaaattactactgttattcaacatgtgttccagaaccttattttggggagt
aaagtcaattgggcagaggatcctgcccttaaggaaattgttctgcagcttgagaagaat
gttgacatgatgtaa

>gi568815593f:69000806_69229614|GENSCAN_predicted_peptide_6|217_aa
MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHSTFQRARGWGQTLGEVGENPLRRGRSTRGQ
LPDHSRDLEGTVGQIQGSLPSTPRYRLPSDWLIGGTTANPRRRLARLPPGSPHPHTPKTN
KVLAEERGIPAVGTGPTCQDAARLPQGSNPRRRTHLTARSNRLKVPSSHTPAGAVTGPPQ
RRGHRSASATAFTPRRRRHSQHRPYRPRCGNATNCFT

>gi568815593f:69000806_69229614|GENSCAN_predicted_CDS_6|654_bp
atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg
cccttaatccatttaaccctgagtggacacagcacgtttcagagagcacggggttggggc
cagacgcttggagaggttggggagaacccgctccggcgaggacgcagtacgcggggccag
ttgcccgaccattcccgagaccttgaagggacagtgggtcagatccagggttcgcttcct
tccacgccccgctaccgccttccttccgattggctgatcgggggcaccacagccaatcct
agacggcgattggccaggctaccgccagggtcgcctcacccgcacacgcctaagactaac
aaggtcctggcggaggagcgggggatcccggcagtaggtacaggacctacctgccaagat
gctgcacggttgccgcaaggctcgaaccctcggcgccggacccacctcactgcccggtcc
aacaggctcaaagtcccatcctcacacacacccgcgggcgccgtcactggccctccgcag
aggcgcggtcaccgctctgcctctgccaccgcctttactccccgacgccgtcgccatagt
cagcaccgtccctaccgcccaagatgcggaaacgcgacaaattgctttacctga