GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:37:56 Sequence gi568815596r:42251222_42461161 : 209940 bp : 40.86% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5280 5409 130 1 1 80 75 127 0.961 9.43 1.02 Intr + 11957 12085 129 2 0 52 108 105 0.955 7.79 1.03 Intr + 13485 13510 26 0 2 104 94 50 0.912 3.95 1.04 Intr + 29629 29746 118 2 1 84 -12 105 0.169 -1.30 1.05 Intr + 33413 33482 70 2 1 81 96 56 0.897 3.97 1.06 Intr + 35048 35158 111 1 0 100 92 49 0.976 6.16 1.07 Intr + 37006 37101 96 0 0 85 94 53 0.959 4.89 1.08 Intr + 43904 44038 135 1 0 79 78 22 0.512 0.14 1.09 Intr + 44160 44295 136 2 1 76 87 74 0.956 5.32 1.10 Intr + 46252 46419 168 2 0 57 89 59 0.525 1.90 1.11 Intr + 50020 50171 152 2 2 64 94 62 0.984 3.36 1.12 Intr + 51883 52008 126 0 0 88 115 93 0.999 11.96 1.13 Intr + 53263 53330 68 0 2 107 91 39 0.926 2.88 1.14 Intr + 64741 64829 89 1 2 77 68 62 0.639 1.80 1.15 Intr + 74933 75031 99 0 0 44 87 92 0.033 3.86 1.16 Intr + 77665 77795 131 2 2 41 92 98 0.777 4.99 1.17 Term + 78513 78986 474 2 0 56 41 483 0.997 34.30 1.18 PlyA + 81296 81301 6 1.05 2.00 Prom + 81758 81797 40 -6.85 2.01 Init + 82180 82264 85 0 1 52 47 77 0.501 1.03 2.02 Intr + 82389 82631 243 1 0 31 26 191 0.023 3.75 2.03 Intr + 85199 85351 153 0 0 60 37 97 0.021 0.92 2.04 Intr + 86572 86739 168 2 0 57 -11 219 0.005 7.90 2.05 Intr + 90255 90378 124 1 1 115 31 68 0.003 2.52 2.06 Intr + 91088 91192 105 2 0 123 96 28 0.057 5.51 2.07 Intr + 92692 92827 136 1 1 47 2 96 0.181 -3.45 2.08 Intr + 94874 95075 202 1 1 111 82 115 0.924 11.14 2.09 Term + 97104 97243 140 1 2 50 42 108 0.347 -0.46 2.10 PlyA + 98397 98402 6 1.05 3.07 PlyA - 98515 98510 6 1.05 3.06 Term - 100138 99998 141 1 0 72 44 193 0.998 10.25 3.05 Intr - 102122 101991 132 2 0 126 94 97 0.997 14.02 3.04 Intr - 110051 109869 183 2 0 -18 96 160 0.041 5.26 3.03 Intr - 117924 117647 278 1 2 80 111 69 0.119 4.71 3.02 Intr - 133725 133641 85 0 1 60 80 99 0.008 4.77 3.01 Init - 147774 147712 63 0 0 51 72 77 0.134 3.60 3.00 Prom - 148023 147984 40 -3.25 4.00 Prom + 149890 149929 40 -6.85 4.01 Init + 156369 156818 450 2 0 74 69 270 0.575 17.66 4.02 Intr + 157775 157967 193 1 1 25 89 97 0.611 1.64 4.03 Term + 160722 160936 215 1 2 21 44 163 0.634 1.61 4.04 PlyA + 165001 165006 6 1.05 5.04 PlyA - 167308 167303 6 1.05 5.03 Term - 173952 173712 241 2 1 86 55 194 0.549 10.41 5.02 Intr - 193358 192750 609 1 0 117 48 307 0.032 20.20 5.01 Init - 194382 194264 119 0 2 70 105 62 0.703 5.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 193358 192713 646 1 1 117 38 296 0.942 20.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:42251222_42461161|GENSCAN_predicted_peptide_1|752_aa XQPSPRAVIPMSCITNGSGANRKPSHTSAVSIAGKETLSSAAKSIKRPSPAEKSHNSWEN SDDSRNKLSKIPSTPKLIPKVTKTADKHKDVIINQEGEYIKMFMRGRPITMFIPSDVDNY DDIRTELPPEKLKLDLAIHPDKIRIATGQIAGVDKDGRPLQPHVRVWDSVTLSTLQIIGL GTFERGVGCLDFSKADSGVHLCIIDDSNEHMLTVWDWQKKAKGAEIKTTNEVVLAVEFHP TDANTIITCGKSHIFFWTWSGNSLTRKQGIFGKYEKPKFVQCLAFLGNGDVLTGDSGGVM LIWSKTTVEPTPGKGPKGNRKSNLVNCWAHEMACVVLCHMTNPIMGKVAKNLRFSIGWEE FGERSEINGSPDQSVYQISKQIKAHDGSVFTLCQMRNGMLLTGGGKDRKIILWDHDLNPE REIEVPDQYGTIRAVAEGKADQFLVGTSRNFILRGTFNDGFQIEVQEPGHCADFHPSGTV VAIGTHSGRWFVLDAETRDLVSIHTDGNEQLSVMRYSIGDIPNGCKLIRNRSDCKDIDWT TYTCVLGFQVFGVWPEGSDGTDINALVRSHNRKVIAVADDFCKVHLFQYPCSKAKAPSHK YSAHSSHVTNVSFTHNDSHLISTGGKDMSIIQWKLVEKLSLPQNETVADTTLTKAPVSST ESVIQSNTPTPPPSQPLNETAEEESRISSSPTLLENSLEQTVEPSEDHSEEESEEGSGDL GEPLYEEPCNEISKEQAKATLLEDQQDPSPSS >gi568815596r:42251222_42461161|GENSCAN_predicted_CDS_1|2259_bp ngccaaccaagccctcgagcagttattcccatgtcctgtataaccaatggaagtggtgca aacagaaaaccaagtcataccagtgctgtctcaattgcaggaaaagaaactctttcatct gctgctaaaagcataaaacgaccatcaccagctgaaaagtcacataattcttgggaaaat tcagatgatagccgtaataaattgtcgaaaataccttcaacacccaaattaataccaaaa gttaccaaaactgcagacaagcataaagatgtcatcatcaaccaagaaggagaatatatt aaaatgtttatgcgcggtcggccaattaccatgttcattccttccgatgttgacaactat gatgacatcagaacggaactgcctcctgagaagctcaaactggaccttgctatacatcct gacaaaattaggattgcaactggacagatagctggcgtggataaagatggaaggcctcta caaccccacgtcagagtgtgggattctgttactctatccacactgcagattattggactt ggcacttttgagcgtggagtaggatgcctggatttttcaaaagcagattcaggtgttcat ttatgtattattgatgactccaatgagcatatgcttactgtatgggactggcagaagaaa gcaaaaggagcagaaataaagacaacaaatgaagttgttttggctgtggagtttcaccca acagatgcaaataccataattacatgcggtaaatctcatattttcttctggacctggagc ggcaattcactaacaagaaaacagggaatttttgggaaatatgaaaagccaaaatttgtg cagtgtttagcattcttggggaatggagatgttcttactggagactcaggtggagtcatg cttatatggagcaaaactactgtagagcccacacctgggaaaggacctaaaggaaacagg aaatccaatcttgtaaattgctgggcacatgaaatggcctgtgttgtactttgccacatg actaaccctattatgggcaaagttgctaagaatttgaggttttcaattggctgggaagaa tttggtgagcgcagtgaaataaatggaagtccagaccagagtgtatatcaaatcagcaaa caaatcaaagctcatgatggcagtgtgttcacactttgtcagatgagaaatgggatgtta ttaactggaggagggaaagacagaaaaataattctgtgggatcatgatctgaatcctgaa agagaaatagaggttcctgatcagtatggcacaatcagagctgtagcagaaggaaaggca gatcaatttttagtaggcacatcacgaaactttattttacgaggaacatttaatgatggc ttccaaatagaagtacaggaaccaggacactgtgcagattttcatccaagtggcacagtg gtggccataggaacgcactcaggcaggtggtttgttctggatgcagaaaccagagatcta gtttctatccacacagacgggaatgaacagctctctgtgatgcgctactcaataggggac attccaaatggctgcaaactaatcaggaatcgatcggattgtaaggacattgattggacg acatatacctgtgtgctaggatttcaagtatttggtgtctggccagaaggatctgatggg acagatatcaatgcactggtgcgatcccacaatagaaaggtgatagctgttgccgatgac ttttgtaaagtccatctgtttcagtatccctgctccaaagcaaaggctcccagtcacaag tacagtgcccacagcagccatgtcaccaatgtcagttttactcacaatgacagtcacctg atatcaactggtggaaaagacatgagcatcattcagtggaaacttgtggaaaagttatct ttgcctcagaatgagactgtagcggatactactctaaccaaagcccccgtctcttccact gaaagtgtcatccaatctaatactcccacaccgcctccttctcagcccttaaatgagaca gctgaagaggaaagtagaataagcagttctcccacacttctggagaacagcctggaacaa actgtggagccaagtgaagaccacagcgaggaggagagtgaagagggcagcggagacctt ggtgagcctctttatgaagagccatgcaacgagataagcaaggagcaggccaaagccacc cttctggaggaccagcaagacccttcgccctcgtcctaa >gi568815596r:42251222_42461161|GENSCAN_predicted_peptide_2|451_aa MDQEELPLEEYLLWAQHLPIPRCSGIQGGERHSLDIEGKDQGRFTDLLWKGDKGNRRESV SVLVLSLGRGESTTQQCRTSVSPSLKPCWKSPFVQDAHDWSLQQPQKQNDLIEGTEKKVS ARRKGLREEKEFLNMQSLQPGHPISCLTDHSNQALVPPRKGNMEKKMSLTTTKPFETLPA RVHLQCSTDQSEEAVRTVGPSGAHWLSSARSATMGPSMGLLFASHVVQSSQQCSPSPPAA VGAHILQLKEQVLTQSDGRAETRDLSSLLGKMPSETTPISAPTYSLALQAGTSFTAKSHW MKGPRRRAKASQTLKGNLADRKSKARDKWALLYKDKCKPGTKQLTGPVEDTSPKWDNNIL SLLELRACCQQLDNNFLPNPMRNRESWSAETELNSCIIKRVTERTGYLLTAKGKGYQLDK GEIRQTQLQITNNGTNWYQVPPDITRTPYHL >gi568815596r:42251222_42461161|GENSCAN_predicted_CDS_2|1356_bp atggaccaagaagagctgcccctggaggagtacctactgtgggcccaacatctcccaata cccaggtgttcagggatacagggaggagaaagacattcattagacatagaaggaaaggac cagggtagattcacagacttactttggaaaggagataagggaaacagaagagagtctgtg tcagtcttggtcctcagtctgggacgaggtgagtccacgacacagcagtgccgcacatct gtatcaccatcactgaagccctgctggaaaagtccctttgtccaagatgcccacgactgg agtctccagcagccacagaaacagaatgatctcattgaaggcactgagaagaaagtgagt gcccgaagaaagggcctcagggaggaaaaggaatttcttaatatgcaaagccttcagcct ggtcatcccatctcatgcttgactgatcacagtaaccaagcccttgtgcctcctaggaaa gggaatatggagaagaaaatgagccttactaccacgaagccttttgaaactctccctgca cgtgtgcacctgcagtgctccactgaccagtcagaggaggcagtgcggaccgtgggacca agcggtgcccactggctttctagcgcccgctctgctaccatgggtccctccatgggattg ctttttgcaagtcatgtggttcaatctagccagcagtgctcccccagccctcctgctgcc gtgggcgcccacatccttcagctcaaagaacaggtcctgactcagagtgatgggagagct gagacgagggacttgagctctctgcttggcaagatgccctcagaaaccaccccaatttct gcacccacttattccttggcccttcaggcaggcaccagcttcacagcaaagtcacactgg atgaaggggccgagaaggagagcgaaagctagtcagaccctgaaagggaatctggctgac agaaagagcaaagccagggataagtgggcactgctttacaaggacaagtgtaaacctggc accaaacagctgactggcccagtggaggacacctcacccaaatgggacaataacattctc tctctcctggaattaagggcatgttgccagcagctggacaacaatttccttccaaatcca atgagaaacagagaaagctggtctgcagagacagaattaaacagctgcataatcaagaga gtgacagaaaggacaggctacttattaactgcaaaggggaaagggtatcaacttgacaag ggagaaatccggcagacacaacttcaaatcaccaataatgggacaaactggtatcaagtg ccacctgacataacgaggacaccatatcacctataa >gi568815596r:42251222_42461161|GENSCAN_predicted_peptide_3|293_aa MNRQFTKDEPYKAMNIEKMLRLPCSSNSLADDHSPETGKAGTLQSWGKAAASRCPKQEDI TVIGHREHGWGVCQAGQCREAKNSGRKHNKVRPAQNGKHSSRATTHFPSGSGGESSQGGR EMDGNCWGCEKPGALTGVVIDLSQHRVLALSLGTTAGEAGSFSGAVALAADAGSRTLGVM YYKFSGFTQKLAGAWASEAYSPQGLKPVVSTEAPPIIFATPTKLTSDSTVYDYAGKNKVP ELQKFFQKADGVPVYLKRGLPDQMLYRTTMALTVGGTIYCLIALYMASQPKNK >gi568815596r:42251222_42461161|GENSCAN_predicted_CDS_3|882_bp atgaacaggcagttcacaaaagatgagccttacaaggcaatgaacattgaaaagatgctg aggcttccttgcagcagcaacagccttgctgacgaccacagtccagaaactggcaaggct ggcaccttgcagagctgggggaaggcagcagcaagcagatgcccaaaacaggaggacatc acggtgataggccacagagaacatggctggggcgtgtgccaggcaggacaatgcagggaa gccaaaaacagtgggagaaaacacaataaagtgagacccgcccagaatggtaaacacagc agcagagctaccacacacttcccttctgggagcggtggggaaagcagccagggaggtcgg gagatggacggaaactgctggggatgtgagaagcctggggcactaactggagttgttatt gacttgtcccagcaccgggtccttgcgctgagtctcgggaccacagccggggaggcgggg tccttctctggggcggtcgcgttggcagcggatgcgggaagccggactctgggcgtcatg tactacaagtttagtggcttcacgcagaagttggcaggagcatgggcttcggaggcctat agcccgcagggattaaagcctgtggtttccacagaagcaccacctatcatatttgccaca ccaactaaactgacctccgattccacagtgtatgattatgctgggaaaaacaaagttcca gagctacaaaagtttttccagaaagctgatggtgtgcccgtctacctgaaacgaggcctg cctgaccaaatgctttaccggaccaccatggcgctgactgtgggagggaccatctactgc ctgatcgccctctacatggcttcgcagcccaaaaacaaatga >gi568815596r:42251222_42461161|GENSCAN_predicted_peptide_4|285_aa MAVDPGLLLLQGAPLGGQLQLPKSQLQTRASLCSCRCQEQAGALPSLVQLQTRASLCSCR CQKQAGALPSLVQLQTRASMQQAGTPSPPTLGQLQPPKPWLWTKASLHSWGPRKPHLTLQ ARKCLLPLPDFSLLLVPTPISEQSWGEDSQPHREPVPVLAPGAAHPTAAASMSDCVQWPD PTLTHTPLAIPCLTCPWWCGIQAGSKNQMQPARPTNSRYDLALMNQIHTYIQEQEDGKDV EPLVLLLWWLASQVELTTPGLNIPVSSHYLCGHEKIVVAAAVTAS >gi568815596r:42251222_42461161|GENSCAN_predicted_CDS_4|858_bp atggctgtggacccaggcctcctgctcctccagggagccccgcttggggggcagctgcag ctgcccaaatcacagctgcagacccgggcctccctgtgctcttgcaggtgccaagagcaa gcaggagccctgccctccctggtgcagcttcagacccgggcctccctgtgctcttgcagg tgccaaaagcaagcaggagccctgccctccctggtgcagcttcagacccgggcctccatg cagcaggcaggaaccccatccccgccaactctggggcagctgcagccacccaaaccgtgg ctgtggaccaaggcatccctacactcttgggggcccagaaagccccacctgaccttgcag gctcggaaatgcctgctcccactgcctgacttctccctgctgttggtgcccacgccgatc tcagaacaaagctggggtgaagacagccagcctcacagagagccagtgcctgtgctggca cctggagctgcccaccccactgcagcagccagcatgtctgactgtgtgcagtggccagac cccacactcactcacacaccccttgccattccatgtctgacttgcccttggtggtgtggg atccaggctggcagcaagaaccaaatgcagcctgccaggccaacaaattctagatatgat ttagctttgatgaatcaaatacatacatacatacaagagcaggaagatggaaaagacgtg gagccccttgtcctcctgctgtggtggctggcaagccaggttgaactgactacccctgga ctcaatattccagtgtctagtcactacctttgtggacatgagaagattgtggtggcagca gcagtgacagcttcctga >gi568815596r:42251222_42461161|GENSCAN_predicted_peptide_5|322_aa MMMEYKVFVVFGKILAAILLQCGSPRPRFCWDLFRIVVTWIIEAICIGWFTAECIVRFIV SKNKCEFVKRPLNIIDLLAITPYYISVLMTVFTGENSQLQRAGVTLRVLRMMRIFWVIKL ARHFIGLQTLGLTLKRCYREMVMLLVFICVAMAIFSALSQLLEHGLDLETSNKDFTSIPA ACWWVIISMTTVGYGDMYPITVPGRILGGVCVVSGIVLLALPITFIYHSFVQCYHELKFR SASGPPVEQLPPDPLTRWCFHPAGSTLCGPANSMAVASPGSRPAAPGGGFLRTEALVLIV AAGPVDGLNCENHPFRGGCKDF >gi568815596r:42251222_42461161|GENSCAN_predicted_CDS_5|969_bp atgatgatggaatacaaagtatttgtggtctttgggaaaatcctagcggccattttgctc caatgtgggagtcctcggcctcgtttctgctgggacctttttaggatagttgtgacctgg ataattgaagctatctgcataggttggttcactgccgagtgcatcgtgaggttcattgtc tccaaaaacaagtgtgagtttgtcaagagacccctgaacatcattgatttactggcaatc acgccgtattacatctctgtgttgatgacagtgtttacaggcgagaactctcaactccag agggctggagtcaccttgagggtacttagaatgatgaggattttttgggtgattaagctt gcccgtcacttcattggtcttcagacactcggtttgactctcaaacgttgctaccgagag atggttatgttacttgtcttcatttgtgttgccatggcaatctttagtgcactttctcag cttcttgaacatgggctggacctggaaacatccaacaaggactttaccagcattcctgct gcctgctggtgggtgattatctctatgactacagttggctatggagatatgtatcctatc acagtgcctggaagaattcttggaggagtttgtgttgtcagtggaattgttctattggca ttacctatcacttttatctaccatagctttgtgcagtgttatcatgagctcaagtttaga tctgctagtggcccaccggtggagcagctgcccccagaccccttgacccggtggtgcttc caccctgccggaagcaccttgtgtggccccgccaacagcatggcggttgcatccccagga agcaggcccgcagcgccaggagggggtttcctgaggacagaggcccttgtcctgattgtc gcagcaggccctgtcgatggacttaactgtgaaaatcaccctttcaggggtggatgcaag gacttctga