GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:06:54 Sequence gi568815574r:1090900_1312634 : 221735 bp : 48.39% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9037 9288 252 0 0 80 34 519 0.800 43.04 1.02 Intr + 9298 9710 413 0 2 -217 47 746 0.593 32.98 1.03 Term + 21783 21891 109 0 1 -16 44 144 0.164 -2.42 1.04 PlyA + 23402 23407 6 1.05 2.06 PlyA - 26129 26124 6 1.05 2.05 Term - 30705 30557 149 1 2 78 38 67 0.319 -1.24 2.04 Intr - 33967 33884 84 2 0 70 111 49 0.641 5.19 2.03 Intr - 36521 36384 138 0 0 110 30 110 0.580 7.84 2.02 Intr - 57199 56975 225 2 0 55 101 97 0.117 5.66 2.01 Init - 65357 65288 70 2 1 64 78 39 0.113 -0.29 2.00 Prom - 68244 68205 40 -4.66 3.00 Prom + 70605 70644 40 -5.06 3.01 Init + 81684 81761 78 2 0 83 52 104 0.403 7.36 3.02 Intr + 81839 81992 154 1 1 38 65 228 0.660 15.15 3.03 Term + 83899 84080 182 2 2 6 43 285 0.095 13.57 3.04 PlyA + 84795 84800 6 1.05 4.08 PlyA - 85349 85344 6 1.05 4.07 Term - 100261 99998 264 1 0 52 45 300 0.997 17.61 4.06 Intr - 102403 102319 85 0 1 121 68 74 0.950 8.52 4.05 Intr - 107825 107614 212 2 2 66 7 145 0.127 1.91 4.04 Intr - 111636 111503 134 1 2 80 99 198 0.972 20.46 4.03 Intr - 115700 115534 167 1 2 73 103 110 0.994 10.60 4.02 Intr - 118006 117907 100 2 1 69 111 119 0.558 11.47 4.01 Init - 121735 121657 79 1 1 80 91 154 0.999 14.12 4.00 Prom - 135059 135020 40 -4.06 5.02 PlyA - 136822 136817 6 1.05 5.01 Sngl - 173918 172791 1128 2 0 35 40 638 0.588 50.33 5.00 Prom - 175023 174984 40 -2.46 6.05 PlyA - 179046 179041 6 1.05 6.04 Term - 183171 183072 100 2 1 117 55 28 0.492 -0.00 6.03 Intr - 189873 189680 194 0 2 -8 -110 318 0.575 0.69 6.02 Intr - 190190 189922 269 0 2 -297 68 640 0.724 20.75 6.01 Init - 192172 192115 58 1 1 98 55 46 0.740 3.77 6.00 Prom - 193191 193152 40 -8.36 7.00 Prom + 193199 193238 40 -10.25 7.01 Init + 194935 195021 87 0 0 71 76 37 0.571 1.44 7.02 Intr + 197620 197743 124 0 1 73 115 64 0.999 7.76 7.03 Intr + 197860 197989 130 2 1 83 106 10 0.918 2.05 7.04 Intr + 199438 199610 173 1 2 102 66 22 0.641 0.99 7.05 Intr + 203429 203562 134 0 2 89 71 161 0.798 14.86 7.06 Intr + 204528 204557 30 2 0 107 119 -9 0.662 2.43 7.07 Intr + 205404 206816 1413 2 0 12 -18 545 0.172 25.88 7.08 Intr + 209592 209727 136 2 1 83 99 77 0.886 8.44 7.09 Intr + 213024 213120 97 1 1 76 86 114 0.806 9.07 7.10 Intr + 214547 214628 82 2 1 128 80 87 0.954 11.54 7.11 Intr + 217945 218052 108 0 0 62 61 86 0.900 3.78 7.12 Term + 218503 218580 78 0 0 90 53 101 0.988 4.56 7.13 PlyA + 219004 219009 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815574r:1090900_1312634|GENSCAN_predicted_peptide_1|257_aa MEEEEEKMEEKEEMAKEEMVEMEEEIEEKEEMVEIEEEEKMEEEQMEEMEREEEMEMEEE ELEEEQTEEMEMDKEEREEEEMEEMEEEEEEMQEIEEEEEELEEEQMEKEETEEIEMEEM VAEEEEMEEEEEEMQEIEEEQEEMEEEETKEVEMEEIEEEEMEEEQKEEEETEGIEMEEM EEEMEEEEEEMQEIEEEEEEMEEEEMEESGEIFLLETVLSFGKSYGFCKSGRSHRQKHPI RKNKATPPTSTSSELAV >gi568815574r:1090900_1312634|GENSCAN_predicted_CDS_1|774_bp atggaggaggaggaggagaagatggaggagaaggaggagatggcaaaggaggagatggtg gagatggaggaggagatagaggagaaggaggagatggtagagatagaggaggaggagaag atggaggaggagcagatggaggaaatggagagggaggaggagatggagatggaggaggag gagctggaggaagagcagacagaggagatggagatggacaaggaggagagggaggaggag gagatggaggagatggaagaggaggaggaagagatgcaggagatagaggaggaggaggag gagctggaggaagagcagatggagaaagaggagacagaggagatagagatggaagagatg gtggcggaggaggaggagatggaagaggaggaggaagagatgcaggagatagaggaagag caggaggagatggaggaggaggagacaaaggaggtagagatggaggagatagaggaggag gagatggaggaagagcagaaggaggaagaggagacagaggggatagagatggaggagatg gaggaggagatggaagaggaggaggaagagatgcaggagatagaggaggaggaggaggag atggaggaagaggagatggaggagagtggggagattttcctgctcgaaacagtgctcagc tttggcaaatcctatggattttgcaaaagtgggcgctcacatcggcagaagcatccgatt cggaaaaataaggcgacccctcctacaagcacctcctccgaactggccgtctag >gi568815574r:1090900_1312634|GENSCAN_predicted_peptide_2|221_aa MGWTWRLTSVIPALWEAEAGGSREASSVSFRPHEASCSETQPPLPTGGREEEERARHERE GPLGKAVFAALVKISPWCLSSAVRKLTGLWNVGGSVVAVTSAPPPACGCTFTYAYTCIDT STFIFTSTYIYTCTCTCTCTCVCTGLNKCNHNELMMNAKSKQQFRVCVFITVGRPSFFWG PLSQALYRPATNLPDTSPVLFAAGPECSWNFSMTQLRTELD >gi568815574r:1090900_1312634|GENSCAN_predicted_CDS_2|666_bp atgggctggacgtggaggctcacatctgtcatcccagcactttgggaggccgaggcggga ggatcacgagaagcatcttctgtgtctttccgcccccatgaagccagttgctcagaaacg cagccgccgctgcccacaggagggagggaagaggaggaacgggctcgacatgaaagagaa ggcccgttgggaaaggctgtttttgcagcactggtgaagataagcccctggtgtttgagt tccgctgtgagaaagctgaccgggctttggaacgtgggaggaagtgtggtggcagtgaca tctgcacctccgcctgcctgtggctgtacttttacgtatgcctatacctgtatcgacacc tccacctttatctttaccagcacctacatctatacctgtacctgtacctgtacctgtacc tgtgtctgcactggtttaaataaatgtaaccacaacgaactcatgatgaacgcaaagtca aaacagcagtttcgggtgtgtgtgttcatcacggtcggcagaccttccttcttctggggt cctttatcccaagccctttatagacctgcaaccaacttgcccgacacttcacctgtttta ttcgcagcaggtcctgaatgcagctggaatttcagcatgactcagctccgaacggaactc gactga >gi568815574r:1090900_1312634|GENSCAN_predicted_peptide_3|137_aa MAGEKVEKADTKEKKPEAKKADAGGERNWQAIPISPVYSGKAVYKRKYSATKSNVEKKKK EKVLATVTQPVGGDKDGVPPRRTHQKFVIATSTKIGISNVKIPKHLTDAYFKKQQLQKPR HQEGAIVDTEKEKYEIM >gi568815574r:1090900_1312634|GENSCAN_predicted_CDS_3|414_bp atggcaggtgaaaaagttgagaaggcagatactaaagagaagaaacctgaagccaagaag gctgatgctggtggcgagaggaattggcaggcaatccctatatctcccgtgtattccggg aaggccgtgtacaagaggaagtactcagccactaaatccaacgttgaaaagaaaaagaag gagaaggttcttgcgaccgttacacaaccagttggcggtgacaaggacggagtccctcca cgaagaacgcaccagaaatttgtcattgccacctcaaccaaaatcggtatcagcaatgta aaaatcccaaaacatcttactgatgcttacttcaagaagcagcagctgcagaagcccaga caccaggaaggtgccatcgtcgacacagaaaaagagaaatatgagattatgtag >gi568815574r:1090900_1312634|GENSCAN_predicted_peptide_4|346_aa MGRLVLLWGAAVFLLGGWMALGQGGAEGVQIQIIYFNLETVQVTWNASKYSRTNLTFHYR FNGDEAYDQCTNYLLQEGHTSGCLLDAEQRDDILYFSIRNGTHPVFTASRWMVYYLKPSS PKHVRFSWHQDAVTVTCSDLSYGDLLYEVQYRSPFDTEWQSKQENTCNVTIEGLDAEKCY SFWVRVKAMEDVYGPDTYPSDWSEVTCWQRGEIRGNACYAAVSHSLVTRLEVKKFLIPSV PDPKSIFPGLFEIHQGNFQEWITDTQNVAHLHKMAGAEQESGPEEPLVVQLAKTEAESPR MLDPQTEEKEASGGSLQLPHQPLQGGDVVTIGGFTFVMNDRSYVAL >gi568815574r:1090900_1312634|GENSCAN_predicted_CDS_4|1041_bp atggggcggctggttctgctgtggggagctgccgtctttctgctgggaggctggatggct ttggggcaaggaggagcagaaggagtacagattcagatcatctacttcaatttagaaacc gtgcaggtgacatggaatgccagcaaatactccaggaccaacctgactttccactacaga ttcaacggtgatgaggcctatgaccagtgcaccaactaccttctccaggaaggtcacact tcggggtgcctcctagacgcagagcagcgagacgacattctctatttctccatcaggaat gggacgcaccccgttttcaccgcaagtcgctggatggtttattacctgaaacccagttcc ccgaagcacgtgagattttcgtggcatcaggatgcagtgacggtgacgtgttctgacctg tcctacggggatctcctctatgaggttcagtaccggagccccttcgacaccgagtggcag tccaaacaggaaaatacctgcaacgtcaccatagaaggcttggatgccgagaagtgttac tctttctgggtcagggtgaaggctatggaggatgtatatgggccagacacatacccaagc gactggtcagaggtgacatgctggcagagaggcgagattcggggtaatgcttgttacgcg gcagtgtcccatagccttgtcaccaggctggaagtgaagaagtttctcattcccagcgtg ccagacccgaaatccatcttccccgggctctttgagatacaccaagggaacttccaggag tggatcacagacacccagaacgtggcccacctccacaagatggcaggtgcagagcaagaa agtggccccgaggagcccctggtagtccagttggccaagactgaagccgagtctcccagg atgctggacccacagaccgaggagaaagaggcctctgggggatccctccagcttccccac cagcccctccaaggcggtgatgtggtcacaatcgggggcttcacctttgtgatgaatgac cgctcctacgtggcgttgtga >gi568815574r:1090900_1312634|GENSCAN_predicted_peptide_5|375_aa MAHTLFLTFTIHVEKFVFPRAKSDDSPRLLEVGTIMGLFVFNESCPVVDINVKKFIFPRI QPDGSARLLELGIMVDLFVKGSYLLVDIDVEKFIFPRAQPDGSARLLELGIMVDLFVKGS YLLVDIDVEKFIFPRAQPDGSARLLELGIMVDLFVKGSYLLVDIDVEKFIFPRAQPDGSA RLLELGITVDLFVKGSYLLVDIDVEKLIFPRAQPDGSARLLELGITVDLFVKGSYLLVDI DVEKFIFPRAQPDGSPRLLELGIMVDLFVKGSYLLVDIDVEKFIFPRAQPDGSPRLLELG IMVDILVKGSYLPFDIDVEKFIFPRAQPDGSPRLLELALLYYFLSMAHTLFLSFTINVEN FKFPSSTRWHTRCTP >gi568815574r:1090900_1312634|GENSCAN_predicted_CDS_5|1128_bp atggctcataccctgtttctgactttcaccattcatgtagagaagtttgtctttccaaga gccaaatcagatgactcaccaaggctgctggaagtgggcacaataatgggcctttttgtc tttaatgaatcatgccctgttgttgacatcaatgtaaagaagtttatcttcccaagaatt caaccagatggctcagccaggctgctggaactgggcatcatggtggatctttttgtcaaa ggctcatatcttcttgttgacattgatgtagagaagtttatcttcccaagagctcaacca gatggctcagccaggctgctggaactgggcatcatggtggatctttttgtcaaaggctca tatcttcttgttgacattgatgtagagaagtttatcttcccaagagctcaaccagatggc tcagccaggctgctggaactgggcatcatggtggatctttttgtcaaaggctcatatctt cttgttgacattgatgtagagaagtttatcttcccaagagctcaaccagatggctcagcc aggctgctggaactgggcatcacggtggatctttttgtcaaaggctcatatcttcttgtt gacattgatgtagagaagcttatcttcccaagagctcaaccagatggctcagccaggctg ctggaactgggcatcacggtggatctttttgtcaaaggctcatatcttcttgttgacatt gatgtagagaagtttatcttcccaagagctcaaccagatggctcacccaggctgctggaa ctgggcatcatggtggatctttttgtcaaaggctcatatcttcttgttgacattgatgta gagaagtttatcttcccaagagctcaaccagatggctcacccaggctgctggaactgggc atcatggtggatattcttgtcaaaggctcatatcttccttttgacattgatgtagagaag tttatcttcccaagagctcaaccagacggttcaccaaggctgctggaactggcactgtta tactacttcttgtcaatggctcacaccttgtttctgagtttcaccatcaatgtggagaac tttaaattcccgagttcaaccagatggcacaccagatgcacaccgtga >gi568815574r:1090900_1312634|GENSCAN_predicted_peptide_6|206_aa MGPHPGPITTWMVVGLNCGGGEGGGGEGGGEGEQEENEEEEKEEEEKEEEEKEEEEKQEE EKEEEEKQEEEEEKEEEEEKEQEEEEEKEQEKEEEEEEKERKEEKRSYSPIRKKKKKEEE KEEEEEEEGEREEEERREEEGGGGRGGRGGRGEKEEEEEKEEELQTQPIVNQKIMSFLSP FWKFSADPKIEAAKLPSQIQSKNDGE >gi568815574r:1090900_1312634|GENSCAN_predicted_CDS_6|621_bp atgggtcctcatcctggccccatcactacctggatggtcgttgggttaaactgtggagga ggagaaggaggaggaggagaaggaggaggagaaggggagcaggaggagaatgaggaggag gagaaggaggaggaggagaaggaggaggaggagaaggaggaggaggagaagcaggaggag gagaaggaggaggaggagaagcaggaggaggaggaggagaaggaggaggaggaggagaag gagcaggaggaggaggaggagaaggagcaggagaaggaggaggaggaggaggagaaggag aggaaggaggagaaaagaagctactcgcccataaggaagaagaagaaaaaggaggaggag aaggaggaggaagaagaagaagaaggggaaagagaagaagaagaaagaagagaagaagaa ggaggaggaggaagaggaggaagaggaggaaggggggagaaggaggaggaagaagagaag gaggaggagctacaaactcaaccaattgtcaaccagaaaataatgagttttctttcccct ttctggaaattttcggctgatcctaaaatagaggccgccaagcttccttctcagattcaa agcaagaatgatggagagtga >gi568815574r:1090900_1312634|GENSCAN_predicted_peptide_7|863_aa MNLSWDCQENTTFSKCFLTDKKNRVVEPRLSNNECSCTFREICLHEGVTFEVHVNTSQRG FQQKLLYPNSGREGTAAQNFSCFIYNADLMNCTWARGPTAPRDVQYFLYIRNSKRRREIR CPYYIQDSGTHVGCHLDNLSGLTSRNYFLVNGTSREIGIQFFDSLLDTKKIERFNPPSNV TVRCNTTHCLVRWKQPRTYQKLSYLDFQYQLDVHRKNTQPGTENLLSPTHDPYTLLPMTP GGTLQSPTHDPYSLLPMTPGRTLQSPTHDPYTLLPMTPGRTVQSPTHDPYSLLPMTPGGT VQSPTHDPYSLLPMTPGGTVQSPTHDPYTLLPMTPGGTLQSPTHDPYSLLPMTPGGTLQS PTHDPYSLLPMTPGGTVQSPTHDPYSLLPMTPGGTLQSPTHDPYSLLPMTPGRTLQSPTH DPYTLLPMTPGGTVQSPTHDPYSLLPMTPGGTVQSPTHDPYTLLPKTPGGTLQSPTHDPY TFLPMTPGGTLQSPTHDPYTLLPMTPGGTLQSPTHDPYSLLHMTPGGTLQSPTHDPYSLL PMTPGGTLQSPTHDPYSLLPMTPGGTVQSPTHDPYSLLPMTPGGTLQSPTHDPYSLLPMT PGRTLQSPTHDPYTLLPITPGGTVQSPTHDPYSLLPMTPGGTLQSPTHDPYSLLPMTPGG TLQSPTHDPYSLLLMTPGVTLQSPTHDPYSLLLMTPGINVSGDLENRYNFPSSEPRAKHS VKIRAADVRILNWSSWSEAIEFGSDDGNLGSVYIYVLLIVGTLVCGIVLGFLFKRFLRIQ RLFPPVPQIKDKLNDNHEVEDEVSCSQALPKTTLFIVNYEVPWVDLSQIFTPGKTHGQII WEEFTPEEGKGYREEVLTVKEIT >gi568815574r:1090900_1312634|GENSCAN_predicted_CDS_7|2592_bp atgaatttaagctgggactgccaagaaaacacaaccttcagcaagtgtttcttaactgac aagaagaacagagtcgtggaacccaggctcagtaacaacgaatgttcgtgcacatttcgt gaaatttgtctgcatgaaggagtcacatttgaggttcacgtgaatactagtcaaagagga tttcaacagaaactgctttatccaaattcaggaagggagggtaccgctgctcagaatttc tcctgtttcatctacaatgcggatttaatgaactgtacctgggcgaggggtccgacggcc ccccgtgacgtccagtattttttgtacatacgaaactcaaagagaaggagggagatccgg tgtccttattacatacaagactcaggaacccatgtgggatgtcacctggataacctgtca ggattaacgtctcgcaattactttctggttaacggaaccagccgagaaattggcatccaa ttctttgattcacttttggacacaaagaaaatagaacgattcaaccctcccagcaatgtc accgtacgttgcaacacgacgcactgcctcgtacggtggaaacagcccaggacctatcag aagctgtcgtacctggactttcagtaccagctggacgtccacagaaagaatacccagcct ggcacggaaaacctactgtcccctactcacgacccctacactctcctacccatgacccct ggcggaaccctacagtcccctactcacgacccgtacagtctcctacccatgacccctggc agaaccctacagtcccctactcacgacccctacactctcctacccatgacccctggcaga accgtacagtcccctactcacgacccctacagtctcctacccatgacccctggcggaact gtacagtcccctactcacgacccctacagtctcctacccatgacccctggcggaactgta cagtcccctactcacgacccctacactctcctacccatgacccctggcggaaccctacag tcccctactcacgacccctacagtctcctacccatgacccctggaggaaccctacagtcc cctactcacgacccctacagtctcctacccatgacccctggcggaactgtacagtcccct actcacgacccctacagtctcctacccatgacccctggcggaaccctacagtcccctact cacgacccgtacagtctcctacccatgacccctggcagaaccctacagtcccctactcac gacccctacactctcctacccatgacccctggcggaaccgtacagtcccctactcacgac ccctacagtctcctacccatgacccctggcggaaccgtacagtcccctactcacgacccc tacactctcctacccaagacccctggcggaaccctacagtcccctactcacgacccctac actttcctacccatgacccctggcggaaccctacagtcccctactcacgacccctacact ctcctacccatgacccctggcggaaccctacagtcccctactcacgacccctacagtctc ctacacatgacccctggcggaaccctacagtcccctactcacgacccctacagtctccta cccatgacccctggaggaaccctacagtcccctactcacgacccctacagtctcctaccc atgacccctggcggaactgtacagtcccctactcacgacccctacagtctcctacccatg acccctggcggaaccctacagtcccctactcacgacccgtacagtctcctacccatgacc cctggcagaaccctacagtcccctactcacgacccctacactctcctacccataacccct ggcggaaccgtacagtcccctactcacgacccctacagtctcctacccatgacccctggc ggaaccctacagtcccctactcacgacccctacagtctcctacccatgacccctggagga accctacagtcccctactcacgacccctacagtctcctactcatgacccctggagtaacc ctacagtcccctactcacgacccctacagtctcctactcatgacccctggaattaatgtt tctggtgatttggaaaatagatacaactttccaagctctgagcccagagcaaaacacagt gtgaagatcagagctgcagacgtccgcatcttgaattggagctcctggagtgaagccatt gaatttggttctgacgacgggaacctcggctctgtgtacatttatgtgctcctaatcgtg ggaacccttgtctgtggcatcgtcctcggcttcctctttaaaaggttccttaggatacag cggctgttcccgccagttccacagatcaaagacaaactgaatgataaccatgaggtggaa gacgaggtttcatgctcacaagctcttccaaagaccaccctgttcatcgtcaactatgag gtcccatgggtggatctgtctcagatctttaccccaggaaaaacccatggccagatcatc tgggaggaattcaccccagaggaagggaaaggctaccgcgaagaggtcttgaccgtgaag gaaattacctga