GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:36:35 Sequence gi568815592r:146702798_146903709 : 200912 bp : 37.35% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 14086 14272 187 2 1 88 89 111 0.786 9.03 1.02 Intr + 16314 16419 106 0 1 68 83 37 0.810 0.50 1.03 Intr + 18606 18708 103 2 1 71 94 79 0.989 5.63 1.04 Intr + 21389 21530 142 0 1 60 51 125 0.874 4.49 1.05 Intr + 23286 23400 115 0 1 88 100 87 0.822 9.43 1.06 Intr + 25777 25944 168 0 0 65 98 170 0.995 14.92 1.07 Intr + 26342 26457 116 1 2 69 80 55 0.294 1.13 1.08 Intr + 28653 28690 38 0 2 73 105 -22 0.217 -5.01 1.09 Intr + 31096 31233 138 2 0 78 80 99 0.954 7.61 1.10 Intr + 33701 33794 94 0 1 72 98 52 0.948 2.70 1.11 Intr + 37662 37796 135 0 0 81 94 66 0.903 5.26 1.12 Intr + 38321 38474 154 2 1 24 100 56 0.697 -0.45 1.13 Intr + 43125 43312 188 2 2 83 87 73 0.547 4.27 1.14 Intr + 49733 49917 185 2 2 130 84 117 0.729 14.01 1.15 Intr + 61104 61303 200 1 2 98 76 137 0.704 11.65 1.16 Intr + 66223 66334 112 0 1 89 95 50 0.577 4.83 1.17 Intr + 79223 79395 173 0 2 100 94 110 0.997 11.54 1.18 Intr + 81821 81997 177 1 0 28 36 204 0.997 8.39 1.19 Intr + 82813 82915 103 0 1 90 113 92 0.999 10.73 1.20 Intr + 85592 85813 222 0 0 16 88 211 0.470 11.08 1.21 Intr + 98386 98482 97 2 1 49 123 151 0.943 12.85 1.22 Intr + 99031 99214 184 1 1 18 109 223 0.998 16.27 1.23 Term + 104605 104790 186 0 0 36 41 230 0.927 9.61 1.24 PlyA + 104988 104993 6 1.05 2.00 Prom + 109431 109470 40 -4.65 2.01 Init + 109805 109819 15 1 0 88 66 15 0.909 -0.45 2.02 Term + 112235 112420 186 1 0 37 41 246 0.964 11.31 2.03 PlyA + 112618 112623 6 1.05 3.05 PlyA - 113145 113140 6 1.05 3.04 Term - 117715 117522 194 2 2 31 53 183 0.237 5.70 3.03 Intr - 121213 121089 125 0 2 46 92 77 0.016 3.21 3.02 Intr - 138336 138256 81 2 0 65 99 70 0.011 3.63 3.01 Init - 138503 138478 26 2 2 64 98 -1 0.011 -2.87 3.00 Prom - 139563 139524 40 -5.05 4.11 PlyA - 139700 139695 6 1.05 4.10 Term - 142640 142173 468 2 0 32 42 317 0.924 15.49 4.09 Intr - 142985 142802 184 1 1 46 85 160 0.383 10.37 4.08 Intr - 151012 150934 79 2 1 78 75 27 0.250 -1.91 4.07 Intr - 151173 151095 79 0 1 60 93 135 0.168 9.41 4.06 Intr - 152619 152453 167 1 2 35 67 31 0.002 -5.54 4.05 Intr - 155079 154881 199 0 1 48 48 168 0.005 6.80 4.04 Intr - 157232 157210 23 0 2 69 98 2 0.004 -4.36 4.03 Intr - 158097 157947 151 0 1 115 41 102 0.055 7.01 4.02 Intr - 161805 161673 133 2 1 57 82 68 0.338 2.83 4.01 Init - 175364 175318 47 2 2 78 115 14 0.298 3.31 4.00 Prom - 181926 181887 40 -4.25 5.00 Prom + 189098 189137 40 -3.05 5.01 Init + 191796 191966 171 2 0 82 73 127 0.840 10.09 5.02 Term + 193243 193464 222 0 0 24 33 146 0.495 -1.37 5.03 PlyA + 200190 200195 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 130961 131112 152 0 2 89 53 88 0.952 2.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:146702798_146903709|GENSCAN_predicted_peptide_1|1107_aa XTDEQTDFGLGDAHQSDGLNLEREIVSQTTATQEKSQEELPTTNNSVSKEIWLDFEDFCV CFHKPCERDTITIILHLRKMKQIKCEITYVKHLEECLLFSEERVSYYLFVDSLKPIELLV CFSALVRWGEYGALTKDSPPIEPGLLTAETFSWKSLKPGSLVLKIHTYATKATVVRLPVG RHMLLFNAYSPVGHSIHICSMVSFVIGDEHVVLPNFEPESCRFTEQSLLIMKAIGNVIAN FKDKGKLSAALKDLQTAHYPVPFHDKELTAQHFRAEQHSSFTGSLEGVKQRDFLVTPSQG TYGPVHCSCTKTRWPCFAASSSCLPVEWLDVKYCMPTSDKEYSAEEVAAAIKIQAMWRGT YVRLLMKARIPDTKENISVADTLQKVWAVLEMNLEQYAVSLLRLMFKSKCKSLESYPCYQ DEETKIAFADYTVTYQEQPPNSWFIVFRETFLVHQDMILVPKVYTTLPICILHIVNNDTM EQVPKVFQKVVPYLYTKNKKGYTFVAEAFTGDTYVAASRWKLRLIGSSAPLPCLSRDSPC NSFAIKEIRDYYIPNDKKILFRYSVKVLTPQPATIQVRTSKPDAFIKLQVLENEETMVSS TGKGQAIIPAFHFLKSEKGLSSQSSKHILSFHSASKKEQEVYVKKKAAQGIQKSPKGRAV SAIQDIGLPLVEEETTSTPTREDSSSTPLQNYKYIIQCSVLYNSWPLTESQLTFVQALKD LKKSNTKAYGERHEELINLGSPDSHTISEGQKSSVTSKTTRKGKEKSSEKEKTAKEKQAP RFEPQISTVHPQQEDPNKPYWILRLVTEHNESELFEVKKDTERADEIRAMKQAWETTEPG RAIKASQARLHYLSGFIKKTSDAESPPISESQTKPKEEVETAARGVKEPNSKNSAGSESK EMTQTGSGSAVWKKWQLTKGLRDVAKSTSSESGGVSSPGKEEREQSTRKENIQTGPRTRS PTILETSPRLIRKALEFMDLSQYVRKTDTDPLLQTDELNQQQAMQKAEEIHQFRQHRTRV LSIRNIDQEERLKLKDEVLDMYKEMQDSLDEARQKIFDIREEYRNKLLEAEHLKLEALSA QEAAMKLETEKMTPAPDTQKKKKGKKK >gi568815592r:146702798_146903709|GENSCAN_predicted_CDS_1|3324_bp ngcacagatgaacaaacagactttggattgggtgatgctcatcagagtgatggattaaac ttggaaagagagatagtcagccagaccacagcaacacaggaaaagtcacaggaagaactt ccaacaacaaataatagtgtttctaaagaaatatggttagattttgaagatttctgtgta tgctttcacaaaccctgtgaaagagatactattaccatcattttacatctaaggaaaatg aagcagatcaaatgtgaaataacatatgtaaaacacttagaagaatgcctattattctca gaagaacgagtgtcctactatctatttgtagatagtctaaaacctattgaactactggtt tgcttttctgcattggtacgctggggggagtatggagccttaacaaaagacagtcctccc atagagcctggacttctcacagctgaaacgttttcttggaaatccctgaaaccaggcagt cttgttctgaagattcacacatatgctaccaaggctacagtggttcgtctgcctgttggg agacacatgctactcttcaacgcatactccccagtaggacactccatacacatctgcagc atggtgtcatttgtcattggggatgaacacgttgtactgcccaactttgaaccagagagc tgccgatttacggaacagtctctgttgattatgaaagctattggaaatgtgattgctaat ttcaaagataagggtaaactctctgcagctttgaaggatctgcaaacagctcactaccct gtccccttccatgataaagaactaactgcacagcacttcagggctgagcagcattctagc ttcacaggcagccttgagggggtgaagcaaagagacttcctggtaaccccttcacaaggc acatatggacctgtccactgcagctgcactaaaaccagatggccttgttttgcagccagt tctagttgtttacctgtggaatggctggacgttaaatattgtatgcccacaagtgataaa gagtattctgctgaggaagtagcagcagcaattaaaattcaagccatgtggagaggaact tacgttagattgcttatgaaagccagaataccagacacaaaagaaaatatcagtgttgca gatactcttcaaaaagtttgggctgtattggaaatgaatttagaacagtatgcagtttct ctcttaagactaatgtttaaaagcaagtgcaagtctttggaatcttatccatgctatcaa gatgaagaaactaagattgcttttgcagattatactgtgacttatcaagaacagccacca aattcttggtttatagtattcagagaaacatttttggttcatcaagacatgattttggtt cccaaagtatatactacacttccaatctgtatcctacacattgttaataatgacacaatg gagcaagtgccaaaggtgttccaaaaagtggtgccttatctttataccaagaataagaag ggatacacttttgtggcggaagcatttacaggcgacacatatgtagcagcctcacgatgg aaactgcgtctcatcggttcttctgctccactgccatgcctctctcgagactctccatgc aattcctttgccataaaggaaatccgagattactacatacccaatgataagaaaatttta ttcaggtattcggttaaagttctaacaccacaacctgctacaatacaggtacgcacatcc aaaccagatgcattcatcaagctgcaggtcctagaaaatgaagaaactatggtgagctcc actggaaaaggccaagctataatcccagcatttcacttcttgaagagtgagaaaggtttg agctcccagtctagcaagcacattctttcatttcactctgcatccaagaaagagcaagaa gtgtatgttaagaagaaagctgctcagggaattcagaaatcccccaagggtagagctgta agtgcaatacaagacattggtctaccccttgtggaggaggaaactaccagtacacccact agagaagacagttccagcacaccactgcagaattacaagtatattatacagtgttcggtg ttgtataacagttggcctctcactgaaagccagctgacatttgttcaagcactgaaagac ttaaagaaaagtaataccaaagcttatggtgaaagacacgaggagttaattaacttagga agcccagactcccacactattagtgagggacaaaaatcttcagtaacttccaaaacaaca aggaaaggcaaagaaaagtcttctgagaaagaaaagacagccaaagaaaaacaagcacct cgctttgagcctcagatatccactgttcaccctcaacaagaagacccaaataaaccctac tggattttgaggttggtcactgaacacaatgaatcagaattatttgaagtgaaaaaggat acagaaagggcagatgaaatccgagccatgaaacaagcctgggagacaactgagccagga agagcaatcaaggcttctcaggctcgtttgcattaccttagcgggttcattaagaaaaca tctgatgctgagagtccgcctatatctgaaagccaaactaaaccaaaagaagaagtagaa acagctgcacgtggcgtaaaagaaccaaactcaaagaattctgcaggttcagagagcaaa gagatgacacaaacaggatcagggagtgcggtgtggaagaagtggcaattgaccaaaggc ttgagggatgtggcaaaatccacgagtagcgaaagtggaggagtgtcttcaccagggaaa gaagagcgcgagcagagcacacggaaggaaaacattcaaacaggacctcgtacacgatct ccaacaattttggaaacatctccacgacttattcgaaaagcactagaatttatggattta agtcaatatgttcggaaaacagatacagatcctctgctgcaaacagatgaattgaatcag cagcaggcaatgcaaaaggcggaagaaattcatcagtttcgacagcataggaccagagtc cttagcattcgaaacattgaccaagaagagcggttgaagttaaaggatgaagtcctggat atgtataaggaaatgcaggactccttagatgaagcccgacagaaaattttcgacatccgg gaagagtacagaaacaaattgctggaagctgagcacctaaagctggaagctctctctgct caggaagccgccatgaagctggagacagaaaagatgaccccagctcctgacacacagaaa aaaaagaaaggaaagaaaaagtaa >gi568815592r:146702798_146903709|GENSCAN_predicted_peptide_2|66_aa MQTSQDSLDEARQKIFDIREEYRNKLLEAEHLKLETLAAQEAAMKLETEKMTPAPDTQKK KKGKKK >gi568815592r:146702798_146903709|GENSCAN_predicted_CDS_2|201_bp atgcagacatctcaggactccttagatgaagcccgacagaaaattttcgacatccgggaa gagtacagaaacaaattgctggaagctgagcacctaaagctggaaactctggctgctcag gaagcagccatgaagctggagacagaaaagatgaccccagctcctgacacacagaaaaaa aagaaaggaaagaaaaagtaa >gi568815592r:146702798_146903709|GENSCAN_predicted_peptide_3|141_aa MPLLFWTPREEETGSRECFNVETLNSFHTSAFWGHRDYFYHLKGKPMSSEQEHCFLCPYT SPEKGQLDLEVLEGLKRVEDLGNHRTQVQAITQVSWGTVSSPRAAQVGHHWLGVPGLQSD REKENPASVVVAQIKTISRII >gi568815592r:146702798_146903709|GENSCAN_predicted_CDS_3|426_bp atgcctctcttgttttggactccaagggaagaagagactgggtcacgggaatgctttaat gtggaaaccctgaattcgttccacacatctgccttctggggacacagggattatttctat catctcaaaggcaagcctatgtcttcagaacaagaacactgttttctctgcccttatacc agcccagagaagggacagcttgatcttgaagtacttgaagggctgaaaagagtagaagat cttgggaaccacagaacacaagtacaagctataacacaggtgagctggggcacagtatca tccccgagagcggcccaggtagggcatcactggctgggggtccctggcttgcaaagcgac cgagaaaaagaaaatcctgcctctgtagttgtggcccaaataaagacaatcagcaggatt atctga >gi568815592r:146702798_146903709|GENSCAN_predicted_peptide_4|509_aa MGKIITNKGLLRPFICVTGSWFMLSLKFDLLNISRKTLSGEYRSSEQLSTDFVIRLPFQK RPVHYEGSAVLASLQAINGSCPAVAQTPGHSPQGQKPVEGAVIVIYFSLAGSCCFHMQLP AARSASQPETQLSSFQDFMGRMTIPESPCSPLGKWATWAPDVRVFSGLRPASASLAREPA PSSLLYGKRLASKISVSKKGTNRKMKDSVLIRILQRDRTNKIDVYMNGSLLRSIDSHDHK MVKALLSDVGCESLVVLDLQLPGSGKVAMDYMSIFPKNHIWKPKSPQYDGICRDLWNFEL ERDDLGYLAEEIFKQQSIQEEAEHKSLKNLLVDNAVVKKNPFSGEKFKPAAEICPGDLVP CIPDTPAVAKSGQVTACAVASEGVDPKPWQLTHGVGPVAAQKSRIKLWKRPPRFLRMYGN TCMSRQMCAAGAEPSRRTSARAVWKGDVGLEPPHRVTTGVRPSRAVRRGSLSSRSQNGRS IDSLHCAPRKAIDTQCHSETEARMGSYTL >gi568815592r:146702798_146903709|GENSCAN_predicted_CDS_4|1530_bp atggggaaaataattactaataaaggcctactacggccttttatatgtgtaactggatct tggttcatgctgagcttaaaatttgatcttctgaatatcagcagaaagactctttctggt gaatacagatcttctgagcaattatcgactgactttgtcattagactgccctttcagaag aggccagttcactatgagggctctgctgtccttgccagccttcaagcaatcaacggctct tgtccagcagtggctcaaactcctggccattcaccccagggacagaagccagtggaggga gcagtcattgttatttacttttcattggcaggttcctgctgctttcacatgcagctgcca gctgcacgcagcgcaagtcaaccagaaactcaactaagcagtttccaggatttcatggga aggatgacgattcctgagagtccttgctcgcctctggggaaatgggccacttgggcccca gatgtgcgggtcttttccggactcaggcctgcgtcagcatccttggcccgcgagcctgcg ccctcatcgttgctatatgggaagagactggcttctaaaataagtgtaagcaagaaggga accaacagaaaaatgaaggacagtgtattaatcaggattctccagagggacaggactaat aagatagatgtatatatgaatgggagtttattaaggagtattgactcacatgatcacaag atggtcaaagctctcctgtcggatgttggttgtgaatctttggtggtccttgacctgcag ctccctggaagtgggaaagttgctatggactacatgtctattttccccaagaatcatata tggaagcctaaatctccccagtatgatggtatctgcagagatctatggaactttgaactt gagagagatgatttagggtatctggcagaagaaatttttaagcagcaaagcattcaagag gaagcagagcataaaagtttgaaaaatttgctggtagacaatgcagtagtaaagaaaaac ccattttctggggagaaattcaagccagcagcagaaatttgccctggggacttggtgccc tgcatcccagacactccagctgtggctaaaagcggccaagttacagcttgtgctgtggct tcagagggtgtggaccccaagccttggcagcttacacatggtgttgggcctgtggctgca cagaagtcaagaattaagctttggaaacgtccacctagatttctgaggatgtatggaaac acctgcatgtctaggcagatgtgtgctgccggggcagagccctcaaggaggacctctgcg agggcagtatggaagggagatgtgggattggagcccccacacagagtcaccactggggta cggcctagtagagctgtgagaagagggtcactgtcttccagatcccagaatggtagatcc attgacagcctgcactgtgcacctagaaaagccatagacactcaatgccattctgagaca gaagccaggatggggagctataccctgtaa >gi568815592r:146702798_146903709|GENSCAN_predicted_peptide_5|130_aa MGIMGSNPIRKKKEFDEQKDRGLVEGLNFLTGWMNAICGPHRLRKLRMQDTDQRMALNIS SREYYKQLACERKYIKEPCKKQRKVGSSEGVHNAVNILKVSCQGIRVLNLGEKHSVLSQC PMKVQSSTSS >gi568815592r:146702798_146903709|GENSCAN_predicted_CDS_5|393_bp atgggtatcatgggatctaaccccattagaaagaagaaggagtttgatgaacagaaagac agaggactagtggagggactgaacttcttgacagggtggatgaatgcaatctgtggacct catagactgagaaagctaagaatgcaggacactgatcagagaatggccttgaacatttcc tcacgtgagtattataagcaacttgcatgcgaaagaaaatacatcaaagagccatgcaag aagcaaagaaaagtaggaagctcagaaggtgtccataatgctgtaaatattttaaaggtc agctgccagggtattcgtgttttgaatctaggggaaaaacacagtgtgttgtcacagtgc cccatgaaagtgcaaagcagcacgtcttcctag